Explore the science of confidence with Lucy Antrobus, as she unveils neuroscience-backed strategies to build and boost confidence through practice, positive energy, and the power of laughter. An essential listen for fostering unshakable self-assurance. Additional materials: www.superdatascience.com/770 Interested in sponsoring a SuperDataScience Podcast episode? Visit passionfroot.me/superdatascience for sponsorship information.
Generative AI in medicine takes center stage as Prof. Zachary Lipton, Chief Scientific Officer at Abridge, joins host Jon Krohn to discuss the significant advancements in AI that are reshaping healthcare. This episode is brought to you by the DataConnect Conference (https://www.dataconnectconf.com/dccwest/conference), and by Data Universe, the out-of-this-world data conference (https://datauniverse2024.com). Interested in sponsoring a SuperDataScience Podcast episode? Visit passionfroot.me/superdatascience for sponsorship information. In this episode you will learn: • The inspiration for Zack to get started in ML and healthcare [03:56] • The hardware required to use Abridge [12:29] • The key data science projects at Abridge right now [35:05] • Abridge's tech stack [59:54] • How Abridge ensures reliability in a high-stakes setting like healthcare [1:07:29] • How Zack’s academic research cross-pollinates with his commercial ML projects [1:21:05] • How Zack’s jazz background molded his entrepreneurial and data science journey [1:30:32] Additional materials: www.superdatascience.com/769
Claude 3, LLMs and testing ML performance: Jon Krohn tests out Anthropic’s new model family, Claude 3, which includes the Haiku, Sonnet and Opus models (written in order of their performance power, from least to greatest). Can it stand shoulder to shoulder with other models such as GPT-4 and Gemini 1.0 Ultra? And how important is it for machine learning practitioners to try out these models with their own benchmarks? Jon walks listeners through a test of his own in this Five-Minute Friday. Additional materials: www.superdatascience.com/768 Interested in sponsoring a SuperDataScience Podcast episode? Visit passionfroot.me/superdatascience for sponsorship information.
Jon Krohn sits down with Sebastian Raschka to discuss his latest book, Machine Learning Q and AI, the open-source libraries developed by Lightning AI, how to exploit the greatest opportunities for LLM development, and what’s on the horizon for LLMs. This episode is brought to you by the DataConnect Conference (https://www.dataconnectconf.com/dccwest/conference), and by Data Universe, the out-of-this-world data conference (https://datauniverse2024.com). Interested in sponsoring a SuperDataScience Podcast episode? Visit passionfroot.me/superdatascience for sponsorship information. In this episode you will learn: • All about Machine Learning Q and AI [04:13] • Sebastian Raschka’s role as Staff Research Engineer at Lightning AI [19:21] • PyTorch Lightning’s and Lightning Fabric’s capabilities [39:32] • Large language models: Opportunities and challenges [43:35] • DoRA vs LoRA [48:56] • How to be a successful AI educator [1:34:18] Additional materials: www.superdatascience.com/767
Kurt Vonnegut's "Player Piano" delivers striking parallels between its dystopian vision and today's AI challenges. This week, Jon Krohn explores the novel's depiction of a world where humans are marginalized by machines, reflecting on the impact of automation on society and the ethical considerations it raises. Tune in as we unpack the timeless relevance of Vonnegut's work to the AI era. Additional materials: www.superdatascience.com/766 Interested in sponsoring a SuperDataScience Podcast episode? Visit passionfroot.me/superdatascience for sponsorship information.
Explore the origins of NumPy and SciPy with their creator, Dr. Travis Oliphant. Discover the journey from personal need to global impact, the challenges overcome, and the future of these essential Python libraries in scientific computing and data science. This episode is brought to you by the DataConnect Conference (https://www.dataconnectconf.com/dccwest/conference), by Data Universe, the out-of-this-world data conference (https://datauniverse2024.com), and by CloudWolf (https://www.cloudwolf.com/sds), the Cloud Skills platform. Interested in sponsoring a SuperDataScience Podcast episode? Visit passionfroot.me/superdatascience for sponsorship information. In this episode you will learn: • Travis's journey to creating NumPy and SciPy [08:05] • How Anaconda got started [42:24] • How Numba, a high-performance Python compiler, was brought to market [54:48] • Python's influence on the thought processes of scientists and engineers [1:04:21] • The commercial projects that support Travis’s vast open-source efforts and communities [1:10:22] • How to get involved in Travis's commercial projects and communities [1:22:34] • The future of scientific computing and Python libraries [1:29:50] Additional materials: www.superdatascience.com/765
Data science futurists, bestselling authors, and lively how-to guides from the industry’s top practitioners, which range from applying data science for good to using open-source tools for NLP: This is The Super Data Science Podcast’s top ten most listened-to episodes in 2023, hosted by Jon Krohn. A great snapshot of our content from 2023. Additional materials: www.superdatascience.com/764 Interested in sponsoring a SuperDataScience Podcast episode? Visit passionfroot.me/superdatascience for sponsorship information.
At Glasswing Ventures, Rudina Seseri wants to be able to answer the question: What has Glasswing Ventures done for the company beyond capital investment? She speaks to Jon Krohn about how her company uses data to assess venture capital investments, the secret sauce of successful AI startups, and why she feels generative AI is only the start of a much broader impact that AI will make in communities and businesses. This episode is brought to you by the DataConnect Conference (https://www.dataconnectconf.com/dccwest/conference), and by Ready Tensor, where innovation meets reproducibility (https://www.readytensor.ai/). Interested in sponsoring a SuperDataScience Podcast episode? Visit https://passionfroot.me/superdatascience for sponsorship information. In this episode you will learn: • Potential interest areas for Series A AI venture capitalists [12:22] • How Glasswing’s AI Palette helps AI startups [23:06] • How data driven the venture capital industry is [27:21] • Advice for adopting services from AI providers [47:21] • Model collapse: Causes and concerns [58:44] • Glasswing’s checklist for AI startups [1:04:59] Additional materials: www.superdatascience.com/763
Jon Krohn presents an insightful overview of Google's groundbreaking Gemini Pro 1.5, a million-token LLM that's transforming the landscape of AI. Discover the innovative aspects of Gemini Pro 1.5, from its extensive context window to its multimodal functionalities, which are broadening the scope of AI technology and marking a significant leap in data science. Plus, join Jon for a practical demonstration, showcasing the real-world applications, capabilities, and limitations of this advanced language model. Additional materials: www.superdatascience.com/762 Interested in sponsoring a SuperDataScience Podcast episode? Visit passionfroot.me/superdatascience for sponsorship information.
Google's Gemini Ultra takes the spotlight this week, as host Jon Krohn welcomes Lisa Cohen, Google's Director of Data Science and Engineering, for a conversation about the launch of Gemini Ultra. Discover the capabilities of this cutting-edge large language model and how it stands toe-to-toe with GPT-4. Lisa shares her insights on the development, rollout, and potential of Gemini Ultra in reshaping various sectors. Whether you're a data science professional, tech enthusiast, or curious about the future of AI, this episode offers a deep dive into one of the most significant advancements in artificial intelligence. This episode is brought to you by Ready Tensor, where innovation meets reproducibility (https://www.readytensor.ai/), and by Intel and HPE Ezmeral Software Solutions (https://hpe.com/ezmeral/chatbots). Interested in sponsoring a SuperDataScience Podcast episode? Visit passionfroot.me/superdatascience for sponsorship information. In this episode you will learn: • Google’s Gemini model family and Lisa's key responsibilities [04:55] • How LLMs will transform the practice of Data Science [19:47] • Lisa on prompt engineering and reinforcement learning from human feedback [24:38] • How to fine-tune Gemini models with Google's Vertex AI [30:52] • How AI-assistants will transform life and work for everyone from data scientists to educators to children [47:14] • The challenges of developing a data-centric culture [57:31] • Centralized vs decentralized data science teams [1:03:50] Additional materials: www.superdatascience.com/761
AI-crafted beer, machine learning for passion projects, and self-taught data science: Jon Krohn and Beau Warren’s hotly anticipated, data-driven, punny lager Krohn&Borg is finally given a taste test in this week’s Five-Minute Friday. Heading to the Species X brewery in Columbus, Ohio, Jon Krohn and Beau Warren launched the beer that had been predicted, optimized and developed by a machine-learning model. Additional materials: www.superdatascience.com/760 Interested in sponsoring a SuperDataScience Podcast episode? Visit passionfroot.me/superdatascience for sponsorship information.
Encoders, cross attention and masking for LLMs: SuperDataScience Founder Kirill Eremenko returns to the SuperDataScience podcast, where he speaks with Jon Krohn about transformer architectures and why they are a new frontier for generative AI. If you’re interested in applying LLMs to your business portfolio, you’ll want to pay close attention to this episode! This episode is brought to you by Ready Tensor, where innovation meets reproducibility (https://www.readytensor.ai/), by Oracle NetSuite business software (netsuite.com/superdata), and by Intel and HPE Ezmeral Software Solutions (http://hpe.com/ezmeral/chatbots). Interested in sponsoring a SuperDataScience Podcast episode? Visit https://passionfroot.me/superdatascience for sponsorship information. In this episode you will learn: • How decoder-only transformers work [15:51] • How cross-attention works in transformers [41:05] • How encoders and decoders work together (an example) [52:46] • How encoder-only architectures excel at understanding natural language [1:20:34] • The importance of masking during self-attention [1:27:08] Additional materials: www.superdatascience.com/759
Explore the groundbreaking Mamba model, a potential game-changer in AI that promises to outpace the traditional Transformer architecture with its efficient, linear-time sequence modeling. Additional materials: www.superdatascience.com/758 Interested in sponsoring a SuperDataScience Podcast episode? Visit passionfroot.me/superdatascience for sponsorship information.
Explore mind-blowing storytelling with Cole Nussbaumer Knaflic in this episode. Audience favorite and author of "Storytelling with You," Cole returns to share essential tips for crafting impactful presentations, emphasizing narrative construction and audience engagement. Learn how to effectively communicate data and stories, enhancing your presentations with insights from a leading expert in the field. This episode is brought to you by CloudWolf (https://www.cloudwolf.com/sds), the Cloud Skills platform. Interested in sponsoring a SuperDataScience Podcast episode? Visit passionfroot.me/superdatascience for sponsorship information. In this episode you will learn: • How to become a confident communicator [11:59] • How to get rid of filler words [26:32] • How facts alone can't make a strong impact [41:44] • Cole's overview of her book Storytelling with You [55:19] • How to craft an effective presentation [1:00:24] • Common mistakes in virtual presentations [1:09:48] • Cole's virtual presentation setup [1:15:33] • Cole's next book Daphne Draws Data [1:20:23] Additional materials: www.superdatascience.com/757
AlphaGeometry, intuitive AI, and geometric deduction: In this week’s Five-Minute Friday, Super Data Science host Jon Krohn looks into developments from DeepMind, Google’s ground-breaking AI lab, and explores how this is a critical step towards a future of broadly accessible AI solutions across scientific disciplines. Additional materials: www.superdatascience.com/756 Interested in sponsoring a SuperDataScience Podcast episode? Visit passionfroot.me/superdatascience for sponsorship information.
ChatGPT applications and data-driven beer: Beer brewer and Super Data Science regular listener Beau Warren talks to Jon Krohn about the wonders of “sweaty ales”, how to brew beer with data, and how to get started on creative machine learning projects even without a degree in data science. This episode is brought to you by CloudWolf (https://www.cloudwolf.com/sds), the Cloud Skills platform. Interested in sponsoring a SuperDataScience Podcast episode? Visit https://passionfroot.me/superdatascience for sponsorship information. In this episode you will learn: • About Species X [06:31] • How to become a certified beer taster [12:37] • How Beau checks the quality of his beer [25:01] • Beau and Jon’s machine learning project [38:02] • About genetic algorithms [52:35] • How to get creativity out of LLMs [1:24:46] Additional materials: www.superdatascience.com/755
Explore the future of coding with poolside co-founder and CEO Jason Warner as he discusses the potential of code-specialized LLMs and their revolutionary impact on the developer's role. Tune in for insights on the shift towards an AI-led development paradigm. Additional materials: www.superdatascience.com/754 Interested in sponsoring a SuperDataScience Podcast episode? Visit passionfroot.me/superdatascience for sponsorship information.
Explore the future of collaborative ML workflows in this engaging episode with Dr. Greg Michaelson, Co-Founder of Zerve. Dr. Michaelson introduces the groundbreaking Zerve IDE and Pypelines project, addressing the critical gap in AutoML for commercial use and pinpointing why many A.I. projects don't meet their objectives. Gain insights into steering AI initiatives towards success and enhancing project communication, all in this insightful session. This episode is brought to you by Oracle NetSuite business software (https://netsuite.com/superdata), and by Prophets of AI (https://prophetsofai.com), the leading agency for AI experts. Interested in sponsoring a SuperDataScience Podcast episode? Visit https://passionfroot.me/superdatascience for sponsorship information. In this episode you will learn: • Why Zerve IDE is so sorely needed [04:50] • Pypelines: AutoML open-source in python [30:00] • Why most commercial A.I. projects fail and how to ensure they succeed [47:45] • How AutoML will impact the role of the data scientist [53:21] • Greg's background as a pastor and working at DataRobot [1:03:40] • How to develop impressive communication and storytelling skills [1:16:16] Additional materials: www.superdatascience.com/753
Jon Krohn interviews Hilke Schellmann about the ethics of recruitment algorithms, the field’s current state of play, and what can be improved about AI used in recruiting. Additional materials: www.superdatascience.com/752 Interested in sponsoring a SuperDataScience Podcast episode? Visit https://passionfroot.me/superdatascience for sponsorship information.
Venture capital and AI, and how to succeed with an AI company in 2024: Rasmus Rothe, Cofounder of Merantix, speaks to Jon Krohn about the Merantix campus in Berlin, how a venture capitalist identifies the best AI startups, the surefire ways for AI company founders to raise venture capital, and the jobs that are most and least vulnerable to disruption by automation. This episode is brought to you by Oracle NetSuite business software (netsuite.com/superdata), by QuickChat customized AI assistants (https://quickchat.ai), and by Prophets of AI (https://prophetsofai.com), the leading agency for AI experts. Interested in sponsoring a SuperDataScience Podcast episode? Visit passionfroot.me/superdatascience for sponsorship information. In this episode you will learn: • How Merantix started [05:17] • How Merantix works and how to apply for funding [08:19] • How to secure AI funding [21:02] • How AI companies can prove competitiveness [33:46] • Ensuring AI regulation [41:17] • How AI will change the future of work [56:56] Additional materials: www.superdatascience.com/751
Explore the transformative power of AI in science. Jon Krohn reviews the groundbreaking AI-driven discoveries at MIT and beyond, showcasing how AI is reshaping various scientific fields, from pharmaceuticals to climate science, and pondering the balance between AI's capabilities and human ingenuity. Additional materials: www.superdatascience.com/750 Interested in sponsoring a SuperDataScience Podcast episode? Visit passionfroot.me/superdatascience for sponsorship information.
Data science for clean energy takes center stage as Emily Pastewka from Palmetto joins Jon Krohn this week, exploring innovative paths to a sustainable future. This episode covers the impact of AI on smart energy choices, the creation of a smart grid, and the wide array of professionals required to bring cleantech data solutions to life. This episode is brought to you by Prophets of AI (https://prophetsofai.com), the leading agency for AI experts. Interested in sponsoring a SuperDataScience Podcast episode? Visit passionfroot.me/superdatascience for sponsorship information. In this episode you will learn: • Emily on her Master's in Deep Learning [08:20] • Using AI to solve clean energy challenges at Palmetto [17:22] • The different roles needed to solve cleantech problems [27:33] • How econometrics impacts consumer decision-making [38:56] • How Emily manages high-performing teams [56:30] • The tools and technologies that drive small teams [1:06:58] Additional materials: www.superdatascience.com/749
Artificial General Intelligence gets a new definition: This episode introduces Google DeepMind’s paper, “Levels of AGI: Operationalizing Progress on the Path to AGI”. Hear how its authors have organized narrow and general AI into hierarchical categories defined by human capability, from Level 0 (no AI) and Level 1 (equal to or somewhat better than an unskilled human) to Level 5 (able to outperform 100% of humans). A scary thought? Or a vision of a better future? Host Jon Krohn details the strengths of this research in this Five-Minute Friday. Additional materials: www.superdatascience.com/748 Interested in sponsoring a SuperDataScience Podcast episode? Visit passionfroot.me/superdatascience for sponsorship information.
Attention and transformers in LLMs, the five stages of data processing, and a brand-new Large Language Models A-Z course: Kirill Eremenko joins host Jon Krohn to explore what goes into well-crafted LLMs, what makes Transformers so powerful, and how to succeed as a data scientist in this new age of generative AI. This episode is brought to you by Intel and HPE Ezmeral Software Solutions (https://hpe.com/ezmeral/chatwithyourdata), and by Prophets of AI (https://prophetsofai.com), the leading agency for AI experts. Interested in sponsoring a SuperDataScience Podcast episode? Visit https://passionfroot.me/superdatascience for sponsorship information. In this episode you will learn: • Supply and demand in AI recruitment [08:30] • Kirill and Hadelin's new course on LLMs, “Large Language Models (LLMs), Transformers & GPT A-Z” [15:37] • The learning difficulty in understanding LLMs [19:46] • The basics of LLMs [22:00] • The five building blocks of transformer architecture [36:29] - 1: Input embedding [44:10] - 2: Positional encoding [50:46] - 3: Attention mechanism [54:04] - 4: Feedforward neural network [1:16:17] - 5: Linear transformation and softmax [1:19:16] • Inference vs training time [1:29:12] • Why transformers are so powerful [1:49:22] Additional materials: www.superdatascience.com/747
Jon’s continuous calendar for 2024 is here! Now in an updated format, learn about its unique layout and benefits, and how it can revolutionize your planning for the new year. Additional materials: www.superdatascience.com/746 Interested in sponsoring a SuperDataScience Podcast episode? Visit passionfroot.me/superdatascience for sponsorship information.
2024 data science trends take the spotlight in this special episode, where Jon joins Sadie St. Lawrence to analyze last year's predictions and delve into the emerging technologies reshaping the field. From AI hardware accelerators to the transformative role of large language models, this episode is a treasure trove of insights for anyone interested in the future of data science. This episode is brought to you by CloudWolf (www.cloudwolf.com/sds), the Cloud Skills platform. Interested in sponsoring a SuperDataScience Podcast episode? Visit https://passionfroot.me/superdatascience for sponsorship information. In this episode you will learn: • Reviewing predictions for 2023 [05:56] • Sadie's trend predictions for 2024 [20:49] • 1: Hardware evolution [21:17] • 2: LLMOS [35:30] • 3: Slow-thinking model [48:18] • 4: Tool consolidation [54:46] • 5: Workforce Upheaval [58:06] • Jon's predictions [1:06:26] • 1: AI bubble bursting [1:08:11] • 2: Breakthroughs in Edge AI [1:12:22] • Sadie on her productivity planner [1:17:50] Additional materials: www.superdatascience.com/745
2023: A year of great movement and change. Technological developments have rocketed generative AI’s capabilities into the stratosphere of possibilities for future approaches to work, health, and play. Host Jon Krohn recognizes the benefits we have seen over the past year, discusses the important role we all have in ensuring ethics remains at the core of AI development and use, and ends the year with a musical surprise for his listeners! Additional materials: www.superdatascience.com/744 Interested in sponsoring a SuperDataScience Podcast episode? Visit http://passionfroot.me/superdatascience for sponsorship information.
Chatbots, large language models and generative AI: Founder of Quickchat AI Piotr Grudzień believes the key to any successful AI platform is to ensure it can be tailored to a company’s specific needs. He speaks to host Jon Krohn about helping clients generate realistic and satisfying conversations that help their customer base find what they need quickly. This episode is brought to you by Gurobi (https://gurobi.com/sds), the Decision Intelligence Leader, and by CloudWolf (https://www.cloudwolf.com/sds), the Cloud Skills platform. Interested in sponsoring a SuperDataScience Podcast episode? Visit http://passionfroot.me/superdatascience for sponsorship information. In this episode you will learn: • About Quickchat AI and how it works [02:46] • How to successfully set up a conversational AI [23:58] • What “temperature” is in the context of AI [38:38] • How the LLM landscape has changed in recent years [40:24] • The future of generative AI [57:43] • The advantages of an AI accelerator [1:09:38] Additional materials: www.superdatascience.com/743
Join us on a brief journey through the AI world in 2023. A year ago, GPT-3.5 crafting our holiday message was a marvel, but now, with GPT-4's arrival, we're seeing an even more astounding evolution in AI. As we wave goodbye to the trend of generative AI, the Super Data Science Podcast team is bringing a personal touch back. Tune in for our heartfelt Happy Holidays message and a big thank you to all our listeners for your unwavering support. Additional materials: www.superdatascience.com/742 Interested in sponsoring a SuperDataScience Podcast episode? Visit JonKrohn.com/podcast for sponsorship information.
Data visualization remains at the forefront as Dr. Alberto Cairo from the University of Miami guides us beyond numerical figures, exploring the art of weaving compelling narratives through data. In his book, "The Art of Insight," he reveals the varied motivations driving visualization experts and highlights the serene, meditative process inherent in crafting visualizations. Emphasizing the fusion of scientific principles and personal style for effective data communication, Dr. Cairo also discusses with Jon the impending impact of AI on both interactive and static graphics. This episode is brought to you by Gurobi (https://gurobi.com/sds), the Decision Intelligence Leader, by HPE Ezmeral Software Solutions (https://hpe.com/ezmeral/chatwithyourdata), and by CloudWolf (https://www.cloudwolf.com/sds), the Cloud Skills platform. Interested in sponsoring a SuperDataScience Podcast episode? Visit JonKrohn.com/podcast for sponsorship information. In this episode you will learn: • Alberto's book, The Art of Insight [04:07] • How to transform data into engaging visuals [07:06] • What it takes to enter a meditation-like flow state when creating visualizations [11:21] • How to balance the science of visualization with one’s personal style [29:29] • The importance of Smart Brevity for great data visualizations [37:32] • How data visualization can drive social change [42:31] • How diversity in designers enriches the field [52:07] • The future of data visualizations [59:10] Additional materials: www.superdatascience.com/741