Episodi

  • Alex Watson is the co-founder and CEO of http://Gretel.ai (Gretel.ai), a startup that offers APIs for creating anonymized and synthetic datasets. Previously he was the founder of http://Harvest.ai (Harvest.ai), whose product Macie, an analytics platform protecting against data breaches, was acquired by AWS.
    Learn more about Alex and Gretel AI:
    http://gretel.ai (http://gretel.ai)
    Every Thursday I send out the most useful things I’ve learned, curated specifically for the busy machine learning engineer. Sign up here: https://www.cyou.ai/newsletter (https://www.cyou.ai/newsletter)


    Follow Charlie on Twitter: https://twitter.com/CharlieYouAI (https://twitter.com/CharlieYouAI)
    Subscribe to ML Engineered: https://mlengineered.com/listen (https://mlengineered.com/listen)
    Comments? Questions? Submit them here: http://bit.ly/mle-survey (http://bit.ly/mle-survey)
    Take the Giving What We Can Pledge: https://www.givingwhatwecan.org/ (https://www.givingwhatwecan.org/)


    Timestamps:
    02:15 Introducing Alex Watson
    03:45 How Alex was first exposed to programming
    05:00 Alex's experience starting Harvest AI, getting acquired by AWS, and integrating their product at massive scale
    21:20 How Alex first saw the opportunity for http://Gretel.ai (Gretel.ai)
    24:20 The most exciting use-cases for synthetic data
    28:55 Theoretical guarantees of anonymized data with differential privacy
    36:40 Combining pre-training with synthetic data
    38:40 When to anonymize data and when to synthesize it
    41:25 How Gretel's synthetic data engine works
    44:50 Requirements of a dataset to create a synthetic version
    49:25 Augmenting datasets with synthetic examples to address representation bias
    52:45 How Alex recommends teams get started with http://Gretel.ai (Gretel.ai)
    59:00 Expected accuracy loss from training models on synthetic data
    01:03:15 Biggest surprises from building http://Gretel.ai (Gretel.ai)
    01:05:25 Organizational patterns for protecting sensitive data
    01:07:40 Alex's vision for Gretel's data catalog
    01:11:15 Rapid fire questions


    Links:
    https://gretel.ai/blog (Gretel.ai Blog)
    https://www.wired.com/2010/03/netflix-cancels-contest/ (NetFlix Cancels Recommendation Contest After Privacy Lawsuit)
    https://greylock.com/portfolio-news/the-github-of-data/ (Greylock - The Github of Data)
    https://gretel.ai/blog/improving-massively-imbalanced-datasets-in-machine-learning-with-synthetic-data (Improving massively imbalanced datasets in machine learning with synthetic data)
    https://gretel.ai/blog/deep-dive-on-generating-synthetic-data-for-healthcare (Deep dive on generating synthetic data for Healthcare)
    https://medium.com/gretel-ai/synthetic-data-performance-report-e5a3cd6b1e6d (Gretel’s New Synthetic Performance Report)
    https://www.goodreads.com/book/show/18007564-the-martian (The Martian)
    https://www.penguinrandomhouse.com/books/172832/snow-crash-by-neal-stephenson/ (Snow Crash)
    https://us.macmillan.com/series/themurderbotdiaries/ (The MurderBot Diaries)

  • Radek Osmulski is a fully self-taught machine learning engineer. After getting tired of his corporate job, he taught himself programming and started a new career as a Ruby on Rails developer. He then set out to learn machine learning. Since then, he's been a Fast AI International Fellow, become a Kaggle Master, and is now an AI Data Engineer on the Earth Species Project.
    Learn more about Radek:
    https://www.radekosmulski.com (https://www.radekosmulski.com)
    https://twitter.com/radekosmulski (https://twitter.com/radekosmulski)
    Every Thursday I send out the most useful things I’ve learned, curated specifically for the busy machine learning engineer. Sign up here: http://cyou.ai/newsletter (http://cyou.ai/newsletter)
    Follow Charlie on Twitter: https://twitter.com/CharlieYouAI (https://twitter.com/CharlieYouAI)
    Subscribe to ML Engineered: https://mlengineered.com/listen (https://mlengineered.com/listen)
    Comments? Questions? Submit them here: http://bit.ly/mle-survey (http://bit.ly/mle-survey)
    Take the Giving What We Can Pledge: https://www.givingwhatwecan.org/ (https://www.givingwhatwecan.org/)


    Timestamps:
    02:15 How Radek got interested in programming and computer science
    09:00 How Radek taught himself machine learning
    26:40 The skills Radek learned from Fast AI
    39:20 Radek's recommendations for people learning ML now
    51:30 Why Radek is writing a book
    01:01:20 Radek's work at the Earth Species Project
    01:10:15 How the ESP collects animal language data
    01:21:05 Rapid fire questions


    Links:
    https://gumroad.com/l/learn_deep_learning (Radek's Book "Meta-Learning")
    https://www.coursera.org/learn/machine-learning (Andrew Ng ML Coursera)
    https://www.fast.ai (Fast AI)
    https://arxiv.org/abs/1801.06146 (Universal Language Model Fine-tuning for Text Classification)
    https://www.kdnuggets.com/2018/03/machine-learning-efficiently.html (How to do Machine Learning Efficiently)
    https://www.npr.org/2020/02/25/809336135/two-heartbeats-a-minute (NPR - Two Heartbeats a Minute)
    https://www.earthspecies.org/ (Earth Species Project)
    https://www.goodreads.com/book/show/5617966-a-guide-to-the-good-life (A Guide to the Good Life)
    https://store.hbr.org/product/the-origin-of-wealth-evolution-complexity-and-the-radical-remaking-of-economics/777X (The Origin of Wealth)
    https://maketime.blog (Make Time)
    https://plumvillage.org/books/you-are-here/ (You Are Here)

  • Episodi mancanti?

    Fai clic qui per aggiornare il feed.

  • Rodrigo Rivera is a machine learning researcher at the Advanced Data Analytics in Science and Engineering Group at Skoltech and technical director of Samsung Next. He's previously been in data science and research leadership roles at companies all around the world including Rocket Internet and Philip-Morris.Learn more about Rodrigo:https://rodrigo-rivera.com/ (https://rodrigo-rivera.com/)https://twitter.com/rodrigorivr (https://twitter.com/rodrigorivr)Every Thursday I send out the most useful things I’ve learned, curated specifically for the busy machine learning engineer. Sign up here: https://www.cyou.ai/newsletter (https://www.cyou.ai/newsletter)Follow Charlie on Twitter: https://twitter.com/CharlieYouAI (https://twitter.com/CharlieYouAI)Subscribe to ML Engineered: https://mlengineered.com/listen (https://mlengineered.com/listen)Comments? Questions? Submit them here: http://bit.ly/mle-survey (http://bit.ly/mle-survey)Take the Giving What We Can Pledge: https://www.givingwhatwecan.org/ (https://www.givingwhatwecan.org/)Timestamps:03:00 How Rodrigo got started in computer science and started his first company10:40 Rodrigo's experiences leading data science teams at Rocket Internet and PMI26:15 Leaving industry to get a PhD in machine learning28:55 Data science collaboration between business and academia32:45 Rodrigo's research interest in time series data39:25 Topological data analysis45:35 Framing effective research as a startup48:15 Neural Prophet01:04:10 The potential future of Julia for numerical computing01:08:20 Most exciting opportunities for ML in industry01:15:05 Rodrigo's advice for listeners01:17:00 Rapid fire questionsLinks:https://scholar.google.de/citations?user=nQGmpjUAAAAJ&hl=en (Rodrigo's Google Scholar)http://adase.group (Advanced Data Analytics in Science and Engineering Group)http://neuralprophet.com (Neural Prophet)https://en.wikipedia.org/wiki/Makridakis_Competitions (M-Competitions)https://www.cambridge.org/us/academic/subjects/engineering/communications-and-signal-processing/machine-learning-refined-foundations-algorithms-and-applications-2nd-edition?format=HB (Machine Learning Refined)https://cs.nyu.edu/~mohri/mlbook/ (Foundations of Machine Learning)http://www.dcs.gla.ac.uk/~srogers/firstcourseml/ (A First Course in Machine Learning)

  • Dan Jeffries is the chief technical evangelist at Pachyderm, a leading data science platform. He's a prominent writer and speaker on all things related to the future. He's been in software for over two decades, many of those at Redhat, and is the founder of the AI Infrastructure Alliance and Practical AI Ethics.
    Learn more about Dan:
    https://twitter.com/Dan_Jeffries1 (https://twitter.com/Dan_Jeffries1)
    https://medium.com/@dan.jeffries (https://medium.com/@dan.jeffries)
    Every Thursday I send out the most useful things I’ve learned, curated specifically for the busy machine learning engineer. Sign up here: http://cyou.ai/newsletter (http://cyou.ai/newsletter)


    Follow Charlie on Twitter: https://twitter.com/CharlieYouAI (https://twitter.com/CharlieYouAI)
    Subscribe to ML Engineered: https://mlengineered.com/listen (https://mlengineered.com/listen)
    Comments? Questions? Submit them here: http://bit.ly/mle-survey (http://bit.ly/mle-survey)
    Take the Giving What We Can Pledge: https://www.givingwhatwecan.org/ (https://www.givingwhatwecan.org/)


    Timestamps:
    02:15 How Dan got started in computer science
    06:50 What Dan is most excited about in AI
    14:45 Where we are in the adoption curve of ML
    20:40 The "Canonical Stack" of ML
    32:00 Dan's goal for the AI Infrastructure Alliance
    40:55 "Problems that ML startups don't know they're going to have"
    49:00 Closed vs open source tools in the Canonical Stack
    01:00:05 Building out the "boring" part of the infrastructure to enable exciting applications
    01:08:40 Dan's practical approach to AI Ethics
    01:23:50 Rapid fire questions


    Links:
    https://www.pachyderm.com/ (Pachyderm)
    https://ai-infrastructure.org/ (AI Infrastructure Alliance)
    https://practical-ai-ethics.org/ (Practical AI Ethics Alliance)
    https://towardsdatascience.com/rise-of-the-canonical-stack-in-machine-learning-724e7d2faa75 (Rise of the Canonical Stack in Machine Learning)
    https://www.youtube.com/watch?v=q_KPNtmc9m8 (Rise of AI - The Age of AI in 2030)
    https://magenta.tensorflow.org/ (Google Magenta)
    https://www.youtube.com/watch?v=WXuK6gekU1Y (AlphaGo Documentary)
    https://www.annieduke.com/books/ (Thinking in Bets)
    https://www.goodreads.com/book/show/3872.A_History_of_the_World_in_6_Glasses (A History of the World in 6 Glasses)
    https://www.penguinrandomhouse.com/books/562923/super-thinking-by-gabriel-weinberg-and-lauren-mccann/ (Super-Thinking)

  • Willem Pienaar is the co-creator of Feast, the leading open source feature store, which he leads the development of as a tech lead at Tecton. Previously, he led the ML platform team at Gojek, a super-app in Southeast Asia.
    Learn more:
    https://twitter.com/willpienaar (https://twitter.com/willpienaar)
    https://feast.dev/ (https://feast.dev/)
    Every Thursday I send out the most useful things I’ve learned, curated specifically for the busy machine learning engineer. Sign up here: https://www.cyou.ai/newsletter (https://www.cyou.ai/newsletter)


    Follow Charlie on Twitter: https://twitter.com/CharlieYouAI (https://twitter.com/CharlieYouAI)
    Subscribe to ML Engineered: https://mlengineered.com/listen (https://mlengineered.com/listen)
    Comments? Questions? Submit them here: http://bit.ly/mle-survey (http://bit.ly/mle-survey)
    Take the Giving What We Can Pledge: https://www.givingwhatwecan.org/ (https://www.givingwhatwecan.org/)


    Timestamps:
    02:15 How Willem got started in computer science
    03:40 Paying for college by starting an ISP
    05:25 Willem's experience creating Gojek's ML platform
    21:45 Issues faced that led to the creation of Feast
    26:45 Lessons learned building Feast
    33:45 Integrating Feast with data quality monitoring tools
    40:10 What it looks like for a team to adopt Feast
    44:20 Feast's current integrations and future roadmap
    46:05 How a data scientist would use Feast when creating a model
    49:40 How the feature store pattern handles DAGs of models
    52:00 Priorities for a startup's data infrastructure
    55:00 Integrating with Amundsen, Lyft's data catalog
    57:15 The evolution of data and MLOps tool standards for interoperability
    01:01:35 Other tools in the modern data stack
    01:04:30 The interplay between open and closed source offerings


    Links:
    https://github.com/feast-dev/feast (Feast's Github)
    https://blog.gojekengineering.com/data-science/home (Gojek Data Science Blog)
    https://www.getdbt.com/ (Data Build Tool (DBT))
    https://www.tensorflow.org/tfx/data_validation/get_started (Tensorflow Data Validation (TFDV))
    https://feast.dev/post/a-state-of-feast/ (A State of Feast)
    https://cloud.google.com/bigquery (Google BigQuery)
    https://www.amundsen.io/ (Lyft Amundsen)
    https://www.cortex.dev/ (Cortex)
    https://www.kubeflow.org/ (Kubeflow)
    https://mlflow.org/ (MLFlow)

  • Benedikt Koller is a self-professed "Ops guy", having spent over 12 years working in roles such as DevOps engineer, platform engineer, and infrastructure tech lead at companies like Stylight and Talentry in addition to his own consultancy KEMB. He's recently dove head first into the world of ML, where he hopes to bring his extensive ops knowledge into the field as the co-founder of Maiot, the company behind ZenML, an open source MLOps framework.
    Learn more:
    https://zenml.io/ (https://zenml.io/)
    https://maiot.io/ (https://maiot.io/)
    Every Thursday I send out the most useful things I’ve learned, curated specifically for the busy machine learning engineer. Sign up here: https://www.cyou.ai/newsletter (https://www.cyou.ai/newsletter)
    Follow Charlie on Twitter: https://twitter.com/CharlieYouAI (https://twitter.com/CharlieYouAI)
    Subscribe to ML Engineered: https://mlengineered.com/listen (https://mlengineered.com/listen)
    Comments? Questions? Submit them here: http://bit.ly/mle-survey (http://bit.ly/mle-survey)
    Take the Giving What We Can Pledge: https://www.givingwhatwecan.org/ (https://www.givingwhatwecan.org/)
    Timestamps:
    02:15 Introducing Benedikt Koller
    05:30 What the "DevOps revolution" was
    10:10 Bringing good Ops practices into ML projects
    30:50 Pivoting from vehicle predictive analytics to open source ML tooling
    34:35 Design decisions made in ZenML
    39:20 Most common problems faced by applied ML teams
    49:00 The importance of separating configurations from code
    55:25 Resources Ben recommends for learning Ops
    57:30 What to monitor in an ML pipelines
    01:00:45 Why you should run experiments in automated pipelines
    01:08:20 The essential components of an MLOps stack
    01:10:25 Building an open source business and what's next for ZenML
    01:20:20 Rapid fire questions
    Links:
    https://github.com/maiot-io/zenml (ZenML's GitHub)
    https://blog.maiot.io/ (Maiot Blog)
    https://12factor.net/ (The Twelve Factor App)
    https://blog.maiot.io/12-factors-of-ml-in-production/ (12 Factors of reproducible Machine Learning in production)
    https://www.seldon.io/ (Seldon)
    https://www.pachyderm.com/ (Pachyderm)
    https://www.kubeflow.org/ (KubeFlow)
    https://www.penguinrandomhouse.com/books/566988/something-deeply-hidden-by-sean-carroll/ (Something Deeply Hidden)
    https://www.goodreads.com/series/56399-the-expanse (The Expanse Series)
    https://us.macmillan.com/books/9780765382030 (The Three Body Problem)
    https://echelonfront.com/extreme-ownership/ (Extreme Ownership)

  • Josh Albrecht is the co-founder and CTO of Generally Intelligent, an independent research lab investigating the fundamentals of learning across humans and machines. Previously, he was the lead data architect at Addepar, CTO of CloudFab, and CTO of Sourceress, which Generally Intelligent is a pivot from.
    Learn more about Josh:
    http://joshalbrecht.com/ (http://joshalbrecht.com/)
    http://generallyintelligent.ai/ (http://generallyintelligent.ai/)
    Every Thursday I send out the most useful things I’ve learned, curated specifically for the busy machine learning engineer. Sign up here: https://www.cyou.ai/newsletter (https://www.cyou.ai/newsletter)
    Follow Charlie on Twitter: https://twitter.com/CharlieYouAI (https://twitter.com/CharlieYouAI)
    Subscribe to ML Engineered: https://mlengineered.com/listen (https://mlengineered.com/listen)
    Comments? Questions? Submit them here: http://bit.ly/mle-survey (http://bit.ly/mle-survey)
    Take the Giving What We Can Pledge: https://www.givingwhatwecan.org/ (https://www.givingwhatwecan.org/)
    Timestamps:
    02:15 Introducing Josh Albrecht
    03:30 How Josh got started in computer science
    06:35 Josh's first two startup attempts
    09:15 The tech behind Sourceress, an AI recruiting platform
    16:10 Pivoting from Sourceress to Generally Intelligent, an AI research lab
    23:50 How Josh defines "general intelligence"
    28:35 Why Josh thinks self-supervised learning is the current most promising research area
    36:15 Generally Intelligent's immediate research roadmap: BYOL, simulated environments
    59:20 How Josh thinks about creating an optimal research environment
    01:11:35 The "why" behind starting an independent research lab
    01:13:30 AI alignment
    01:17:00 Rapid fire questions


    Links:
    https://arxiv.org/abs/2006.07733 (Bootstrap your own latent: A new approach to self-supervised Learning)
    https://generallyintelligent.ai/understanding-self-supervised-contrastive-learning.html (Understanding self-supervised and contrastive learning with "Bootstrap Your Own Latent" (BYOL))
    https://arxiv.org/abs/2010.10241 (BYOL works even without batch statistics)
    https://open.spotify.com/show/1hikWa5LWDQJwXtz5LoeVn (Generally Intelligent Podcast)
    https://arxiv.org/abs/2102.03896 (Consequences of Misaligned AI)
    https://www.goodreads.com/book/show/34466963-why-we-sleep (Why We Sleep)
    https://www.goodreads.com/book/show/26312997-peak (Peak)

  • Elena Samuylova and Emeli Dral are the co-founders of Evidently AI, where they build open source tools to analyze and monitor machine learning models. Elena was previously the head of the startup ecosystem at Yandex, director of business development at their data factory and chief product officer at Mechanica AI. Emeli was previously a data scientist at Yandex, chief data scientist at the data factory and Mechanica AI in addition to teaching machine learning both online and at multiple universities.
    Learn more about Elena, Emeli, and Evidently AI:
    https://evidentlyai.com/ (https://evidentlyai.com/)
    https://twitter.com/elenasamuylova (https://twitter.com/elenasamuylova)
    https://twitter.com/EmeliDral (https://twitter.com/EmeliDral)
    Every Thursday I send out the most useful things I’ve learned, curated specifically for the busy machine learning engineer. Sign up here: http://cyou.ai/newsletter (http://cyou.ai/newsletter)


    Follow Charlie on Twitter: https://twitter.com/CharlieYouAI (https://twitter.com/CharlieYouAI)
    Subscribe to ML Engineered: https://mlengineered.com/listen (https://mlengineered.com/listen)
    Comments? Questions? Submit them here: http://bit.ly/mle-survey (http://bit.ly/mle-survey)
    Take the Giving What We Can Pledge: https://www.notion.so/charlieyou/Content-Pipeline-af923f8b990646369a85a00a348a1e12 (https://www.givingwhatwecan.org/)


    Timestamps:
    02:15 How Emeli and Elena each got started in data science
    07:10 Applying machine learning across a wide variety of industries at the Yandex Data Factory
    14:55 Using ML for industrial process improvement
    23:35 Challenges encountered in industrial ML and technical solutions
    27:15 The huge opportunity for ML in manufacturing
    34:35 How to ensure safety when using models in physical systems
    37:40 Why they started working on tools for data and ML monitoring
    42:50 Different kinds of data drift and how to address them
    48:25 Common mistakes ML teams make in monitoring
    55:25 Features of Evidently AI's library
    57:35 Building open source software
    01:02:25 Technical roadmap for Evidently
    01:05:50 Monitoring complex data
    01:08:50 Business roadmap for Evidently
    01:11:35 Rapid fire questions


    Links:
    https://github.com/evidentlyai/evidently (Evidently on Github)
    https://evidentlyai.com/blog (Evidently AI's Blog)
    https://us.macmillan.com/books/9780374533557 (Thinking Fast and Slow)
    https://www.goodreads.com/book/show/66354.Flow (Flow)
    https://www.effectivealtruism.org/doing-good-better/ (Doing Good Better)

  • Harikrishna Narayanan is the co-founder of a YC-backed stealth startup. He was previously a Principal Engineer at Yahoo, a Director in Workday's Machine Learning organization, and holds an M.S. from Georgia Tech.
    Every Thursday I send out the most useful things I’ve learned, curated specifically for the busy machine learning engineer. Sign up here: https://cyou.ai/newsletter (https://cyou.ai/newsletter)
    Follow Charlie on Twitter: https://twitter.com/CharlieYouAI (https://twitter.com/CharlieYouAI)
    Subscribe to ML Engineered: https://mlengineered.com/listen (https://mlengineered.com/listen)
    Comments? Questions? Submit them here: http://bit.ly/mle-survey (http://bit.ly/mle-survey)
    Take the Giving What We Can Pledge: https://www.notion.so/charlieyou/Content-Pipeline-af923f8b990646369a85a00a348a1e12 (https://www.givingwhatwecan.org/)
    Timestamps:
    02:45 How Hari got started in computer science and machine learning
    06:00 Making the transition from IC to manager
    14:35 What it means to be an effective engineering manager
    19:20 Differences in managing machine learning vs traditional software teams
    24:30 The importance of explaining complicated topics simply
    30:15 How he thinks about hiring for data science and machine learning
    36:50 Mistakes Workday made as it adopted machine learning
    41:50 Essential skills for machine learning engineers
    54:05 Why the future of AI is augmentation, not automation
    58:30 His experience so far with YC
    01:02:00 Rapid fire questions
    Links:
    https://www.brainpickings.org/2014/01/29/carol-dweck-mindset/ (Growth Mindset)
    https://fs.blog/2012/04/feynman-technique/ (The Feynman Technique)
    https://www.radicalcandor.com/ (Radical Candor)
    https://www.trilliondollarcoach.com/ (Trillion Dollar Coach)
    https://thewisemangroup.com/books/multipliers/ (Multipliers)
    https://www.jimcollins.com/article_topics/articles/good-to-great.html#articletop (Good to Great)
    https://hbr.org/books/watkins (The First 90 Days)
    https://www.harpercollins.com/products/crossing-the-chasm-3rd-edition-geoffrey-a-moore?variant=32130444066850 (Crossing the Chasm)
    https://en.wikipedia.org/wiki/Zero_to_One (Zero to One)
    http://theleanstartup.com/ (The Lean Startup)
    https://hardthings.bhorowitz.com/ (The Hard Thing About Hard Things)
    https://www.ynharari.com/book/sapiens-2/ (Sapiens)
    https://www.penguinrandomhouse.com/books/20549/a-short-history-of-nearly-everything-special-illustrated-edition-by-bill-bryson/ (A Short History of Nearly Everything)
    https://numenta.com/resources/on-intelligence/ (On Intelligence)
    https://www.predictionmachines.ai/ (Prediction Machines)
    https://algorithmstoliveby.com/ (Algorithms to Live By)
    https://sdv.dev/ (The Synthetic Data Vault)

  • Learn more about the ML Ops Community: https://mlops.community/ (https://mlops.community/)
    Every Thursday I send out the most useful things I’ve learned, curated specifically for the busy machine learning engineer. Sign up here: https://cyou.ai/newsletter (https://cyou.ai/newsletter)


    Follow Charlie on Twitter: https://twitter.com/CharlieYouAI (https://twitter.com/CharlieYouAI)
    Subscribe to ML Engineered: https://mlengineered.com/listen (https://mlengineered.com/listen)
    Comments? Questions? Submit them here: http://bit.ly/mle-survey (http://bit.ly/mle-survey)
    Take the Giving What We Can Pledge: https://www.givingwhatwecan.org/ (https://www.givingwhatwecan.org/)


    Timestamps:
    02:45 Intro
    04:10 How I got into data science and machine learning
    08:25 My experience working as an ML engineer and starting the podcast
    12:15 Project management methods for machine learning
    20:50 ML job roles are trending towards more specialization
    26:15 ML tools enable collaboration between roles and encode best practices
    34:00 Data privacy, security, and provenance as first class considerations
    39:30 The future of managed ML platforms and cloud providers
    49:05 What I've learned about building a career in ML engineering
    54:10 Dealing with information overload


    Links:
    https://www.mlengineered.com/episode/josh-tobin (Josh Tobin: Research at OpenAI, Full Stack Deep Learning, ML in Production)
    https://towardsdatascience.com/the-third-wave-data-scientist-1421df7433c9 (The Third Wave Data Scientist)
    https://www.youtube.com/watch?v=GvAyV8m8ICI (Practical ML Ops // Noah Gift // MLOps Coffee Sessions)
    https://cyou.ai/podcast/pavle-jeremic/ (Building a Post-Scarcity Future using Machine Learning with Pavle Jeremic (Aether Bio))
    https://www.youtube.com/watch?v=Fu87cHHfOE4 (SRE for ML Infra // Todd Underwood // MLOps Coffee Sessions)
    https://www.youtube.com/watch?v=ShBod1yXUeg (Luigi Patruno on the ML Ops Community podcast)
    https://www.mlengineered.com/episode/luigi-patruno (Luigi Patruno: ML in Production, Adding Business Value with Data Science, "Code 2.0")

  • Pavle Jeremic is the founder and CEO of Aether Biomachines, one of the most exciting ML-powered startups I've come across. His mission is to solve scarcity and Aether is the first step towards that. He was recently featured in Forbes' 30 under 30 in Manufacturing and holds a B.S. in Biomolecular Engineering from UC Santa Cruz.
    Learn more:
    Aether Biomachines


    Every Thursday I send out the most useful things I’ve learned, curated specifically for the busy machine learning engineer. Sign up here: http://bitly.com/mle-newsletter (http://bitly.com/mle-newsletter)


    Follow Charlie on Twitter: https://twitter.com/CharlieYouAI (https://twitter.com/CharlieYouAI)
    Subscribe to ML Engineered: https://mlengineered.com/listen (https://mlengineered.com/listen)
    Comments? Questions? Submit them here: http://bit.ly/mle-survey (http://bit.ly/mle-survey)
    Take the Giving What We Can Pledge: https://www.givingwhatwecan.org/ (https://www.givingwhatwecan.org/)


    Timestamps:
    02:45 Pavle Jeremic
    05:20 How Pavle was introduced to computer science and programming
    08:00 Solving scarcity from first principles
    23:20 How Aether contributes to the post-scarcity future
    29:30 What enzymatic reaction data looks like
    37:20 Using deep learning to figure out what enzymatic experiments to run next
    39:45 How Aether runs thousands of experiments at a time
    47:00 What the current bottleneck of the system is
    53:15 The evolution of ML models at Aether
    59:00 Gaps in existing ML infrastructure solutions
    01:03:30 Why Aether is releasing some of their data for a competition
    01:06:50 The upcoming roadmap for Aether
    01:09:30 Rapid fire questions


    Links:
    https://podcasts.apple.com/us/podcast/4-making-alchemy-real-pavle-jeremic-aether-biomachines/id1498805236?i=1000465399648 (Founders First Interview - Making Alchemy Real)
    https://deepchem.io/ (DeepChem)
    https://en.wikipedia.org/wiki/Engines_of_Creation (Engines of Creation)
    https://www.goodreads.com/series/49121-rama (Rama Series)

  • Every Thursday I send out the most useful things I’ve learned, curated specifically for the busy machine learning engineer. Sign up here: http://bitly.com/mle-newsletter (http://bitly.com/mle-newsletter)
    Follow Charlie on Twitter: https://twitter.com/CharlieYouAI (https://twitter.com/CharlieYouAI)
    Subscribe to ML Engineered: https://mlengineered.com/listen (https://mlengineered.com/listen)
    Comments? Questions? Submit them here: http://bit.ly/mle-survey (http://bit.ly/mle-survey)
    Take the Giving What We Can Pledge: https://www.givingwhatwecan.org/ (https://www.givingwhatwecan.org/)


    Timestamps:
    02:50 https://www.mlengineered.com/episode/josh-tobin (Josh Tobin: Research at OpenAI, Full Stack Deep Learning, ML in Production)
    21:48 https://www.mlengineered.com/episode/shreya-shankar (Shreya Shankar: Lessons learned after a year of putting ML into production)
    34:44 https://www.mlengineered.com/episode/luigi-patruno (Luigi Patruno: ML in Production, Adding Business Value with Data Science, "Code 2.0")
    53:28 https://www.mlengineered.com/episode/andreas-jansson (Music Information Retrieval at Spotify and the Future of ML Tooling with Andreas Jansson of Replicate)

  • Every Thursday I send out the most useful things I’ve learned, curated specifically for the busy machine learning engineer. Sign up here: http://bitly.com/mle-newsletter (http://bitly.com/mle-newsletter)
    Follow Charlie on Twitter: https://twitter.com/CharlieYouAI (https://twitter.com/CharlieYouAI)
    Subscribe to ML Engineered: https://mlengineered.com/listen (https://mlengineered.com/listen)
    Comments? Questions? Submit them here: http://bit.ly/mle-survey (http://bit.ly/mle-survey)
    Take the Giving What We Can Pledge: https://www.givingwhatwecan.org/ (https://www.givingwhatwecan.org/)

  • Andreas Jansson is the co-founder of Replicate, a version control tool for machine learning. He holds a PhD from City University of London in Music Informatics and was previously a machine learning engineer at Spotify, researching and applying algorithms for music information retrieval.
    Learn more about Andreas:
    https://replicate.ai/ (https://replicate.ai/)
    https://www.linkedin.com/in/janssonandreas/ (https://www.linkedin.com/in/janssonandreas/)
    Every Thursday I send out the most useful things I’ve learned, curated specifically for the busy machine learning engineer. Sign up here: http://bitly.com/mle-newsletter (http://bitly.com/mle-newsletter)


    Follow Charlie on Twitter: https://twitter.com/CharlieYouAI (https://twitter.com/CharlieYouAI)
    Subscribe to ML Engineered: https://mlengineered.com/listen (https://mlengineered.com/listen)
    Comments? Questions? Submit them here: http://bit.ly/mle-survey (http://bit.ly/mle-survey)
    Take the Giving What We Can Pledge: https://www.givingwhatwecan.org/ (https://www.givingwhatwecan.org/)


    Timestamps:
    02:30 Andreas Jansson
    07:30 Overview of music information retrieval (MIR)
    13:30 Why use spectrograms and not raw audio?
    19:55 The potential for transformers in MIR
    22:45 Most exciting applications for ML in MIR
    29:20 Challenges in putting ML into production
    36:45 What Andreas imagines for the future of ML tools
    41:45 Why he's building a tool for ML version control (http://replicate.ai/ (http://replicate.ai/))
    52:55 What Replicate enables via integration or as a platform
    01:02:55 Learnings from doing customer discovery for Replicate
    01:14:10 "Github for ML models and data"
    01:22:30 Rapid fire questions


    Links:
    https://deepmind.com/blog/article/wavenet-generative-model-raw-audio (WaveNet: a generative model for raw audio)
    https://openaccess.city.ac.uk/id/eprint/19289/1/ (Singing Voice Separation with Deep U-Net CNNs)
    https://openaccess.city.ac.uk/id/eprint/23669/1/ (Joint Singing Voice Separation and F0 Estimation with Deep U-Net Architectures)
    https://www.arxiv-vanity.com/ (arXiv Vanity)
    https://replicate.ai/ (Replicate)
    https://discord.gg/QmzJApGjyE (Replicate's Discord)

  • Luigi is the director of data science at 2U, where he leads a team in developing ML models and infrastructure to predict student success outcomes. He's also the founder of ML in Production, a blog and newsletter that helps readers build, deploy, and run ML systems.
    Learn more about Luigi:
    https://mlinproduction.com/ (https://mlinproduction.com/)
    https://twitter.com/mlinproduction (https://twitter.com/mlinproduction)
    Every Thursday I send out the most useful things I’ve learned, curated specifically for the busy machine learning engineer. Sign up here: http://bitly.com/mle-newsletter (http://bitly.com/mle-newsletter)


    Follow Charlie on Twitter: https://twitter.com/CharlieYouAI (https://twitter.com/CharlieYouAI)
    Subscribe to ML Engineered: https://mlengineered.com/listen (https://mlengineered.com/listen)
    Comments? Questions? Submit them here: http://bit.ly/mle-survey (http://bit.ly/mle-survey)
    Take the Giving What We Can Pledge: https://www.givingwhatwecan.org/ (https://www.givingwhatwecan.org/)


    Timestamps:
    02:45 Luigi Patruno
    04:50 How can ML teams be more rigorous in their engineering practices?
    10:25 Best practices for monitoring and logging ML systems
    18:00 Adding business value with data science
    37:10 Most valuable types of tools for ML in production
    43:15 What an ideal data pipeline setup looks like
    47:50 Unbundling the "Data Scientist" role
    50:35 The future of building software: "Code 2.0"
    59:45 Most valuable skills for the future
    01:10:15 Learnings from writing his blog "ML in Production"
    01:15:00 Rapid fire questions


    Links:
    https://datacast.simplecast.com/episodes/luigi-patruno (Luigi's interview on Datacast)
    https://mlinproduction.com/deploying-machine-learning-models/ (Ultimate Guide to Deploying ML Models)
    https://mlinproduction.com/maximizing-business-impact-with-machine-learning/ (Maximizing Business Impact with Machine Learning)
    https://mlinproduction.com/newsletter-083/ (Two Types of Companies Using ML)
    https://hackernoon.com/the-ai-hierarchy-of-needs-18f111fcc007 (The AI Hierarchy of Needs)
    https://www.mlengineered.com/episode/josh-tobin (Josh Tobin: Research at OpenAI, Full Stack Deep Learning, ML in Production)
    https://mlinproduction.com/machine-learning-is-forcing-software-development-to-evolve/ (Machine Learning is Forcing Software Development to Evolve)
    https://www.youtube.com/watch?v=fTvB5xMNfTY (ML Street Talk #29: GPT-3, Prompt Engineering, Trading, AI Alignment, Intelligence)
    https://www.oreilly.com/library/view/building-machine-learning/9781492045106/ (Building Machine Learning Powered Applications)
    https://www.penguinrandomhouse.com/books/529343/how-to-change-your-mind-by-michael-pollan/ (How to Change Your Mind)
    https://stevenpressfield.com/books/the-war-of-art/ (The War of Art)
    https://www.ynharari.com/book/sapiens-2/ (Sapiens)

  • Shawn Wang formerly worked in finance as a derivatives trader and equity analyst before burning out and pivoting towards tech. He's a prolific blogger who goes under the pseudonym "swyx" and recently published the excellent https://www.learninpublic.org/ (Coding Career Handbook). He's a graduate of Free Code Camp and Full Stack Academy now working at AWS as a Senior Developer Advocate.
    Learn more about Shawn:
    https://swyx.io/ (https://swyx.io/)
    https://www.learninpublic.org/ (https://www.learninpublic.org/)
    https://twitter.com/swyx (https://twitter.com/swyx)
    Every Thursday I send out the most useful things I’ve learned, curated specifically for the busy machine learning engineer. Sign up here: http://bitly.com/mle-newsletter (http://bitly.com/mle-newsletter)


    Follow Charlie on Twitter: https://twitter.com/CharlieYouAI (https://twitter.com/CharlieYouAI)
    Subscribe to ML Engineered: https://mlengineered.com/listen (https://mlengineered.com/listen)
    Comments? Questions? Submit them here: http://bit.ly/mle-survey (http://bit.ly/mle-survey)
    Take the Giving What We Can Pledge: https://www.givingwhatwecan.org/ (https://www.givingwhatwecan.org/)


    Timestamps:
    02:45 swyx is back!
    05:25 How his book has been received so far
    11:35 Why and how to negotiate your salary
    24:10 Getting started in public speaking, giving talks at meetups and conferences
    35:45 The role of luck in your career and how to create it
    51:15 Biggest is not best, best *for me *****is best
    59:20 Why swyx angel-invested in Circle
    01:12:00 On Randy Pausch's Time Management lecture
    01:18:00 Using open source to accelerate your coding skill
    01:20:00 Handling information overload and enhancing retention with note taking
    01:27:20 What swyx does in his job as a Developer Advocate and why you should consider non-coding roles
    01:37:30 swyx's new podcast Career Chats (https://careerchats.transistor.fm/ (https://careerchats.transistor.fm/))




    Links:
    https://www.mlengineered.com/episode/swyx (swyx's first ML Engineered appearance)
    https://www.learninpublic.org/ (swyx's book Coding Career Handbook)
    https://www.swyx.io/create_luck/ (How to Create Luck)
    https://www.swyx.io/time-management-randy-pausch/ (Notes on Time Management from a Dying Professor)
    https://www.buildingasecondbrain.com/ (Building a Second Brain)
    https://simplenote.com/ (SimpleNote)
    https://careerchats.transistor.fm/ (swyx's new podcast with Randall Kanna "Career Chats")

  • Yannic Kilcher is PhD candidate at ETH Zurich researching deep learning, structured learning, and optimization for large and high-dimensional data. He produces videos on his enormously popular Youtube channel breaking down recent ML papers.
    Follow Yannic on Twitter: https://twitter.com/ykilcher (https://twitter.com/ykilcher)
    Check out Yannic's excellent Youtube channel: https://www.youtube.com/channel/UCZHmQk67mSJgfCCTn7xBfew (https://www.youtube.com/channel/UCZHmQk67mSJgfCCTn7xBfew)
    Listen to the ML Street Talk podcast: https://podcasts.apple.com/us/podcast/machine-learning-street-talk/id1510472996 (https://podcasts.apple.com/us/podcast/machine-learning-street-talk/id1510472996)
    Every Thursday I send out the most useful things I’ve learned, curated specifically for the busy machine learning engineer. Sign up here: http://bitly.com/mle-newsletter (http://bitly.com/mle-newsletter)
    Follow Charlie on Twitter: https://twitter.com/CharlieYouAI (https://twitter.com/CharlieYouAI)
    Subscribe to ML Engineered: https://mlengineered.com/listen (https://mlengineered.com/listen)
    Comments? Questions? Submit them here: http://bit.ly/mle-survey (http://bit.ly/mle-survey)
    Take the Giving What We Can Pledge: https://www.givingwhatwecan.org/ (https://www.givingwhatwecan.org/)


    Timestamps:
    02:40 Yannic Kilcher
    07:05 Research for his PhD thesis and plans for the future
    12:05 How he produces videos for his enormously popular Youtube channel
    21:50 Yannic's research process: choosing what to read and how he reads for understanding
    27:30 Why ML conference peer review is broken and what a better solution looks like
    45:20 On the field's obsession with state of the art
    48:30 Is deep learning is the future of AI? Is attention all you need?
    56:10 Is AI overhyped right now?
    01:01:00 Community Questions
    01:13:30 Yannic flips the script and asks me about what I do
    01:25:30 Rapid fire questions


    Links:
    https://www.youtube.com/channel/UCZHmQk67mSJgfCCTn7xBfew (Yannic's amazing Youtube Channel)
    https://www.notion.so/Yannic-Kilcher-e93c81f81100464399e173867815e380 (Yannic's Google Scholar)
    https://discord.gg/4H8xxDF (Yannic's Community Discord Channel)
    On the Measure of Intelligence: https://arxiv.org/abs/1911.01547 (arXiv paper) and https://www.youtube.com/watch?v=3_qGrmD6iQY (Yannic's video series)
    https://www.youtube.com/watch?v=Uumd2zOOz60 (How I Read a Paper: Facebook's DETR (Video Tutorial))
    https://www.youtube.com/watch?v=TrdevFK_am4 (An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale (Paper Explained))
    https://fs.blog/2014/09/peter-thiel-zero-to-one/ (Zero to One)
    https://www.penguin.co.uk/books/104/1049544/the-gulag-archipelago/9781784871512.html (The Gulag Archipelago)

  • Zak Slayback is a principal at 1517 Fund, a venture capital fund that prioritizes working with dropouts. He wrote the excellent book "How to Get Ahead", one of my most recommended books on careers, and runs Get Ahead Labs where he teaches how to write outstanding cold emails.
    Learn more about Zak:
    https://zakslayback.com/ (https://zakslayback.com/)
    https://www.1517fund.com/ (https://www.1517fund.com/)
    Every Thursday I send out the most useful things I’ve learned, curated specifically for the busy machine learning engineer. Sign up here: http://bitly.com/mle-newsletter (http://bitly.com/mle-newsletter)


    Follow Charlie on Twitter: https://twitter.com/CharlieYouAI (https://twitter.com/CharlieYouAI)
    Subscribe to ML Engineered: https://mlengineered.com/listen (https://mlengineered.com/listen)
    Comments? Questions? Submit them here: http://bit.ly/mle-survey (http://bit.ly/mle-survey)
    Take the Giving What We Can Pledge: https://www.givingwhatwecan.org/ (https://www.givingwhatwecan.org/)


    Timestamps:
    02:35 Zak Slayback
    04:45 Using opportunity cost, signaling theory, and incentives to accelerate your career (https://zakslayback.com/frameworks-success-opportunity-cost/ (https://zakslayback.com/frameworks-success-opportunity-cost/))
    14:35 How to set career goals (https://zakslayback.com/ambition-mapping/ (https://zakslayback.com/ambition-mapping/))
    20:15 Rene Girard and Mimetic Desire
    24:30 The difference between a mentor, a coach/consultant, and an advisor (https://zakslayback.com/whats-difference-mentors-advisors-coaches/ (https://zakslayback.com/whats-difference-mentors-advisors-coaches/))
    35:40 Finding a mentor (https://zakslayback.com/professional-mentor-dream-job/ (https://zakslayback.com/professional-mentor-dream-job/))
    44:30 Fighting mental blocks against reaching out to potential mentors
    47:30 Why you should start a personal website (https://zakslayback.com/why-start-a-website/ (https://zakslayback.com/why-start-a-website/))
    56:15 What the most important "meta-skills" are and how to stack talents
    01:05:35 Most over-looked sections of the book
    01:09:00 The future of higher education: the new 95 theses from 1517 Fund (https://medium.com/1517/a-new-95-ec071200d98f (https://medium.com/1517/a-new-95-ec071200d98f))
    01:23:05 What Zak thinks the most exciting trends in technology are
    01:35:15 Rapid fire questions


    Links:
    https://www.nateliason.com/podcast/zak-slayback (The End of School and Building a Valuable Skillset with Zak Slayback)
    https://schoolsucksproject.com/deschool-yourself-find-your-focus-zak-slayback/ (Deschool Yourself and Find Your Focus – With Zak Slayback)
    https://zakslayback.com/book/ (Zak's book - How to Get Ahead (highly recommended!))
    https://zakslayback.com/ambition-mapping/ (Ambition Mapping)
    https://iep.utm.edu/girard/ (Rene Girard and Mimetic Desire)
    https://commoncog.com/blog/tacit-knowledge-is-a-real-thing/ (Why Tacit Knowledge is More Important Than Deliberate Practice)
    https://zakslayback.com/professional-mentor-dream-job/ (How to Get Your Dream Job and Mentor in 6 Easy Steps)
    https://zakslayback.com/whats-difference-mentors-advisors-coaches/ (What’s The Difference Between Mentors, Advisors, and Coaches?)
    https://zakslayback.com/keynote-video-get-ahead-nothing-offer/ (How to Get Ahead When You Have Nothing to Offer)
    https://zakslayback.com/why-start-a-website/ (“Why Should I Start a Website?”)
    https://zakslayback.com/frameworks-success-opportunity-cost/ (Frameworks for Making Better Decisions: Opportunity Cost)
    https://www.amazon.com/How-Fail-Almost-Everything-Still-ebook/dp/B00COOFBA4 (How to Fail at Almost Everything and Still Win Big)
    https://medium.com/1517/a-new-95-ec071200d98f (The New 95 Theses)
    https://www.penguinrandomhouse.com/books/176227/antifragile-by-nassim-nicholas-taleb/ (Antifragile)

  • Letitia Parcalabescu is a PhD candidate at the University of Heidelberg focused on multi-modal machine learning, specifically with vision and language.
    Learn more about Letitia:
    https://www.cl.uni-heidelberg.de/~parcalabescu/ (https://www.cl.uni-heidelberg.de/~parcalabescu/)
    https://www.youtube.com/channel/UCobqgqE4i5Kf7wrxRxhToQA (https://www.youtube.com/channel/UCobqgqE4i5Kf7wrxRxhToQA)
    Every Thursday I send out the most useful things I’ve learned, curated specifically for the busy machine learning engineer. Sign up here: http://bitly.com/mle-newsletter (http://bitly.com/mle-newsletter)
    Follow Charlie on Twitter: https://twitter.com/CharlieYouAI (https://twitter.com/CharlieYouAI)
    Take the Giving What We Can Pledge: https://www.givingwhatwecan.org/ (https://www.givingwhatwecan.org/)
    Subscribe to ML Engineered: https://mlengineered.com/listen (https://mlengineered.com/listen)
    Comments? Questions? Submit them here: http://bitly.com/mle-survey (http://bitly.com/mle-survey)
    Timestamps:
    01:30 Follow Charlie on Twitter (https://twitter.com/CharlieYouAI (https://twitter.com/CharlieYouAI))
    02:40 Letitia Parcalabescu
    03:55 How she got started in CS and ML
    07:20 What is multi-modal machine learning? (https://www.youtube.com/playlist?list=PLpZBeKTZRGPNKxoNaeMD9GViU_aH_HJab (https://www.youtube.com/playlist?list=PLpZBeKTZRGPNKxoNaeMD9GViU_aH_HJab))
    16:55 Most exciting use-cases for ML
    20:45 The 5 stages of machine understanding (https://www.youtube.com/watch?v=-niprVHNrgI (https://www.youtube.com/watch?v=-niprVHNrgI))
    23:15 The future of multi-modal ML (GPT-50?)
    27:00 The importance of communicating AI breakthroughs to the general public
    37:40 Positive applications of the future “GPT-50”
    43:35 Letitia’s CVPR paper on phrase grounding (https://openaccess.thecvf.com/content_CVPRW_2020/papers/w56/Parcalabescu_Exploring_Phrase_Grounding_Without_Training_Contextualisation_and_Extension_to_Text-Based_CVPRW_2020_paper.pdf (https://openaccess.thecvf.com/content_CVPRW_2020/papers/w56/Parcalabescu_Exploring_Phrase_Grounding_Without_Training_Contextualisation_and_Extension_to_Text-Based_CVPRW_2020_paper.pdf))
    53:15 ViLBERT: is attention all you need in multi-modal ML? (https://arxiv.org/abs/1908.02265 (https://arxiv.org/abs/1908.02265))
    57:00 Preventing “modality dominance”
    01:03:25 How she keeps up in such a fast-moving field
    01:10:50 Why she started her AI Coffee Break YouTube Channel (https://www.youtube.com/c/AICoffeeBreakwithLetitiaParcalabescu/ (https://www.youtube.com/c/AICoffeeBreakwithLetitiaParcalabescu/))
    01:18:10 Rapid fire questions
    Links:
    https://www.youtube.com/channel/UCobqgqE4i5Kf7wrxRxhToQA (AI Coffee Break Youtube Channel)
    https://openaccess.thecvf.com/content_CVPRW_2020/papers/w56/Parcalabescu_Exploring_Phrase_Grounding_Without_Training_Contextualisation_and_Extension_to_Text-Based_CVPRW_2020_paper.pdf (Exploring Phrase Grounding without Training)
    https://www.youtube.com/playlist?list=PLpZBeKTZRGPNKxoNaeMD9GViU_aH_HJab (AI Coffee Break series on Multi-Modal learning)
    https://www.youtube.com/watch?v=-niprVHNrgI (What does it take for an AI to understand language?)
    https://arxiv.org/abs/1908.02265 (ViLBERT: Pretraining Task-Agnostic Visiolinguistic Representations)

  • Moin Nadeem is a masters student at MIT, where he studies natural language generation. His research interests broadly include natural language processing, information retrieval, and software systems for machine learning.
    Learn more about Moin:
    https://moinnadeem.com/ (https://moinnadeem.com/)
    https://twitter.com/moinnadeem (https://twitter.com/moinnadeem)
    Want to level-up your skills in machine learning and software engineering? Join the ML Engineered Newsletter: http://bit.ly/mle-newsletter (http://bit.ly/mle-newsletter)
    Comments? Questions? Submit them here: http://bit.ly/mle-survey (http://bit.ly/mle-survey)
    Follow Charlie on Twitter: https://twitter.com/CharlieYouAI (https://twitter.com/CharlieYouAI)
    Take the Giving What We Can Pledge: https://www.givingwhatwecan.org/ (https://www.givingwhatwecan.org/)
    Subscribe to ML Engineered: https://mlengineered.com/listen (https://mlengineered.com/listen)
    Timestamps:
    01:35 Follow Charlie on Twitter (https://twitter.com/CharlieYouAI (https://twitter.com/CharlieYouAI))
    03:10 How Moin got started in computer science
    05:50 Using ML to identify depression on Twitter in high school
    11:00 Building a system to track phone locations on MIT’s campus
    14:35 Specializing in NLP
    17:20 Building an end-to-end fact-checking system (https://www.aclweb.org/anthology/N19-4014/ (https://www.aclweb.org/anthology/N19-4014/))
    25:15 Predicting statement stance with neural multi-task learning (https://www.aclweb.org/anthology/D19-6603/ (https://www.aclweb.org/anthology/D19-6603/))
    27:20 Is feature engineering in NLP dead?
    29:40 Reconciling language models with existing knowledge graphs
    35:20 How advances in AI hardware will affect NLP research (crazy!)
    47:25 Moin’s research into sampling algorithms for natural language generation (https://arxiv.org/abs/2009.07243 (https://arxiv.org/abs/2009.07243))
    57:10 Under-rated areas of ML research
    01:00:10 How research works at MIT CSAIL
    01:04:35 How Moin keeps up in such a fast-moving field
    01:11:30 Starting the MIT Machine Intelligence Community
    01:16:30 Rapid Fire Questions


    Links:
    https://www.aclweb.org/anthology/N19-4014/ (FAKTA: An Automatic End-to-End Fact Checking System)
    https://stereoset.mit.edu/ (StereoSet: Measuring stereotypical bias in pretrained language models)
    https://www.aclweb.org/anthology/D19-6603/ (Neural Multi-Task Learning for Stance Prediction)
    http://www.incompleteideas.net/IncIdeas/BitterLesson.html (Rich Sutton - The Bitter Lesson)
    https://arxiv.org/abs/2009.07243 (A Systematic Characterization of Sampling Algorithms for Open-ended Language Generation)
    https://arxiv.org/abs/1905.12265 (Strategies for Pre-training Graph Neural Networks)
    https://openreview.net/pdf?id=YicbFdNTTy (Transformers For Image Recognition at Scale)
    https://www.cerebras.net/product/ (Cerebras CS-1)
    https://www.tryklarity.com/ (Klarity: AI for Law Contract Review)
    https://www.mit.edu/~jda/ (Jacob Andreas)
    https://cs.stanford.edu/people/jure/ (Jure Leskovec)
    https://www.simonandschuster.com/books/Shoe-Dog/Phil-Knight/9781501135927 (Shoe Dog)
    https://en.m.wikipedia.org/wiki/Alexander_Hamilton_(book) (Hamilton)
    https://becomingmichelleobama.com/ (Becoming)
    https://www.penguinrandomhouse.com/books/44330/mindset-by-carol-s-dweck-phd/ (Mindset)
    https://en.m.wikipedia.org/wiki/The_Innovators_(book) (The Innovators)