Learn Data Science from Data School 📊

Kevin Markham

Join 25,000+ aspiring Data Scientists and receive Python & Data Science tips every Tuesday!

Featured Post

Tuesday Tip #46: How to prevent data leakage in pandas & scikit-learn ☔

Hi Reader, I'm really proud of this week's tip because it covers a topic (data leakage) that took me years to fully understand. 🧠 It's one of those times when I feel like I'm truly contributing to the collective wisdom by distilling complex ideas into an approachable format. 💡 You can read the tip below 👇 or on my blog. 🔗 Link of the week Building an AI Coach to Help Tame My Monkey Mind (Eugene Yan) In this short post, Eugene describes his experiences calling an LLM on the phone for coaching:...

about 2 months ago • 4 min read

Hi Reader, Last week, I recorded the FINAL 28 LESSONS 🎉 for my upcoming course, Master Machine Learning with scikit-learn. That's why you didn't hear from me last week! 😅 I edited one of those 28 videos and posted it on YouTube. That video is today's tip, which I'll tell you about below! 👉 Tip #45: How to read the scikit-learn documentation In order to become truly proficient with scikit-learn, you need to be able to read the documentation. In this video lesson, I’ll walk you through the five...

about 2 months ago • 1 min read

Hi Reader, happy Tuesday! My recent tips have been rather lengthy, so I'm going to mix it up with some shorter tips (like today's). Let me know what you think! 💬 🔗 Link of the week A stealth attack came close to compromising the world's computers (The Economist) If you haven't heard about the recent "xz Utils backdoor", it's an absolutely fascinating/terrifying story! In short, a hacker (or team of hackers) spent years gaining the trust of an open-source project by making helpful...

2 months ago • 1 min read

Hi Reader, Today's tip is drawn directly from my upcoming course, Master Machine Learning with scikit-learn. You can read the tip below or watch it as a video! If you're interested in receiving more free lessons from the course (which won't be included in Tuesday Tips), you can join the waitlist by clicking here: Yes, I want more free lessons! 👉 Tip #43: Should you discretize continuous features for Machine Learning? Let's say that you're working on a supervised Machine Learning problem, and...

3 months ago • 2 min read

Hi Reader, I'm so excited to share this week's tip with you! It has been in my head for months, but I finally put it in writing ✍️ It's longer than usual, so if you prefer, you can read it as a blog post instead: Jupyter & IPython terminology explained 🔗 Link of the week Python Problem-Solving Bootcamp (April 1-21) Want to improve your Python skills quickly? There's no better way than solving problems, reviewing alternative solutions, and exchanging ideas with others. That's the idea behind...

3 months ago • 3 min read

Hi Reader, I just published a new blog post, Get started with conda environments. If you’re new to virtual environments in Python, give it a read! Once you start using virtual environments, you’ll wonder how you ever got along without them! 🔗 Link of the week Yann LeCun on the future of AI (Lex Fridman interview) Yann LeCun is one of the “godfathers of Deep Learning”, the Chief AI Scientist at Meta, and (in my opinion) one of the clearest and most convincing thinkers on the future of AI. It’s...

3 months ago • 1 min read

Hi Reader, In case you missed it, I launched a free, 7-hour pandas course! 800+ students have enrolled, and a few have already earned their certificate of completion 👩‍🎓 🔗 Link of the week Data Internships Looking for an internship in Data Science or Analytics? This site curates the latest internship postings and emails them to you each week! 👉 Tip #40: Build a DataFrame from multiple files Let’s say that your dataset is spread across multiple files, but you want to read the dataset into a...

4 months ago • 1 min read

Hi Reader, There's a gift for you somewhere in this email... just look for the 🎁 emoji! 👉 Tip #39: Six quick Python tricks Here's what I'll cover below: Return the number of unique values Count values with Counter Better debugging with f-strings Return multiple values from a function Count while looping Create a dictionary with a comprehension Let's get started! 👇 1️⃣ Return the number of unique values Need to know the number of unique values in an iterable? Convert it to a set and check the...

4 months ago • 2 min read
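
The six tricks named in the Tip #39 excerpt above can be sketched roughly as follows. This is only an illustrative guess at each idea, not the code from the original tip: the excerpt is cut off after the first trick, so the sample data, the `min_and_max` helper, and the choice of `len()`, `enumerate`, and the f-string `=` specifier (Python 3.8+) are all assumptions.

```python
from collections import Counter

# Hypothetical sample data, used only for illustration.
colors = ["red", "blue", "red", "green", "blue", "red"]

# 1. Number of unique values: convert to a set and take its length.
num_unique = len(set(colors))

# 2. Count values with Counter (maps each value to its frequency).
counts = Counter(colors)

# 3. Better debugging with f-strings: the "=" specifier (Python 3.8+)
#    renders both the expression text and its value.
debug_line = f"{num_unique=}"

# 4. Return multiple values from a function (they are packed into a tuple).
def min_and_max(values):
    return min(values), max(values)

lowest, highest = min_and_max([3, 1, 4, 1, 5])

# 5. Count while looping: enumerate yields (counter, item) pairs.
numbered = [f"{i}: {color}" for i, color in enumerate(colors[:3], start=1)]

# 6. Create a dictionary with a comprehension.
lengths = {color: len(color) for color in set(colors)}
```

Each trick relies only on built-ins or the standard library, so the sketch runs without installing anything.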

Hi Reader, My goal with Tuesday Tips is to help you get better at Data Science every week. Is there anything that would make these tips even more helpful for you? Let me know! 💬 You can find past tips at tuesday.tips. (Yes, that’s a real URL!) 👉 Tip #38: Five ways to rename your DataFrame columns Let's say that we have a simple pandas DataFrame: I prefer to use dot notation to select pandas columns, but that won't work since the column names have spaces. Let's fix this! The most flexible...

4 months ago • 1 min read

Hi Reader, How's your January going? I've been playing a lot of Pokémon (the card game) with my 7-year-old... actually I'm the one who's always bugging him to play 😂 Are you a Pokémon fan? Meowscarada ex, anyone? 🔗 Link of the week Introduction to Polars (Practical Business Python) Have you heard of Polars? In short, it's a high-performance, memory-efficient alternative to pandas. If you're new to Polars, this blog post walks through basic Polars code and compares it to pandas. 👉 Tip #37:...

5 months ago • 1 min read