Tuesday Tip #21: Find the perfect dataset for your next project 🎯


Hi Reader, let’s get straight to the tip this week!


👉 Tip #21: Five sources for interesting datasets

Let’s say that you need to find a dataset for a Data Science project. Perhaps this is a project for school, or a practice project to build up your portfolio and showcase your skills.

Where should you look? Here are 5 sources I recommend checking out:

  1. Kaggle Datasets: It’s fun to browse, and the upvoting system makes it easy to discover higher-quality datasets. Also, its Data Explorer lets you see a preview of the raw data.
  2. Data Is Plural: This is a fascinating weekly newsletter (since 2015!) that highlights “useful/curious datasets.” Search its archive via a Google Sheet or web app.
  3. Awesome Public Datasets: A gigantic list (on GitHub) of high-quality datasets grouped by topic.
  4. Data.gov: Open data from the US government. It’s huge, well-organized, and more interesting than you would think!
  5. Google Dataset Search: This is a great way to search for a dataset, especially if you already have a specific topic in mind. Also, the autocomplete feature is quite nice!
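
Once you’ve downloaded a dataset from any of these sources, it’s worth peeking at the first few rows before you commit to it. Here’s a minimal sketch in Python, assuming the dataset is a CSV file. The preview_csv function and the sample data are made up for illustration, and they aren’t tied to any of the sources above:

```python
import csv
import io


def preview_csv(source, n_rows=5):
    """Return the header and the first n_rows data rows from an open CSV file."""
    reader = csv.reader(source)
    header = next(reader, [])  # empty list if the file has no rows at all
    rows = []
    for row in reader:
        if len(rows) >= n_rows:
            break
        rows.append(row)
    return header, rows


# Made-up stand-in for a downloaded file, so the sketch runs as-is.
# With a real file you would pass open("your_file.csv", newline="") instead.
sample = io.StringIO("name,score\nA,1\nB,2\nC,3\n")

header, rows = preview_csv(sample, n_rows=2)
print(header)
print(rows)
```

If the column names and first few rows look promising, that’s usually a good sign the dataset is worth a closer look.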

Want even more options? Sebastian Raschka compiled this list of dataset repositories for Machine Learning and Deep Learning.


If you enjoyed this week’s tip, please forward it to a friend! Takes only a few seconds, and it really helps me out! 🙌

See you next Tuesday!

- Kevin

P.S. If toddlers had lawyers (video)

Did someone awesome forward you this email? Sign up here to receive data science tips every week!
