Hi Reader,
Last week, I released a 3-hour video, My top 50 scikit-learn tips.
I also finished Chapter 10 of my next ML course, which I'll tell you about once all 20 chapters are done 😅
Anyway, let’s get to today’s tip!
For fun, I’ve been building an interactive dashboard using pandas, Plotly Express, and Shiny for Python. (Check out a screenshot here.)
The goal is to help me analyze sales of my online courses. And since I’m working with sales data, I’m reminded of how much I love the pandas resample function!
Let’s see an example of how to resample 😉
Pretend you have a DataFrame of sales data that looks like this:
You might ask: What are my total sales for each product?
In that case, you would use groupby:
You can read that code as: For each Product, this is the sum of the Sale column.
Another similar question you might ask is: What are my total sales for each day?
Instead of groupby, you would use resample, which I think of as “groupby for time series data”:
You can read that code as: For each day, this is the sum of the Sale column.
(Notice that it inserted 2023-03-31 with a value of 0, since there were no sales on that day.)
By changing the 'D' to an 'M', you can resample by month instead:
'D' and 'M' are known as the “offset alias”, and there are many other offset aliases you can use.
Finally, let’s say that the index is not a datetime column:
In that case, you need to use the 'on' parameter to specify the datetime column:
If you work with time series data, I bet you’ll find a use for resample!
If you enjoyed this week’s tip, please forward it to a friend! Takes only a few seconds, and it really helps me out! 🙌
See you next Tuesday!
- Kevin
P.S. What’s the worst volume control interface? (my favorite is #12)
Did someone awesome forward you this email? Sign up here to receive data science tips every week!
Join 25,000+ intelligent readers and receive AI tips every Tuesday!
Hi Reader, Last week, I encouraged you to experiment with different LLMs, since there’s no one model that is superior across all use cases. Specifically, I suggested you try using Chatbot Arena, which allows you to chat with multiple models at once. It’s completely free, but has two significant disadvantages: Your chats are not private and may be used for research. It lacks the feature-rich interface provided by other LLMs. Today, I want to offer you a better method for experimenting with...
Hi Reader, Over the past 50 tips, I’ve touched on many different topics: Python, Jupyter, pandas, ML, data visualization, and so on. Going forward, I’m planning to focus mostly on Artificial Intelligence. I’m announcing this so you know what to expect, and I know what to deliver! 💌 I’ll also try to make the tips shorter, so that they're easier to digest on-the-go. Finally, I plan to include an “action item” each week, so that you can practice what you’re learning. I hope you like these...
Hi Reader, Next week, I’ll be offering a Black Friday sale on ALL of my courses. I’ll send you the details this Thursday! 🚨 👉 Tip #50: What is a "method" in Python? In Python, a method is a function that can be used on an object because of the object's type. For example, if you create a Python list, the "append" method can be used on that list. All lists have an "append" method simply because they are lists: If you create a Python string, the "upper" method can be used on that string simply...