Tuesday Tip #34: What are conda, Anaconda, and Miniconda? 🐍


Hi Reader,

Soon it will be winter break for my 6-year-old, so this is going to be my last Tuesday Tip of the year! β›„


πŸ‘‰ Tip #34: What's the difference between conda, Anaconda, and Miniconda?

If you've ever taken one of my courses, you may have noticed that I frequently recommend the Anaconda distribution of Python.

You might be left wondering:

  • What is the Anaconda distribution, and why do people recommend it?
  • How is it related to conda?
  • How is it related to Miniconda?
  • As a data scientist, which of these do I need to be familiar with?

I'll answer those questions below! πŸ‘‡


What is Anaconda?

​Anaconda is a Python distribution aimed at data scientists that includes 250+ packages (with easy access to 7,500+ additional packages). Its value proposition is that you can download it (for free) and "everything just works." It's available for Mac, Windows, and Linux.

A new Anaconda distribution is released a few times a year. Within each distribution, the versions of the included packages have all been tested to work together.

If you visit the installation page for many data science packages (such as pandas), they recommend Anaconda because it makes installation easy!


What is conda?

​conda is an open source package and environment manager that comes with Anaconda.

As a package manager, you can use conda to install, update, and remove packages and their "dependencies" (the packages they depend upon):

  • If Anaconda doesn't include a package that you need, you use conda to download and install it.
  • If Anaconda doesn’t have the version of a package you need, you use conda to update it.

As an environment manager, you can use conda to manage virtual environments:

  • If you're not familiar with virtual environments, they allow you to maintain isolated environments with different packages and versions of those packages.
  • conda is an alternative to virtualenv, pipenv, and other related tools.

conda has a few huge advantages over other tools:

  • It’s a single tool to learn, rather than using multiple tools to manage packages, environments, and Python versions.
  • Package installation is predictably easy because you’re installing pre-compiled binaries.
  • Unlike pip, you never need to build from source code, which can be especially difficult for some data science packages.
  • You can use conda with languages other than Python.

What is Miniconda?

​Miniconda is a Python distribution that only includes Python, conda, their dependencies, and a few other useful packages.

Miniconda is a great choice if you prefer to only install the packages you need, and you're sufficiently familiar with conda. (Here's how to choose between Anaconda and Miniconda.)


Summary:

  • ​Anaconda and Miniconda are both Python distributions.
  • Anaconda includes hundreds of packages, whereas Miniconda includes just a few.
  • ​conda is an open source tool that comes with both Anaconda and Miniconda, and it functions as both a package manager and an environment manager.

Personally, I make extensive use of conda for creating environments and installing packages. And since I'm comfortable with conda, I much prefer Miniconda over Anaconda.

Would you be interested in taking a short course about conda? Reply and let me know! πŸ’Œ


If you enjoyed this week’s tip, please forward it to a friend! Takes only a few seconds, and it really helps me reach more people!

I'll see you again in January! πŸ‘‹

- Kevin

P.S. Christmas decorating injuries πŸŽ„

Did someone awesome forward you this email? Sign up here to receive Data Science tips every week!

Learn Artificial Intelligence from Data School πŸ€–

Join 25,000+ intelligent readers and receive AI tips every Tuesday!

Read more from Learn Artificial Intelligence from Data School πŸ€–

Hi Reader, A reader asked me the following question: I am now looking towards a new career. Machine Learning is what I've always found very interesting and fascinating. However, I'd like to ask you: Is there really future for this stuff with all the buzz about LLMs becoming more and more capable all the time? Would time be spent well on learning all this stuff? Read this article online Excellent question! As someone who just published a book on Machine Learning, I clearly believe there is...

Hi Reader, the response to my new Machine Learning book has been outstanding! 🀩 My goal with this book is to reach as many people as possible, which is why I’ve made it free to read online as well as keeping the paperback and ebook prices as low as possible. I’m confident it would be an invaluable resource for any Machine Learning bootcamp or course, since it's a highly practical guide as opposed to focusing mostly on theory. Do you have a personal contact at any bootcamp or university where...

Hi Reader, I'm thrilled to announce that my new book, Master Machine Learning with scikit-learn, is now on sale! Buy from Amazon I poured my heart and soul into making this the highest quality and most practical Machine Learning book available. Publishing this book is a dream come true, and I'd be grateful if you'd consider picking up a copy! πŸ™ Option 1: Get the paperback from Amazon ($19) Although most technical books of this size (300+ pages) tend to sell for at least $39, I've priced the...