Using conda to manage your own Python environments

Body

Background

Conda is an open source system for managing Python environments.  A python environment is a version of Python and some associated Python packages.  We highly recommend that researchers create Python environments for projects because it will give them a stable and reproducible place to run their code.   With conda, you can do things like

  • install the packages you need without relying on an administrator
  • let conda worry about pulling in other packages that a package you want depends upon
  • avoid changes to packages because someone else on the system requested them
  • maintain different environments for different needs
  • clone an environment to experiment with before risking changes to your production environment
  • export an environment file that can be use to recreate the environment on a different system

Common conda commands

We are only listing a few of the more useful commands here.  You can add “--help” to the end of any conda command to get some help on how to use it.  See https://conda.io/projects/conda/en/latest/ for complete documentation.

  • conda create – make a new environment 
  • conda env remove – remove an existing environment
  • conda info --envs – show environments
  • conda install – add a package to an environment
  • conda remove – remove a package from an environment
  • conda list – show packages installed in an environment
  • conda search – show available packages
  • conda activate – activate (use) an environment
  • conda deactivate – deactivate an environment

Creating your first conda environment

First enable the conda command.  We recommend putting this into your .bashrc file so it happens automatically every time you login.

$ source /optnfs/common/miniconda3/etc/profile.d/conda.sh

There is a bug in the conda command requiring a one time work-around.  If you already have a .conda directory in your home this is unnecessary but it won't hurt anything if you run it again.

$ cd
$ mkdir -p .conda/pkgs/cache .conda/envs

When creating a Python environment, you'll definitely want your own Python in it so specify a version at creation time.  That is done with the conda create command and a python=version argument.  version can be

  • nothing - you'll get the latest version of Python available in the conda respository
  • the major version (i.e. 2 or 3) - you will get the latest available release of that major version
  • a specific version (e.g. 3.4.3) - you will get exactly that version

Other packages work the same way.  You may specify multiple packages when you create the environment and of course can also add packages later.  Here is an example which creates an environment named mytest that has python 3.7.2 and the latest version of the numpy package.

$ conda create --name mytest python=3.9.5 numpy

Be sure to activate your new environment before trying to use it!

$ conda activate mytest

Anaconda environments

Anaconda is a special "meta" package which includes hundreds of other packages.  Anaconda environments are quite common in the research world.  Creating one is no different than creating any other environment.  Pick a version of Python and also include the anaconda meta package.  Like this

$ conda create --name my_anaconda3_env python=3 anaconda

Be sure to activate your new environment before using it

$ conda activate my_anaconda3_env

Installing conda packages into an existing environment

You use the "conda install" command to do this.

Important: You must activate an environment before installing packages in it (or specify the environment as part of the install command with --name env).

Package specifications work the same way as for "conda create".  You can request multiple packages on the command line and specify (or not) a version for each one.  Here is an example of installing a specific version of the numpy package and whatever is current for the ldap3 package into the "myenv" environment.  Note: part of what conda does is to figure out dependencies so you would also be installing a couple dozen packages required by numpy and ldap3 if you run this command.

$ conda activate myenv
$ conda install numpy=1.16.2 ldap3

How do I see what packages are available?

Use the "conda search" command for this.

To see every version of the package, provide only the package name.  This often gives a very long list. Here is an example searching for the ldap3 package.

$ conda search ldap3
# Name                       Version           Build  Channel             
ldap3                        0.9.8.4          py26_0  pkgs/free           
ldap3                        0.9.8.4          py27_0  pkgs/free           
ldap3                        0.9.8.4          py33_0  pkgs/free
[...]

You can tell conda to return only packages which will work with a particular Python version. Here is an example searching for the ldap3 package but only if it works with Python 3.7.  Note: the '*' in the example command is a wildcard for the ldap3 package version.  You could also specify a particular version of the ldap3 package  though that isn't usually what people want.

$ conda search ldap3=*=py37_0
# Name                       Version           Build  Channel             
ldap3                            2.5          py37_0  pkgs/main           
ldap3                          2.5.1          py37_0  pkgs/main

You can also use wildcards in the package name if you aren't sure what it is called.  This example shows all packages that have "ldap" in the name.

$ conda search *ldap*
# Name                       Version           Build  Channel             
flask-ldap-login               0.3.0          py27_0  pkgs/free           
flask-ldap-login               0.3.0          py27_1  pkgs/free           
ldap3                        0.9.8.4          py26_0  pkgs/free
[...]

What if the package I want isn't available through conda?

First of all, it may be.  There are more conda package repositories (called channels) than the ones we have configured for you by default.  For example, if you wanted the "ads" package, it's available in the conda-forge channel.  You could install it like this.

$ conda install --channel conda-forge ads

If you cannot find the package distributed via conda then it is possible to use other kinds of Python packages like "pip", or "easy_install", or "python setup.py" but conda will not know their dependencies so if you update the conda packages in an environment that mixes conda packages with other packaging styles there is a risk of breaking those others (until you manually update them).  In the case of pip, make sure you add the "--user" option so it installs to your home directory.

Details

Details

Article ID: 72888
Created
Fri 3/1/19 2:14 PM
Modified
Wed 4/5/23 9:57 PM

Related Articles

Related Articles (3)

This article describes how to enable a centrally maintained installation of Anaconda/Python.
Conda is a little tricky when it comes to shared storage, especially with permissions in a Kerberized environment. This article describes one way to make it work.