Managing R and RStudio with conda
There was once a time when conda wasn’t really a viable way to manage R environments, but thanks to the work of many, that time has mostly passed. Now it’s generally easier, and better for reproducibility, to create a new environment with the wanted version of R and any needed packages, rather than installing a system-wide R and working in there, hoping no conflicts arise. This page covers a little bit of how we can do that.
If you aren’t already familiar with conda, I recommend running through the intro to conda page first. Having conda installed as demonstrated there is required to follow along below 🙂
Setting up a conda environment with R and RStudio
We can check which versions of R are currently available to be installed through conda (using mamba here, as described on the conda intro page) with the following:
mamba search -c conda-forge r-base
At the time of putting this page together, 4.1.3 was the latest version available for my system, so I’m going to specify that one.
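If the search prints a long table, a small helper can pull out the highest version for us. This is just a minimal sketch: latest_version is a made-up helper name, and it assumes the search output is a table with the package name in the first column and the version in the second (the exact layout can differ between conda/mamba versions). The sample rows below are invented purely to demonstrate it.

```shell
# latest_version PACKAGE: read search output on stdin, print the highest version.
# Assumption: column 1 is the package name, column 2 is the version.
latest_version() {
    awk -v pkg="$1" '$1 == pkg { print $2 }' | sort -V | tail -n 1
}

# Intended use, in a shell where mamba is installed:
#   mamba search -c conda-forge r-base | latest_version r-base

# Demonstration on a few invented rows in that layout:
sample='r-base 4.0.5 example_0 conda-forge
r-base 4.1.3 example_0 conda-forge
r-base 4.1.0 example_0 conda-forge'
printf '%s\n' "$sample" | latest_version r-base
```

Using sort -V (version sort) rather than a plain sort matters once a version like 4.10 shows up, since a plain text sort would place it before 4.9.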
I usually let it figure out whichever RStudio version it wants to use. I also always install the tidyverse package, because it’s awesome, and I let it select whichever version it wants for that too, unless it matters to me. So I would typically create a new environment with R and RStudio like so:
mamba create -n R-4.1.3 -y -c conda-forge r-base=4.1.3 rstudio r-tidyverse
Breakdown
mamba create – our base command
-n – here is where we provide the name we want the environment to have, in this case naming it R with the version following
-y – says not to ask us for any confirmation
-c conda-forge – here is where we specify the one channel we need for the particular packages we are trying to install
positional arguments – the trailing positional arguments say which packages/versions to install
    r-base=4.1.3 – tells it we want the r-base package and which version
    rstudio and r-tidyverse – the remaining packages I wanted to install while creating the environment, with no versions specified because I wasn’t worried about them in this case
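If we expect to make environments like this for more than one R version, the same pieces can be assembled from a couple of variables. This is only a sketch: the variable names (r_version, env_name, tool, create_cmd) are my own, and it just prints the command so we can look it over before running anything.

```shell
# Hypothetical variable names; set the version to one listed by the search above
r_version="4.1.3"
env_name="R-${r_version}"

# Prefer mamba when it is on the PATH, otherwise fall back to conda
if command -v mamba >/dev/null 2>&1; then
    tool="mamba"
else
    tool="conda"
fi

# Same flags and packages as the command above, with the version filled in
create_cmd="${tool} create -n ${env_name} -y -c conda-forge r-base=${r_version} rstudio r-tidyverse"

# Print the command for inspection; copy and run it to create the environment
echo "${create_cmd}"
```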
Opening RStudio in our conda environment
Once we’ve created the above, we just need to activate that conda environment:
conda activate R-4.1.3
Then, once in the environment, we can execute rstudio to open an RStudio window within that conda environment (see the notes below if this doesn’t successfully open RStudio):
rstudio
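If the rstudio command isn’t found at all, it’s worth confirming the environment is really active before digging into anything else. Here is a minimal sketch of such a check. check_r_env is a made-up helper name; it relies on the CONDA_DEFAULT_ENV variable that conda sets to the active environment’s name on activation.

```shell
# check_r_env ENV_NAME: report whether ENV_NAME is the active conda environment
# and whether an rstudio executable is on the PATH. (Hypothetical helper.)
check_r_env() {
    if [ "${CONDA_DEFAULT_ENV:-}" != "$1" ]; then
        echo "inactive: run 'conda activate $1' first"
        return 1
    fi
    if ! command -v rstudio >/dev/null 2>&1; then
        echo "active, but rstudio is not on the PATH"
        return 2
    fi
    echo "ready: rstudio is $(command -v rstudio)"
}

# Check the environment created on this page
check_r_env R-4.1.3 || true
```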
Windows NOTE
If running conda on a Windows computer, and the above failed to launch RStudio, running the following and trying again has worked for some folks (these are apt-get commands, so they apply when conda is running inside a Debian/Ubuntu-based Linux environment on Windows, such as WSL):
sudo apt-get update
sudo apt-get install libgl1-mesa-glx libegl1-mesa libxrandr2 libxss1 libxcursor1 libxcomposite1 libasound2 libxi6 libxtst6
Mac NOTE
You may need to install the Xcode command line tools for RStudio to open properly. To start that installation, execute this from the terminal:
xcode-select --install
An installation window should pop up. After that is installed, try running the rstudio command again.
Finding a conda installation for a specific R library we want
To find the specific package names needed for conda, I’ll typically go to anaconda.org and do a search for the package I’m looking for.
For example, searching for “tidyverse” there brings up many hits for packages called “r-tidyverse” available from different channels. I then look at the number of downloads for each, and typically prefer the conda-forge channel, as I know it’s generally well-used and well-curated. That’s why conda-forge is the only channel specified in the command above.
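It also helps to know the naming pattern when searching. R packages on conda-forge are generally named r- followed by the lowercase package name, and Bioconductor packages on bioconda are generally named bioconductor- followed by the lowercase package name. Here is a small sketch of that convention. conda_pkg_name is a made-up helper, and it only guesses a name to search for; it doesn’t check that the package actually exists.

```shell
# conda_pkg_name NAME [cran|bioc]: print the conventional conda package name.
# (Hypothetical helper; it only builds a name, it does not query any channel.)
conda_pkg_name() {
    lower=$(printf '%s' "$1" | tr '[:upper:]' '[:lower:]')
    case "${2:-cran}" in
        bioc) printf 'bioconductor-%s\n' "$lower" ;;
        *)    printf 'r-%s\n' "$lower" ;;
    esac
}

conda_pkg_name tidyverse      # prints r-tidyverse
conda_pkg_name DESeq2 bioc    # prints bioconductor-deseq2
```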
Installing more R libraries
Libraries can be installed in the R conda environment the way we normally would from inside R (meaning no conda involved), but I have occasionally run into problems when doing things that way. Because of this, I generally try to find whatever R libraries I want as conda-installable packages and install them that way when I’m working in a conda-created R environment. So that would be my first recommendation. For instance, searching for dada2 on anaconda.org reveals it is installable through the bioconda channel.
So in the terminal, not in R, I would install this package with conda, while in the active R-4.1.3 environment we created above:
mamba install -y -c conda-forge -c bioconda -c defaults bioconductor-dada2
NOTE
Notice we provide all 3 of these channels in that order to install something from bioconda. The section on channels in the conda intro page covers this.
Then in R within this conda environment, we are able to load that library as usual with:
library(dada2)
If there isn’t a conda-prepared package for a specific R library I want, then I’d move on to trying to do the installation the recommended way for that library inside R.
BONUS: Creating a conda environment with Jupyter Lab and an R kernel
Here is one way we can create a conda environment that has an R kernel in Jupyter Lab:
mamba create -n jupyter-lab -y -c conda-forge r-base=4.1.3 rstudio r-tidyverse jupyterlab r-irkernel
Then when we activate that environment, we can launch Jupyter Lab by executing jupyter lab at our command line. And we’ll see we can select an R kernel for notebooks there 👍
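If the R kernel doesn’t show up in the launcher, we can check from the terminal whether Jupyter sees it. The jupyter kernelspec list command prints the registered kernels, and the kernel installed by IRkernel is named ir by default. Below is a minimal sketch of that check. has_r_kernel is a made-up helper name, and the sample listing (including its paths) is invented just to show the expected layout.

```shell
# has_r_kernel: succeed if the kernel list on stdin has a kernel named "ir".
# (Hypothetical helper; "ir" is the default kernel name used by IRkernel.)
has_r_kernel() {
    awk '$1 == "ir" { found = 1 } END { exit !found }'
}

# Intended use inside the activated environment:
#   jupyter kernelspec list | has_r_kernel && echo "R kernel found"

# Demonstration on an invented listing in the same two-column layout:
printf 'Available kernels:\n  ir /example/path/kernels/ir\n  python3 /example/path/kernels/python3\n' \
    | has_r_kernel && echo "R kernel found"
```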