library(renv)
#>
#> Attaching package: 'renv'
#> The following objects are masked from 'package:stats':
#>
#> embed, update
#> The following objects are masked from 'package:utils':
#>
#> history, upgrade
#> The following objects are masked from 'package:base':
#>
#> autoload, load, remove, use
Package installation is an important part of renv. This vignette gives you the details, starting with an overview of renv’s cache, before going into various installation challenges around building from source and downloading with proxies or authentication.
Cache
One of renv’s primary features is the global package cache, which shared across all projects. The renv package cache provides two primary benefits:
Installing and restoring packages is much faster, as renv can find and re-use previously installed packages from the cache.
Projects take up less disk space, because each project doesn’t need to contain it’s own copy of every package.
When installing a package, renv installs into the global cache and then adds a symlink1 to that directory in the project library. That way each renv project remains isolated from other projects on your system, but they can still re-use the same installed packages.
The process by which packages enter the cache is roughly as follows:
Package installation is requested via e.g.
install.packages()
, orrenv::install()
, or as part ofrenv::restore()
.If renv is able to find the requested version of the package in the cache, then that package is linked into the project library, and installation is complete.
Otherwise, the package is downloaded and installed into the project library.
After installation of the package has successfully completed, the package is then copied into the global package cache, and then symlinked into the project library.
In some cases, renv will be unable to directly link from the global package cache to your project library, e.g. if the package cache and your project library live on different disk volumes. In such a case, renv will instead copy the package from the cache into the project library. This is much slower, so is worth avoiding.
Cache location
You can find the location of the current cache with
renv::paths$cache()
. By default, it will be in one of the
following folders:
- Linux:
~/.cache/R/renv/cache
- macOS:
~/Library/Caches/org.R-project.R/R/renv/cache
- Windows:
%LOCALAPPDATA%/renv/cache
If you’d like to share the package cache across multiple users, you
can do so by setting the RENV_PATHS_CACHE
environment
variable to a shared path. This variable should be set in an R startup
file to make it apply to all R sessions. While you can set it in a
project-local .Renviron
, or the user-level
~/.Renviron
, we generally recommend using the R
installation’s site-wide Renviron.site
if you’d like to
ensure the same cache path is visible to all users of R on a system.
You may also want to set RENV_PATHS_CACHE
so that the
global package cache can be stored on the same volume as the projects
you normally work on. This is especially important when working projects
stored on a networked filesystem.
Multiple caches
It is also possible to configure renv to use multiple cache
locations. For example, you might want to make both a user-local package
cache, as well as a global administrator-managed cache, visible within
an renv project. To do so, you can specify the paths to the cache
separated with a ;
(or :
on Unix if
preferred). For example:
RENV_PATHS_CACHE=/path/to/local/cache;/path/to/global/cache
In such a case, renv will iterate over the cache locations in order
when trying to find a package, and newly-installed packages will enter
the first writable cache path listed in
RENV_PATHS_CACHE
.
Shared cache locations
When the renv cache is enabled, if that cache is shared and visible to multiple users, then each of those users will have an opportunity to install packages into the renv cache. However, some care must be taken to ensure that these packages can be used by different users in your environment:
Packages copied into the cache may have Access-control Lists (ACLs), which might prevent others from using packages that have been installed into the cache. If this is the case, it’s important that ACLs be set (or updated) on cache entries so that the cache is accessible to each user requiring access. When deploying renv in an enterprise environment, the system administrator should take care to ensure ACLs (if any) allow users access to packages within the renv cache.
By default, packages copied into the cache will remain “owned” by the user that requested installation of that package. If you’d like renv to instead re-assign ownership of the cached package to a separate user account, you can set the
RENV_CACHE_USER
environment variable. When set, renv will attempt to runchown -R <package> <user>
to update cache ownership after the package has been copied into the cache.
Caveats
While we recommend enabling the cache by default, if you’re having
trouble with it, you can disable it by setting the project setting
renv::settings$use.cache(FALSE)
. Doing this will ensure
that packages are then installed into your project library directly,
without attempting to link and use packages from the renv cache.
If you find a problematic package has entered the cache (for example,
an installed package has become corrupted), that package can be removed
with the renv::purge()
function. See the
?purge
documentation for caveats and things to be aware of
when removing packages from the cache.
You can also force a package to be re-installed and re-cached with the following functions:
# restore packages from the lockfile, bypassing the cache
renv::restore(rebuild = TRUE)
# re-install a package
renv::install("<package>", rebuild = TRUE)
# rebuild all packages in the project
renv::rebuild()
See each function’s respective documentation for more details.
Building from source
Where possible, renv will install package binaries, but sometimes a binary is not available and you have to build from source. Installation from source can be challenging for a few reasons:
Your system will need to have a compatible compiler toolchain available. In some cases, R packages may depend on C / C++ features that aren’t available in an older system toolchain, especially in some older Linux enterprise environments.
Your system will need requisite system libraries, as many R packages contain compiled C / C++ code that depend on and link to these packages.
Configure flags
Many R
packages have a configure
script
that needs to be run to prepare the package for installation. Arguments
and environment variables can be passed through to those scripts in a
manner similar to install.packages()
. In particular, the
R
options configure.args
and
configure.vars
can be used to map package names to their
appropriate configuration. For example:
# installation of RNetCDF may require us to set include paths for netcdf
configure.args = c(RNetCDF = "--with-netcdf-include=/usr/include/udunits2")
options(configure.args = configure.args)
renv::install("RNetCDF")
This could also be specified as, for example,
Install flags
Similarly, additional flags that should be passed to
R CMD INSTALL
can be set via the install.opts
R
option:
# installation of R packages using the Windows Subsystem for Linux
# may require the `--no-lock` flag to be set during install
options(install.opts = "--no-lock")
renv::install("xml2")
# alternatively, you can set such options for specific packages with e.g.
options(install.opts = list(xml2 = "--no-lock"))
renv::install("xml2")
Vignettes
renv does not build vignettes when installing a package from source. This is because vignettes often require suggested packages, and installing all suggested packages (particularly from source) can be arduous.
If you want to distribute vignettes for your own packages, we suggest creating your own repository of binaries, either with R Universe (for publicly hosted packages on GitHub), with Posit Package Manager, or with drat.
Downloads
By default, renv uses curl for file downloads when available. This allows renv to support a number of download features across multiple versions of R, including:
- Custom headers (used especially for authentication),
- Connection timeouts,
- Download retries on transient errors.
If curl
is not available on your machine, it is highly
recommended that you install it. Newer versions of Windows 10 come with
a bundled version of curl.exe
; other users on Windows can
use renv::equip()
to download and install a recent copy of
curl
. Newer versions of macOS come with a bundled version
of curl
that is adequate for usage with renv, and most
Linux package managers have a modern version of curl
available in their package repositories. You can also configure which
curl
executable is used by setting the
RENV_CURL_EXECUTABLE
environment variable, if
necessary.
curl
downloads can be configured through renv’s
configuration settings – see ?renv::config
for more
details.
Alternative downloaders
If you’ve already configured R’s downloader and would like to bypass
renv’s attempts to use curl
, you can use the R option
renv.download.override
. For example, executing:
options(renv.download.override = utils::download.file)
would instruct renv to use R’s own download machinery when attempting
to download files from the internet (respecting the R options
download.file.method
and download.file.extra
as appropriate). Advanced users can also provide their own download
function, provided its signature matches that of
utils::download.file()
.
You can also instruct renv to use a different download method by
setting the RENV_DOWNLOAD_METHOD
environment variable. For
example:
# use Windows' internal download machinery
Sys.setenv(RENV_DOWNLOAD_METHOD = "wininet")
# use R's bundled libcurl implementation
Sys.setenv(RENV_DOWNLOAD_METHOD = "libcurl")
If you find that downloads work outside of renv projects, but not within renv projects, you may need to tell renv to use the same download file method that R has been configured to use. You can check which download method R is currently configured to use with:
getOption("download.file.method")
And the downloader currently used by renv can be queried with:
renv:::renv_download_method()
You can force renv to use the same download method as R by setting:
Sys.setenv(RENV_DOWNLOAD_METHOD = getOption("download.file.method"))
and, if necessary, you could also set this environment variable
within e.g. your ~/.Renviron
, so that it is visible to all
R sessions. See ?Startup
for more details.
Note that other features (e.g. authentication) may not be supported
when using an alternative download file method – you will have to
configure the downloader yourself if that is required. See
?download.file
for more details.
Proxies
If your downloads need to go through a proxy server, then there are a variety of approaches you can take to make this work:
Set the
http_proxy
and / orhttps_proxy
environment variables. These environment variables can contain the full URL to your proxy server, including a username + password if necessary.You can use a
.curlrc
(_curlrc
on Windows) to provide information about the proxy server to be used. This file should be placed in your home folder (seeSys.getenv("HOME")
, orSys.getenv("R_USER")
on Windows); alternatively, you can set theCURL_HOME
environment variable to point to a custom ‘home’ folder to be used bycurl
when resolving the runtime configuration file. On Windows, you can also place your_curlrc
in the same directory where thecurl.exe
binary is located.
See the curl documentation on proxies and config files for more details.
As an example, the
following _curlrc
works when using authentication with NTLM
and SSPI on Windows:
--proxy "your.proxy.dns:port"
--proxy-ntlm
--proxy-user ":"
--insecure
The curl R package also has a helper:
curl::ie_get_proxy_for_url()
which may be useful when attempting to discover this proxy address.
Authentication
Your project may make use of packages which are available from remote sources requiring some form of authentication to access – for example, a GitHub enterprise server. Usually, either a personal access token (PAT) or username + password combination is required for authentication. renv is able to authenticate when downloading from such sources, using the same system as the remotes package. In particular, environment variables are used to record and transfer the required authentication information.
Remote Source | Authentication |
---|---|
GitHub | GITHUB_PAT |
GitLab | GITLAB_PAT |
Bitbucket |
BITBUCKET_USER + BITBUCKET_PASSWORD
|
Git Remotes |
GIT_PAT / GIT_USER +
GIT_PASSWORD
|
These credentials can be stored in e.g. .Renviron
, or
can be set in your R session through other means as appropriate.
If you require custom authentication for different packages (for
example, your project makes use of packages available on different
GitHub enterprise servers), you can use the renv.auth
R
option to provide package-specific authentication settings.
renv.auth
can either be a a named list associating package
names with environment variables, or a function accepting a package name
+ record, and returning a list of environment variables. For
example:
# define a function providing authentication
options(renv.auth = function(package, record) {
if (package == "MyPackage")
return(list(GITHUB_PAT = "<pat>"))
})
# use a named list directly
options(renv.auth = list(
MyPackage = list(GITHUB_PAT = "<pat>")
))
# alternatively, set package-specific option
# as a list
options(renv.auth.MyPackage = list(GITHUB_PAT = "<pat>"))
# as a function
options(renv.auth.MyPackage = function(record) {
list(GITHUB_PAT = "<pat>")
})
For packages installed from Git remotes, renv will attempt to use
git
from the command line to download and restore the
associated package. Hence, it is recommended that authentication is done
through SSH keys when possible.
Authentication may be required when resolving a package from a remote
specification. If the package name cannot be inferred directly from the
remote, it can be supplied with a prefix of the form
<package>=
. For example, the igraph package on GitHub
at https://github.com/igraph/rigraph could be installed
with:
renv::install("igraph=igraph/rigraph")
Azure DevOps
Authentication with Azure DevOps may require credentials to be set in a slightly different way. In particular, you can use:
GITHUB_USER = <username>
GITHUB_PASS = <password>
replacing <username>
and
<password>
as appropriate – for example, your
password may just be a PAT. See https://github.com/rstudio/renv/issues/1751 for more
details.
Git and Personal Access Tokens
Rather than username + password, some remote Git repositories might require Personal Access Tokens (PATs) for authentication. Typically, such servers expect authentication credentials where:
- Your PAT is supplied as the username,
- Either an empty response, or the string
x-oauth-basic
, is provided as the password.
To help facilitate this, you can set:
GIT_USER = <pat>
GIT_PASS = x-oauth-basic
in an appropriate startup R file.
Custom headers
If you want to set arbitrary headers when downloading files using
renv, you can do so using the renv.download.headers
R
option. It should be a function that accepts a URL, and returns a named
character vector indicating the headers which should be supplied when
accessing that URL.
For example, suppose you have a package repository hosted at
https://my/repository
, and the credentials required to
access that repository are stored in the AUTH_HEADER
environment variable. You could define
renv.download.headers
like so:
options(renv.download.headers = function(url) {
if (grepl("^https://my/repository", url))
return(c(Authorization = Sys.getenv("AUTH_HEADER")))
})
With the above, renv will set the Authorization
header
whenever it attempts to download files from the repository at URL
https://my/repository
.
Debugging
If having problems with downloads, you can get more debugging information (including raw requests and responses) by setting:
options(renv.download.trace = TRUE)