anaconda-project.yml
.
You can also access and load data in a variety of formats, stored in common sources including the following:
- File systems
- NFS shared drives
- Databases
- Hadoop and Spark clusters
- Distributed version control repositories such as Git and Bitbucket (if configured by your Administrator).
Understanding resource profiles
Resource profiles are used to limit the amount of CPU cores and RAM available for use when running a project session or deployment.Choosing a resource profile with a greater number of available cores is not guaranteed to improve performance—it will also depend on whether the libraries used by the project can take advantage of multiple cores, for example.
Uploading files to a project
Open an editing session for the project, then choose the file you want to upload. The process of uploading files varies slightly, based on the editor used:- In Jupyter Notebook, click Upload and select the file to upload. Then click the blue Upload button displayed in the file’s row to add the file to the project
- In JupyterLab, click the Upload files icon and select the file. In the top right corner, click Commit Changes to add the file to your project.
- In Zeppelin, use the Import note feature to select a JSON file or add data from a URL.
Accessing NFS shared drives
After your Administrator has configured Workbench to mount an NFS share, you’ll be able to access it from within your notebooks. You’ll just need to know the name of the volume, so you can access it. For example, if they named the configuration file sectionmyvolume
, the share will be mounted at /data/myvolume
.
From a notebook you can use code such as this to read data from the share: