Jupyter Notebook

Jupyter is a notebook environment popular in the data science community as a way of capturing and sharing code, documentation, images, interactive charts, and data sources in a single file. It supports many programming languages, with Python being its primary language. A Jupyter notebook server started on G.E.M.S offers four programming environments, called kernels, by default: Python 3, Python 3 with an active Apache Spark node, R, and Scala. Both R and Scala can also use a Spark node, but that requires some manual setup by the user. See Apache Spark for more information.
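To see which kernels a given notebook server actually offers, one option is to read the output of the standard `jupyter kernelspec list --json` command. The sketch below is illustrative only: the helper names `summarize_kernelspecs` and `installed_kernels` are made up for this example, and the sample listing is hypothetical, so the kernel names reported on G.E.M.S may differ.

```python
import json
import subprocess


def summarize_kernelspecs(listing):
    """Map each kernel name to its (display name, language) pair.

    `listing` is the dictionary parsed from `jupyter kernelspec list --json`.
    """
    summary = {}
    for name, info in listing.get("kernelspecs", {}).items():
        spec = info.get("spec", {})
        summary[name] = (
            spec.get("display_name", name),
            spec.get("language", "unknown"),
        )
    return summary


def installed_kernels():
    """Ask the local Jupyter installation which kernels it knows about."""
    result = subprocess.run(
        ["jupyter", "kernelspec", "list", "--json"],
        capture_output=True,
        text=True,
        check=True,
    )
    return summarize_kernelspecs(json.loads(result.stdout))


# Hypothetical listing for illustration only; real kernel names will differ.
sample = {
    "kernelspecs": {
        "python3": {
            "resource_dir": "/path/to/kernels/python3",
            "spec": {"display_name": "Python 3", "language": "python"},
        },
        "ir": {
            "resource_dir": "/path/to/kernels/ir",
            "spec": {"display_name": "R", "language": "R"},
        },
    }
}

for name, (display_name, language) in summarize_kernelspecs(sample).items():
    print(f"{name}: {display_name} ({language})")
```

Calling `installed_kernels()` from a notebook cell or terminal on the server should return the same kind of mapping for the kernels that are really installed, provided the `jupyter` command is on the path.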

Here are some examples of how Jupyter notebooks can be used. If the provided Python environment lacks a required library for an analysis, see how to create and extend a custom Python kernel.
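Before building a custom kernel, it can help to confirm which libraries are actually missing from the provided Python environment. The following is a minimal sketch using only the standard library; the function name `missing_libraries` and the package names in the example are assumptions chosen for illustration.

```python
import importlib.util


def missing_libraries(required):
    """Return the top-level module names in `required` that cannot be found.

    Only top-level names (for example "numpy", not "numpy.linalg") are
    expected here; nothing is imported, the modules are only located.
    """
    return [name for name in required if importlib.util.find_spec(name) is None]


# Example package names only; substitute the libraries your analysis needs.
needed = ["json", "numpy", "pandas"]
absent = missing_libraries(needed)

if absent:
    print("Not available in this kernel:", ", ".join(absent))
else:
    print("All required libraries are available.")
```

Run in a notebook cell, this checks the environment of the kernel that is currently selected, so the result can differ between kernels on the same server.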