An overview on the Databricks Community Cloud platform offered by Databricks at: https://community.cloud.databricks.com/
Provides step by step instructions on how to create a Spark Standalone Cluster and how to use notebooks.
Size: 5.98 MB
Language: en
Added: Nov 14, 2016
Slides: 32 pages
Slide Content
Databricks Community Cloud By: Robert Sanders
Databricks Community Cloud Free/Paid Standalone Spark Cluster Online Notebook Python R Scala SQL Tutorials and Guides Shareable Notebooks
Why is it useful? Learning about Spark Testing different versions of Spark Rapid Prototyping Data Analysis Saved Code Others …
Forums https:// forums.databricks.com /
Login/Sign Up https:// community.cloud.databricks.com / login.html
Home Page
Active Clusters
Create a Cluster - Steps From the Active Clusters page, click the “+ Create Cluster” button Fill in the cluster name Select the version of Apache Spark Click “Create Cluster” Wait for the Cluster to start up and be in a “Running” state
Create a Cluster
Active Clusters
Active Clusters – Spark Cluster UI - Master
Workspaces
Create a Notebook - Steps Right click within a Workspace and click Create -> Notebook Fill in the Name Select the programming language Select the running cluster you’ve created that you want to attach to the Notebook Click the “Create” button
Create a Notebook
Notebook
Using the Notebook
Using the Notebook – Code Snippets > sc > sc. parallelize ( 1 to 5 ). collect ()
Using the Notebook - Shortcuts Short Cut Action Shift + Enter Run Selected Cell and Move to nex t Cell Ctrl + Enter Run Selected Cell Option + Enter Run Selected Cell and Insert Cell Bellow Ctrl + Alt + P Create Cell Above Current Cell Ctrl + Alt + N Create Cell Bellow Selected Cell
Tables
Create a Table - Steps From the Tables section, click “+ Create Table” Select the Data Source (bellow steps assume you’re using File as the Data Source) Upload a file from your local file system Supported file types: CSV, JSON, Avro, Parquet Click Preview Table Fill in the Table Name Select the File Type and other Options depending on the File Type Change Column Names and Types as desired Click “Create Table”