Cloud Computing Tutorial#
Welcome to the Cloud Computing Tutorial!
This tutorial is just the tip of the ice[SAT-2]berg (😬) of cloud computing. It focuses on accessing data stored in the cloud. An understanding of the difference between the “download to local” and “direct from cloud” methods of data access will explain how and why the cloud facilitates the scaling and reproducibility of your science.
Learning Goals
At the conclusion of this tutorial, you should be able to answer:
What is cloud computing?
What is cloud object storage and the difference between data stored in the cloud, data on a local file system and data stored in “on-premise” data centers.
How to optimize data for reading from cloud object storage.
Outline#
-
Definition of cloud computing
Exercise: Difference between resources on your local machine and resources in the cloud
Why you might use cloud computing
-
Definition of cloud object storage
Exercise: How many NASA datasets (aka collections) are in the cloud? How many ICESat-2 datasets are in the cloud? Which DAAC manages ICESast-2 data?
Difference between data stored in the cloud, data on a local file system and data stored in “on-premise” data centers
Why you might use cloud object storage
-
What are we optimizing for and why?
Anatomy of a structured data file
Thought Exercise: Garage analogy
How do we optimize data for reading from cloud object storage?
-
Cloud-Optimized vs Cloud-Native
Creating an ICESat-2 GeoParquet
Plot the data with lonboard
Or simply: Cloud -> Cloud data access -> Optimized cloud data access -> Demo with ICESat-2