Understanding NSIDC DAAC Earthdata Cloud S3 Buckets and Paths
The NSIDC DAAC Earthdata Cloud collections are stored in Amazon Simple Storage Service (Amazon S3), a secure, scalable object storage service. If you plan to work directly with data in the cloud or use command-line tools, it’s helpful to understand how S3 buckets and URLs are structured.
This guide explains:
- How NSIDC DAAC organizes S3 bucket paths
- How to interpret S3 paths, object keys, and prefixes
- The difference between public and protected buckets
- What’s required to access these files
What is a Bucket?
Amazon S3 stores data in buckets, which act like top-level folders or containers. Each bucket holds objects (files) along with their associated metadata, including permissions and descriptive details.
NSIDC DAAC’s Main S3 Buckets
- Protected bucket:
nsidc-cumulus-prod-protected
- Public bucket:
nsidc-cumulus-prod-public
Buckets contain many datasets, organized into a folder-like hierarchy by product and date.
Important Note: NSIDC DAAC Buckets Are Non-Listable
When working with NSIDC DAAC S3 data, it’s important to know that our S3 buckets are intentionally non-listable. This applies to both public and protected buckets.
What “non-listable” means
In many cloud storage systems, you can run a command (for example, aws s3 ls
) to see all of the files in a bucket. For NSIDC DAAC buckets, this operation is disabled by policy. If you try it, you’ll likely see a 403 Permission Denied
error.
This does not mean your credentials are broken or that access to the data is blocked—it simply reflects the bucket policy.
Why this matters
Because the buckets are non-listable, tools or scripts that depend on directory-style browsing or recursive syncing will not work. Instead, you'll need to use direct paths to objects instead of trying to browse the bucket.
Objects: Files Inside a Bucket
An object is a single file in a bucket. Every object has:
- Bucket name – the top-level container (e.g.,
nsidc-cumulus-prod-protected
) - Object key – the path to the file within the bucket
- Metadata – details such as size, permissions, or tags
Example: Full S3 Path for a SMAP SPL3SMP_E File
s3://nsidc-cumulus-prod-protected/SMAP/SPL3SMP_E/006/2025/09/04/SMAP_L3_SM_P_E_20250904_R19240_001.h5
Tree breakdown:
s3://nsidc-cumulus-prod-protected/ ← bucket
└── SMAP/ ← prefix (collection name)
└── SPL3SMP_E/ ← prefix (product short name)
└── 006/ ← prefix (collection version)
└── 2025/ ← prefix (year)
└── 09/ ← prefix (month)
└── 04/ ← prefix (day)
└── SMAP_L3_SM_P_E_20250904_R19240_001.h5 ← object (file)
In the full S3 path above, the object key is:
SMAP/SPL3SMP_E/006/2025/09/04/SMAP_L3_SM_P_E_20250904_R19240_001.h5
What is an Object Prefix?
An object prefix is the folder-like portion of the object key that organizes files. S3 does not have real directories, but prefixes make the bucket behave like one.
At NSIDC DAAC, data are organized hierarchically:
- Mission (e.g.,
SMAP/
,ATLAS/
) - Product short name (e.g.,
SPL3SMP_E/
,ATL06/
) - Collection version (e.g.,
006/
,007/
) - Date (year → month → day)
- File (object) – the actual science granule (e.g.,
.h5
file)
In the SMAP example above, the prefixes include:
SMAP/
→ missionSMAP/SPL3SMP_E/
→ productSMAP/SPL3SMP_E/006/
→ versionSMAP/SPL3SMP_E/006/2025/
→ yearSMAP/SPL3SMP_E/006/2025/09/
→ monthSMAP/SPL3SMP_E/006/2025/09/04
→ day
Why prefixes matter:
- Organization – unsures data are grouped logically by mission, product, version, and time
- Filtering – tools like AWS CLI,
boto3
, ands3fs
can list objects by prefix so you only see what you need. - Efficiency – avoids scanning the entire bucket when querying data subsets
Anatomy of an S3 Path
General format:
s3://bucket-name/prefixes/filename.ext
Part | Description |
---|---|
bucket-name | Top-level container. (e.g., nsidc-cumulus-prod-protected |
prefixes | Directory-like structure organizing datasets by collection, version, date, etc. |
filename.ext | The actual file/object (science file) |
S3 vs HTTPS URLs
Every file in Earthdata Cloud can be referenced in two ways:
URL Type | Example | When to Use |
---|---|---|
S3 URL | s3://nsidc-cumulus-prod-protected/SMAP/SPL3SMP_E/006/2025/09/04/SMAP_L3_SM_P_E_20250904_R19240_001.h5 | For use with AWS CLI, SDKs, or Python tools like s3fs or boto3 ). Requires Earthdata Login authentication and temporary AWS credentials. |
HTTPS URL | https://data.nsidc.earthdatacloud.nasa.gov/nsidc-cumulus-prod-protected/SMAP/SPL3SMP_E/006/2025/09/04/SMAP_L3_SM_P_E_20250904_R19240_001.h5 | Use in a browser or with wget /curl . Requires Earthdata Login credentials (username/password or .netrc configuration). |
Both URLs point to the same file, but differ in workflow.
- S3 URLs are optimized for cloud-native workflows and large-scale automation.
- HTTPS URLs are simpler for downloading files locally but still require Earthdata Login.
For a quick guide on when to use S3 vs HTTPS, see: https://nsidc.org/data/user-resources/help-center/creating-text-files-https-and-s3-urls-earthdata-cloud-data-access
Public vs. Protected Buckets
The main differences between the public and protected buckets are in their contents and the credentials required for access.
Comparison
Feature | Public Bucket (nsidc-cumulus-prod-public ) | Protected Bucket (nsidc-cumulus-prod-protected ) |
---|---|---|
Contents | - Granule-level metadata (XML), may include DAAC-enriched details (e.g., archive date/time) - Thumbnail/browse images (JPEG) typically generated by NSIDC DAAC as low-resolution previews of science data | - Primary science data files (e.g., HDF5, NetCDF, etc.) - Ancillary files (e.g., DMR++ files) stored alongside science data - Additional related science files delivered with the granule |
Purpose | Preview or discover granules before downloading full data | Access and analyze science-quality data |
Authentication | Not required | Required (via Earthdata Login → temporary AWS credentials) |
Ease of Access | Directly accessible via browser, curl , or wget | Requires setup of AWS CLI/SDKs or authenticated tools |
Typical Use Case | Quickly browse metadata or view preview images | Download, analyze, or process science data sets |
Example Path | s3://nsidc-cumulus-prod-public/ATLAS/ATL06/006/ | s3://nsidc-cumulus-prod-protected/ATLAS/ATL06/006/ |
Key Takeaways
- Buckets are the top-level containers in Amazon S3.
- Objects are individual files identified by their bucket name and object key.
- Prefixes organize data logically within a bucket.
- Use S3 URLs for cloud workflows and automation, HTTPS URLs for simple downloads.
- The public bucket provides metadata and preview images without authentication.
- The protected bucket holds science data files and requires Earthdata Login credentials.
- NSIDC DAAC buckets are non-listable - you cannot browse them with
ls
orsync
, but direct object access using known S3 paths will work.