HDFS Service

The platform provides a managed Hadoop File System (HDFS) for big data storage and computing purposes.

HDFS name node web interface is available through the launcher. Please enter your UT credentials to access the interface.

You can use standard CLI commands to perform HDFS operations.

[1]:
hadoop fs -copyFromLocal /data/public/readme.md
copyFromLocal: `readme.md': File exists

[2]:
hadoop fs -ls
Found 2 items
drwxr-xr-x   - jovyan supergroup          0 2021-01-15 13:53 .sparkStaging
-rw-r--r--   1 jovyan supergroup        583 2021-01-15 10:56 readme.md
[12]:
hadoop fs -cat readme.md
# Public Folder

This **read-only** folder contains public files shared with all users.

The main purpose of the folder is to share common datasets, such as base vector and raster maps.

If you need such datasets, please send an e-mail to <s.girgin@utwente.nl> with the following information:

- Name of the dataset
- URL address for download

Self-service portal will be available soon to request datasets.

You can find useful information on the platform, such as lists of available packages, feature tests, configuration files, and benchmark results, under the `platform` folder.

For more information please refer to the Apache Hadoop Documentation

[ ]: