DataPM helps you quickly publish and consume data. Let's start with some concepts.
Visit datapm.io to search and discover packages of data. Then use the command line client to fetch those packages. You can publish your own data packages, and host your own private registry!
Install the DataPM Command Line Client
Download a DataPM Client Installer from the list below.
- Windows DataPM Client Installer
- MacOS DataPM Client Installer
- Debian & Ubuntu DataPM Client Package
- Redhat & Centos DataPM Client Package
Run the installer by double clicking on it. Follow the instructions.
Search, Consume, and Publish Data Packages
You can search the public datapm.io registry using a modern web browser. Or use the following command to search via the command line client.
datapm search "example search"
Then fetch a specific data package.
datapm fetch datapm/example
Generate your own packages from a publicly available data set.
datapm package https://some-web-server.com/path/to/data.csv
You can also create packages of data from databases, cloud systems, and many other sources. Use the following command to see all available sources.
You can update the schema and statistics in the package file using the following command.
datapm update my-package-file.datapm.json
You can edit properties and descriptions of a package using the edit command.
datapm edit my-package-file.datapm.json
Finally, you can publish your package to a registry with the following command.
datapm publish my-package-file.datapm.json
And then you can re-publish the updates using the same publish command above. Or using the update command directly on the registry.
datapm update my-catalog/my-package
Host A Private Registry
DataPM offers the Registry as a free Docker image. View full private registry documentation for a detailed description of how to host your own registry.