- Python (here tested with 3.12.2)
- Docker
- make
- curl
- jq
- Set up DataHub: `make init`
- Create an access token for making API requests.
  - Go to http://localhost:9002/settings/tokens and log in (username/password are `datahub`/`datahub`).
  - Create a token, then in your terminal, make it available as the environment variable `DATAHUB_API_KEY`.
- Create sample data in DataHub: `datahub docker ingest-sample-data --token "$DATAHUB_API_KEY"`
- Ready to experiment with DataHub!
  To learn about the DataHub GraphQL API, use its GraphiQL interface at http://localhost:9002/api/graphiql
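The same API can also be called from the terminal with `curl`, which is a quick way to confirm that the token works. The following is a minimal sketch, not part of this repository: it assumes the GraphQL endpoint is served at `http://localhost:9002/api/graphql` (next to GraphiQL) and that it accepts the token as a bearer token. The helper names `graphql_payload` and `datahub_graphql` are made up for illustration.

```shell
#!/usr/bin/env bash
# Assumption: the GraphQL endpoint sits next to GraphiQL on the local frontend.
DATAHUB_GRAPHQL_URL="${DATAHUB_GRAPHQL_URL:-http://localhost:9002/api/graphql}"

# Wrap a GraphQL query string in the JSON body sent to the endpoint.
# No escaping is done, so the query must not contain double quotes or backslashes.
graphql_payload() {
  printf '{"query":"%s"}' "$1"
}

# POST a query, passing the access token as a bearer token.
datahub_graphql() {
  if [ -z "${DATAHUB_API_KEY:-}" ]; then
    echo "DATAHUB_API_KEY is not set" >&2
    return 1
  fi
  curl --silent --show-error --fail \
    --header "Authorization: Bearer ${DATAHUB_API_KEY}" \
    --header "Content-Type: application/json" \
    --data "$(graphql_payload "$1")" \
    "$DATAHUB_GRAPHQL_URL"
}
```

With DataHub running and the token exported, `datahub_graphql 'query { me { corpUser { username } } }' | jq .` should print the user the token belongs to.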
- You can add the `Account Balance` term to all fields in the `SampleKafkaDataset` dataset.
  - First, check the fields in `SampleKafkaDataset` before making changes at http://localhost:9002/dataset/urn:li:dataset:(urn:li:dataPlatform:kafka,SampleKafkaDataset,PROD)/Schema
  - Run `bash add-glossary-terms.sh`
  - Check the fields again to confirm the change.
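The before and after check can also be done from the terminal instead of the UI. The sketch below is an illustration only and does not reproduce `add-glossary-terms.sh`. It assumes the GraphQL endpoint is at `http://localhost:9002/api/graphql`, that it accepts the token as a bearer token, and that the terms are stored in the dataset's editable schema metadata. The names `field_terms_payload` and `show_field_terms` are hypothetical.

```shell
#!/usr/bin/env bash
# Assumption: the GraphQL endpoint sits next to GraphiQL on the local frontend.
DATAHUB_GRAPHQL_URL="${DATAHUB_GRAPHQL_URL:-http://localhost:9002/api/graphql}"
DATASET_URN='urn:li:dataset:(urn:li:dataPlatform:kafka,SampleKafkaDataset,PROD)'

# Query the glossary terms attached to each field (editable schema metadata only).
FIELD_TERMS_QUERY='query($urn: String!) { dataset(urn: $urn) { editableSchemaMetadata { editableSchemaFieldInfo { fieldPath glossaryTerms { terms { term { urn } } } } } } }'

# Build the JSON request body; the URN is passed as a GraphQL variable
# so the query text itself needs no quoting.
field_terms_payload() {
  printf '{"query":"%s","variables":{"urn":"%s"}}' "$FIELD_TERMS_QUERY" "$1"
}

# Print one line per field: field path, a tab, then its term URNs (comma separated).
show_field_terms() {
  if [ -z "${DATAHUB_API_KEY:-}" ]; then
    echo "DATAHUB_API_KEY is not set" >&2
    return 1
  fi
  curl --silent --show-error --fail \
    --header "Authorization: Bearer ${DATAHUB_API_KEY}" \
    --header "Content-Type: application/json" \
    --data "$(field_terms_payload "$DATASET_URN")" \
    "$DATAHUB_GRAPHQL_URL" |
    jq -r '.data.dataset.editableSchemaMetadata.editableSchemaFieldInfo[]?
           | "\(.fieldPath)\t\([.glossaryTerms.terms[]?.term.urn] | join(","))"'
}
```

Running `show_field_terms` before and after the script makes the change easy to compare. If nothing is printed, no field has terms in the editable schema metadata, or the script stores them somewhere else.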