Skip to content

Instantly share code, notes, and snippets.

@jsvitor
Last active August 25, 2022 15:11
Show Gist options
  • Star 0 You must be signed in to star a gist
  • Fork 0 You must be signed in to fork a gist
  • Save jsvitor/40236dab063dc312e1f67eb314c8dee1 to your computer and use it in GitHub Desktop.
Save jsvitor/40236dab063dc312e1f67eb314c8dee1 to your computer and use it in GitHub Desktop.
Tools to increase data infrastructure robustness and resilience

[approaches] Increase data infrastructure robustness and resiliency

With increasing scalability comes the need to use strategies and approaches to ensure the availability of our services. Here I list some very important tools and concepts to achieve these goals.

Most of these tools are multifunctional.

Data Catalog

Data Observability

what is data observability

three pillars of observability data

  • Metrics refer to a numeric representation of data measured over time.
  • Logs, a record of an event that took place at a given timestamp, also provide valuable context regarding when a specific event occurred.
  • Traces represent causally related events in a distributed environment.

Data collectors

Data Reliability

Introducing Soda Core: The New Way for Data Reliability

Data Virtualization

Data Discovery and Visualization

Data Modeling

  • Delta lake archtecture ➡️ bronze(raw) ⇒ silver(treated) ⇒ gold(final)
  • Data Mesh concept

Data orchestration

Data processing

apache spark vs dask vs pandas

Data enrichment

Data scraping vs Data crawling "Data Crawling means dealing with large data sets where you develop your crawlers (or bots) which crawl to the deepest of the web pages. Data scraping, on the other hand, refers to retrieving information from any source (not necessarily the web)." By Trifacta

More

https://www.snowflake.com/

https://www.rudderstack.com/

https://github.com/joelparkerhenderson/architecture-decision-record

https://nifi.apache.org/

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment