An end-to-end ETL data pipeline that uses PySpark parallel processing to handle about 25 million rows of data from a SaaS application, with Apache Airflow as the orchestration tool and various data warehouse technologies. Apache Superset connects to the data warehouse to generate BI dashboards for weekly reports.

airflow pyspark datawarehouse airflow-docker dataengineering amazon-s3 posgresql azure-blob-storage etl-pipeline apache-superset bi-dashboards

Updated Dec 7, 2022
Python

bengen343 / superset-on-gcp-cloud-run

A walkthrough to deploy Apache Superset on Google Cloud Run

apache-superset google-cloud-run

Updated Jan 9, 2023
Shell

ismaildawoodjee / aws-data-pipeline

A batch processing data pipeline, using AWS resources (S3, EMR, Redshift, EC2, IAM), provisioned via Terraform, and orchestrated from locally hosted Airflow containers. The end product is a Superset dashboard and a Postgres database, hosted on an EC2 instance at this address (powered down):

python docker aws airflow sql etl terraform aws-s3 postgresql aws-emr data-engineering infrastructure-as-code aws-ec2 aws-iam elt data-pipeline aws-redshift apache-superset

Updated May 14, 2022
Python

sairamkrish / trino-superset-demo

Demo application showcasing the integration of Trino with Apache Superset, using MinIO and a Hive metastore

docker demo superset showcase parquet trino apache-superset trinodb

Updated Sep 28, 2022
Dockerfile

abhishektripathi24 / platform-setup

Guidelines for setting up infrastructure built on open-source technologies

letsencrypt kubernetes elasticsearch kibana schema-registry grafana postgresql redis-sentinel prometheus telegraf nginx-proxy kafka-connect kafka-connector kafka-cluster apache-airflow timescaledb apache-superset zookeeper-cluster celery-workers

Updated Dec 10, 2022
HCL

aadel / sqlalchemy-solr

Apache Solr dialect for SQLAlchemy

python sqlalchemy sql solr apache-solr apache-superset

Updated Sep 4, 2024
Python

fraibacas / lakehouse-poc

Run an open-source data LakeHouse locally using Docker Compose

docker-compose prefect apache-superset apache-iceberg lakehouse

Updated May 31, 2024
Python

blcksrx / ansible-superset

Ansible playbook for Apache Superset

ansible-playbook apache-superset

Updated Mar 31, 2023

szachovy / superset-cluster

Highly available Apache Superset backed by a MySQL InnoDB cluster.

mysql python docker redis ansible terraform cluster routing apache celery high-availability innodb apache-superset

Updated Oct 14, 2024
Python

philips-labs / terraform-hsdp-apache-superset

Module to deploy Apache Superset on HSDP Container Host

terraform terraform-module apache-superset hsdp container-host

Updated Apr 24, 2024
HCL

open-datastudio / superset-archived

Moved to https://github.com/open-datastudio/superset

kubernetes open-source business-intelligence apache-superset

Updated Oct 7, 2020
Python

goyal07nidhi / Data-Pipeline

A data pipeline to ingest, process, and store storm event datasets so they can be accessed through different means.

python aws-s3 apache-beam aws-athena gcp-storage apache-superset aws-glue datastudio snowflakedb sqlalchemy-python gcp-dataflow aws-quicksight gcp-bigquery

Updated Apr 7, 2021
Jupyter Notebook

mikekenneth / twitter_data-lakehouse_minio_drill_superset

Building a Data Lakehouse for Analyzing Elon Musk Tweets using MinIO, Apache Airflow, Apache Drill and Apache Superset

python twitter s3 minio apache-drill apache-airflow apache-superset

Updated Feb 4, 2023
Python

predictiveworks / sqlalchemy-ignite

This project provides a SQLAlchemy driver for Apache Ignite. It was built to enable (ad-hoc) data exploration and visualization of datasets managed by Apache Ignite.

sqlalchemy apache-ignite apache-superset

Updated Nov 3, 2021
Python

apache-superset

Here are 45 public repositories matching this topic...
