
Master parallel ETL orchestration with Apache Airflow by running real-world pipelines on AWS EC2 using public data from the FakeStoreAPI, storing it in AWS RDS (PostgreSQL), and exporting to S3 — all while leveraging TaskGroups, dynamic task mapping, and AWS deployment best practices.
Orchestrating a Parallel ETL Pipeline with Apache Airflow on AWS EC2
Stack: Airflow + EC2 + RDS (Postgres) + S3 + Python + Pandas + Boto3
