• NOAA dataset

Parquet format

Data I/O

Local pipeline with Metaflow

  • metaflow
  • get from API
  • save as Parquet files

Pipeline

Local

  • practical example for investment assistant

Scheduling

https://docs.prefect.io/core/concepts/schedules.html#overview

Running on Fargate

https://towardsdatascience.com/serverless-data-pipelines-made-easy-with-prefect-and-aws-ecs-fargate-7e25bacb450c#a045

https://docs.prefect.io/orchestration/agents/ecs.html#flow-configuration

Running on EC2

https://towardsdatascience.com/deploying-prefect-server-with-aws-ecs-fargate-and-docker-storage-36f633226c5f

Scheduled tasks on AWS Fargate with scheduled tasks

  • time-based, cron-loke and event-based scheduling

https://docs.aws.amazon.com/AmazonECS/latest/developerguide/scheduled_tasks.htm