| AWS Service | Type | Description |
|---|---|---|
| Amazon S3 | Data Storage | Amazon S3 (Simple Storage Service) is object storage that can hold data in almost any file format. It can serve as both an input source and an output destination. |
| Amazon Kinesis | Streaming Data | Amazon Kinesis is used to collect, process, and analyze streaming data in real time within AWS. |
| AWS Glue | ETL | Glue is Amazon's serverless, cloud-based ETL (Extract, Transform, Load) service. A Glue Data Catalog can be created, and crawlers can be defined to discover data and populate it. PySpark code can also be used for transformations. |
| Amazon CloudWatch | Logging | CloudWatch is the service where job logs are written. These logs can be analyzed to diagnose issues and failures. |
| AWS IAM | Access Management | AWS IAM (Identity and Access Management) is used to grant different permissions to different users and roles, which is essential for controlling access to data and jobs. |
| Amazon Athena | Query Service | Amazon Athena is a serverless query service used to analyze data stored in S3 with SQL. Tables are created with Athena DDL and point to the data's location in S3. |
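| Amazon EMR | Data Processing | Amazon EMR is a managed cluster platform that runs big-data frameworks such as Apache Spark and Hadoop. It is used for processing jobs where large data volumes are expected. |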
| Amazon DynamoDB | NoSQL Database | DynamoDB is a managed NoSQL (key-value and document) database that serves as an alternative to a relational database within AWS. It can be used for structured and semi-structured data. |
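
To make the Athena row more concrete, here is a minimal sketch of what an Athena DDL statement pointing at an S3 location might look like. The helper `build_athena_ddl`, the table name `sales_db.orders`, the column names, and the bucket `s3://example-bucket/orders/` are all hypothetical and chosen only for illustration. The sketch assumes comma-delimited text files and only builds the statement as a string; it does not submit anything to AWS.

```python
def build_athena_ddl(table, columns, s3_location, delimiter=","):
    """Build a CREATE EXTERNAL TABLE statement for delimited text files in S3.

    `columns` is a list of (name, type) pairs. All names here are illustrative.
    """
    # One "`name` type" entry per column, separated by commas.
    column_defs = ",\n  ".join(f"`{name}` {dtype}" for name, dtype in columns)
    return (
        f"CREATE EXTERNAL TABLE IF NOT EXISTS {table} (\n"
        f"  {column_defs}\n"
        ")\n"
        "ROW FORMAT DELIMITED\n"
        f"FIELDS TERMINATED BY '{delimiter}'\n"
        f"LOCATION '{s3_location}'"
    )


# Hypothetical table and bucket, for illustration only.
ddl = build_athena_ddl(
    "sales_db.orders",
    [("order_id", "string"), ("amount", "double")],
    "s3://example-bucket/orders/",
)
print(ddl)
```

The generated statement could then be pasted into the Athena query editor or submitted through an AWS SDK. Because the table is external, the data stays in S3 and Athena stores only the table definition.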