AWS Roadmap for Java Spring Boot Engineers

From Java dev to AWS engineer. Hands-on, free, no fluff.

AWS Roadmap for Java Spring Boot Engineers

12 phases + capstone 173 hands-on tasks 52 Q&A deep-dives ~60 hrs total Free forever

Build genuine, hands-on AWS experience across 12 phases, from VPC fundamentals to production-grade Kubernetes, event-driven serverless, observability, and GitOps. Each phase teaches the concepts, gives you tasks to build the real thing, and prepares you to talk about it in a senior-level interview.

12 phases + capstone · Move at your own pace, the hands-on tasks are the real measure of progress, not the clock.

Best for mid-to-senior Java engineers who've built applications but have never owned AWS infrastructure.

Start with Phase 1: Foundations →

EC2 IAM VPC STS RDS Aurora RDS Proxy S3 ElastiCache CloudWatch X-Ray Prometheus Grafana Docker ECR ECS EKS IRSA Helm ArgoCD Lambda SQS SNS API Gateway DynamoDB Terraform CloudFormation CDK Secrets Manager

Rajat Chaudhary

Senior Java/Spring Boot engineer. Built this to give developers a real, structured path into AWS: build it, run it, then talk about it in interviews.

GitHub →

Found this useful? Buy me a coffee ☕

Security, Encryption & Identity

Goal: Audit every API call, encrypt secrets and data at rest, and understand the full IAM permission model before building stateful services

Architect this phase

AWS Organization → Management Account → Member Account (your learning account) · CloudTrail → all API calls → S3 bucket (audit log) · KMS CMK → envelope-encrypts Secrets Manager, EBS, RDS (upcoming phases) · IAM Identity Center → SSO portal → developer → short-lived temporary credentials

Draw this yourself for better retention: draw.io · Official AWS Icons

Topics

Networking (continued from Phase 1)

DNS in VPC & Route 53 Resolver

What Every VPC has a built-in DNS resolver reachable at the base CIDR + 2 address (e.g., 10.0.0.2 for a 10.0.0.0/16 VPC). Two VPC attributes control it: enableDnsSupport (enables the resolver; must be on) and enableDnsHostnames (assigns public DNS hostnames to instances; required for Interface Endpoint private DNS). Route 53 Private Hosted Zones let you register custom DNS names (e.g., postgres.internal.company.com) that resolve only inside associated VPCs. Route 53 Resolver Endpoints extend DNS across hybrid networks: an Inbound Endpoint lets on-premises DNS servers forward queries into AWS; an Outbound Endpoint lets EC2 instances forward queries to on-premises resolvers.

Why Services in a VPC find each other by name, not by IP. Private Hosted Zones are how a Spring Boot app resolves postgres.internal to an RDS endpoint without hardcoding IPs. Resolver endpoints are required in hybrid deployments where on-premises systems must reach AWS-hosted services by name and vice versa.

Gotcha

If enableDnsHostnames is off on your VPC, Interface Endpoints private DNS override silently stops working. The SDK resolves the service endpoint to the public IP instead of the ENI in your subnet. This is the root cause when an Interface Endpoint exists but traffic still routes to the internet.
A Private Hosted Zone must be explicitly associated with a VPC before its records resolve inside that VPC. Creating the zone is not enough.
Route 53 Resolver Endpoints cost $0.125/hr per endpoint plus $0.40 per million queries. For small deployments, consider this cost before adding endpoints purely for convenience.

IAM (Advanced)

Resource-based Policies

What An IAM policy attached to an AWS resource rather than to an identity. Common examples: S3 bucket policies, SQS queue policies, SNS topic policies, KMS key policies, Lambda resource policies, and Secrets Manager resource policies. A resource-based policy specifies a Principal (who) and the actions they can perform on that specific resource. The principal can be an IAM role, an AWS account, an AWS service, or * (everyone).

Why Resource-based policies enable cross-account access without an AssumeRole call. If Account B puts a bucket policy allowing Account A's role s3:GetObject, that role can read from the bucket using its own existing credentials. No extra STS call required. This is the standard pattern for SaaS services serving data to customer accounts.

Gotcha

KMS key policies are mandatory. Unlike S3 bucket policies, a KMS key without an explicit key policy grants access to nobody. The default key policy AWS creates includes a root-account delegation statement; if you write a custom key policy from scratch and omit that statement, you can permanently lock yourself out of the key.
For same-account access, identity-based policies alone are usually sufficient. Resource-based policies become necessary for cross-account access and for expressing conditions tied to the resource itself (e.g., deny delete unless MFA is present).
When both policy types exist for the same account, AWS grants access if either allows it and no explicit Deny overrides it. Across accounts, both the identity-based policy on the caller and the resource-based policy on the target must allow the action.

IAM Identity Center (formerly AWS SSO)

What A managed service for human access to AWS accounts in an organization. Instead of creating individual IAM users in each account, you connect IAM Identity Center to your identity provider (Okta, Microsoft Entra ID, Google Workspace, or the built-in directory), define Permission Sets (role templates with IAM policies), and assign users or groups to specific accounts. Developers get short-lived credentials via the AWS access portal or aws sso login on the CLI, not via long-lived Access Keys.

Why A team with individual IAM users and Access Keys has as many long-lived credentials as employees, each needing rotation, each being a leak risk, each requiring individual revocation when someone leaves. IAM Identity Center centralises provisioning and revocation: remove someone from the IdP, and access across all AWS accounts disappears immediately.

Gotcha

IAM Identity Center requires AWS Organizations to be enabled. It is a multi-account service by design.
Permission Sets become IAM roles in each assigned account under the naming pattern AWSReservedSSO_PermissionSetName_xxx. Do not modify these roles directly in IAM; they are managed by Identity Center and changes get overwritten.
aws sso login credentials are short-lived (typically 1-12 hours). Automated scripts that run unattended cannot use SSO credentials; they need a dedicated service account role with long-lived credentials stored in Secrets Manager or a dedicated IAM role assumed by the automation service (e.g., a Lambda execution role or a GitHub Actions OIDC role).

AWS Organizations & Service Control Policies (SCPs)

What AWS Organizations lets you manage multiple AWS accounts under a single management account. Accounts are organized into Organizational Units (OUs). Service Control Policies (SCPs) are maximum-permission guardrails attached to OUs or accounts. An SCP does not grant permissions; it limits the ceiling. Even an IAM Role with AdministratorAccess cannot exceed what the SCP permits. SCPs apply to all identities in member accounts, including root users of those accounts.

Why SCPs enforce compliance rules at the account boundary without modifying any IAM policy. Example: an SCP on all production OUs that denies any EC2 action outside us-east-1 and eu-west-1. No developer can accidentally spin up instances in a prohibited region regardless of their IAM permissions.

Gotcha

SCPs do not apply to the management account. The management account has full access regardless of SCP configuration. This is why the management account should hold no workloads: it is a privileged account for organization administration only.
Organizations consolidates billing: all charges across member accounts roll up to the management account, and volume discounts aggregate across the whole org. This benefit alone is worth enabling Organizations even for small teams.
AWS service-linked roles are exempt from SCPs when they need specific permissions to function on your behalf. This prevents SCPs from accidentally breaking managed services like EKS or RDS.

Security Operations

AWS CloudTrail

What CloudTrail records every AWS API call in your account: who called it (IAM user, role, or service), from where (IP address, console or CLI), when, what was requested, and what was returned. Management events (control-plane actions such as create, delete, or modify resources) are captured by default and viewable in Event History for 90 days at no cost. Data events (S3 object reads/writes, Lambda invocations) and Insights events (anomaly detection) are opt-in and billed separately. A Trail sends events to an S3 bucket for long-term retention and optionally to CloudWatch Logs for alerting.

Why When a security incident happens ("who deleted that S3 bucket?", "which role changed this Security Group at 2am?", "why are there 1,000 STS calls in 60 seconds?"), CloudTrail is the first place you look. It is also a prerequisite for most compliance frameworks: SOC 2, PCI-DSS, HIPAA, and ISO 27001 all require API audit logs.

Gotcha

Event History shows only management events for the last 90 days. For S3 object access logs or records older than 90 days, you must create a Trail. The first copy of management events per region is free; additional copies and data events are billed.
CloudTrail logs land in S3 with up to a 15-minute delay. For near-real-time alerting on specific API calls (e.g., alert when anyone calls DeleteTrail), send the Trail to CloudWatch Logs and create a metric filter and alarm.
Protect the Trail's S3 bucket. An attacker who compromises your account will attempt to delete the CloudTrail logs to cover their tracks. Enable S3 Object Lock on the Trail bucket and restrict cloudtrail:StopLogging and cloudtrail:DeleteTrail with an SCP or a deny policy on all non-admin roles.

AWS KMS (Key Management Service)

What KMS manages cryptographic keys used to encrypt data at rest. A Customer Managed Key (CMK) is a key you create and control ($1/month + $0.03 per 10,000 API calls). An AWS Managed Key is created by AWS on behalf of a specific service (e.g., aws/s3, aws/rds) at no charge. KMS uses envelope encryption: KMS generates a one-time data key (AES-256 symmetric), your application or the AWS service encrypts the data locally using that key, and then the data key itself is encrypted with the CMK. The encrypted data key is stored alongside the ciphertext. To decrypt, KMS decrypts the data key and returns the plaintext key in memory for the calling application to use.

Why Nearly every later phase uses KMS: RDS encrypts storage at rest, EBS volumes encrypt by default in most regions, S3 uses SSE-KMS for object encryption, and Secrets Manager encrypts secrets with a CMK. Understanding envelope encryption explains why rotating a CMK does not require re-encrypting all your data (only the data keys are re-wrapped), and why a CMK's raw key material cannot be exported from KMS hardware.

Gotcha

Deleting a CMK has a mandatory 7-30 day waiting period and is irreversible. Any data encrypted with that key and not separately backed up becomes permanently unrecoverable after deletion. AWS cannot help you recover it. Before scheduling deletion, run a last-access audit in the KMS console.
Cross-account EBS snapshot sharing requires the snapshot to be encrypted with a CMK whose key policy explicitly grants access to the target account. Snapshots encrypted with AWS Managed Keys cannot be shared cross-account.
KMS is rate-limited; the default in most regions is 30,000 requests/second across all cryptographic operations. High-throughput applications that call GenerateDataKey per database record can approach this limit. The correct pattern is to cache the plaintext data key in memory and call KMS only when the cache expires or the key is rotated.

AWS Secrets Manager vs SSM Parameter Store

What Both store sensitive configuration values so applications never embed credentials in code or config files. AWS Secrets Manager: purpose-built for secrets, $0.40/secret/month + $0.05 per 10,000 API calls, built-in automatic rotation for RDS, Redshift, and DocumentDB, plus Lambda-based custom rotation for any other secret. AWS Systems Manager Parameter Store: free for standard parameters (up to 4 KB, up to 10,000 parameters per account), $0.05/advanced-parameter/month, no automatic rotation. Both encrypt values using KMS and integrate with IAM for access control.

Why Your Spring Boot application in Phase 2 must connect to Phase 4's RDS PostgreSQL. The database password must not be in application.properties, an environment variable, or baked into an AMI. The correct pattern: store it in Secrets Manager, grant the EC2 IAM Role secretsmanager:GetSecretValue on that specific secret ARN, and retrieve it at startup using the AWS SDK.

Gotcha

When Secrets Manager rotates a secret, the value changes in place. Your application must re-fetch the credential on the next connection attempt rather than caching it indefinitely at startup. The AWS Secrets Manager JDBC driver wrapper for Java handles this automatically for database connections.
SSM Parameter Store standard tier is throttled at 40 transactions/second by default (soft limit). Applications that fetch many parameters at startup can hit this. Use GetParametersByPath to batch fetches, or use the advanced tier for higher throughput.
Use Secrets Manager for anything that rotates or is a credential (database passwords, API keys, TLS private keys). Use SSM Parameter Store for non-secret configuration (feature flags, service URLs, environment-specific config values). Mixing them per-secret type is fine; mixing them arbitrarily per-project creates confusion during incident response.

Hands-on Tasks

Browse CloudTrail Event History (no setup needed). Open CloudTrail → Event History. Filter by Event name: CreateVpc and find the event from Phase 1. Click into it and examine the JSON: note userIdentity.arn (who called it), sourceIPAddress, requestParameters.cidrBlock, and responseElements.vpc.vpcId. Every CloudTrail event has this shape. You are learning to read incident evidence, not just finding a VPC.
Create a CloudTrail Trail for long-term retention. CloudTrail → Trails → Create trail → name: management-trail → create a new S3 bucket → enable SSE-KMS encryption using a new CMK named cloudtrail-key → management events: Read + Write → leave data events off. After creation, confirm the Trail shows "Logging: On" and a log file appears in the S3 bucket within 15 minutes. Cost: the Trail itself is free for the first copy of management events per region; you pay for S3 storage and KMS API calls (a few cents per month).
Create a KMS Customer Managed Key and test encrypt/decrypt. KMS → Customer managed keys → Create key → Symmetric → Encrypt and decrypt → alias: my-learning-key. Confirm the key policy includes the root account delegation statement. Then via CLI:
CIPHER=$(aws kms encrypt --key-id alias/my-learning-key --plaintext "hello world" --output text --query CiphertextBlob)
aws kms decrypt --ciphertext-blob fileb://<(echo "$CIPHER" | base64 -d) --output text --query Plaintext | base64 -d
You should get "hello world" back. This is envelope encryption at the KMS API level; in practice the AWS SDK wraps GenerateDataKey and local AES-256 encryption for you.
Store a secret in Secrets Manager. Secrets Manager → Store a new secret → Other type of secret → key: db_password, value: my-test-password-123 → KMS key: alias/my-learning-key → name: learning/db-password. Retrieve via CLI: aws secretsmanager get-secret-value --secret-id learning/db-password --query SecretString --output text. In Phase 4, you will store your RDS master password here and fetch it from Spring Boot using this same call.
Write an S3 bucket policy (resource-based policy). Create a bucket named p2-policy-test-<your-account-id>. Open Permissions → Bucket policy and add this policy (replace the account ID placeholder):
```
{
  "Version": "2012-10-17",
  "Statement": [{
    "Effect": "Deny",
    "Principal": "*",
    "Action": "s3:GetObject",
    "Resource": "arn:aws:s3:::p2-policy-test-<account-id>/*",
    "Condition": {"Bool": {"aws:SecureTransport": "false"}}
  }]
}
```
This denies HTTP (non-TLS) object reads regardless of the caller's IAM policies. Save, then try to list objects via the CLI over HTTP (use --no-verify-ssl) and confirm the Deny is enforced. Delete the bucket when done.
Create a Route 53 Private Hosted Zone. Route 53 → Hosted zones → Create hosted zone → Domain name: internal.learning → type: Private → associate with your Phase 1 VPC. Add an A record: name postgres, value 10.0.2.10. In Phase 4 you will point this record at your actual RDS endpoint so the Spring Boot app connects to postgres.internal.learning rather than an opaque AWS hostname. Note the cost: $0.50/zone/month; delete this zone when done if you prefer to skip the ongoing charge and create it fresh in Phase 4.
Self-check. Without notes, explain aloud: (1) What is the difference between an identity-based and a resource-based IAM policy? Give one concrete example of each. (2) A KMS CMK is rotated annually. Do your encrypted RDS backups need to be re-encrypted? Why or why not? (3) Your team uses IAM Identity Center. A developer leaves. What is the one action required to revoke all their AWS access, and why does the same step not work with individual IAM users? (4) An SCP at the OU level denies s3:DeleteBucket. A developer with AdministratorAccess tries to delete a bucket. What happens, and does it matter which AWS region they use? (5) CloudTrail Event History shows a Security Group was modified at 3am. What two fields in the event tell you who made the change and whether it was from a compromised credential? If all five are clear, Phase 2 is done.

Interview Q&A, Expand each to see the answer

What is the difference between an identity-based and a resource-based IAM policy?

An identity-based policy is attached to an IAM user, group, or role. It defines what that identity is allowed to do. It says nothing about who else can access a particular resource.

A resource-based policy is attached to an AWS resource (S3 bucket, KMS key, SQS queue, Lambda, etc.). It specifies a Principal: who can perform which actions on that specific resource. The principal can be in a different AWS account.

The key practical difference: resource-based policies enable cross-account access without an AssumeRole step. An IAM role in Account A can access an S3 bucket in Account B if the bucket policy grants Account A's role permission. With identity-based policies alone, Account A's role would need to first call sts:AssumeRole into Account B.

For same-account access, identity-based policies suffice. For cross-account access, use either a resource-based policy on the target or an STS AssumeRole into the target account (the latter is more auditable).

How should a team of 10 engineers access the AWS console and CLI in production?

Through IAM Identity Center, connected to the corporate identity provider (Okta, Active Directory, Google Workspace).

Engineers authenticate with their corporate credentials. IAM Identity Center issues short-lived temporary credentials (valid 1-12 hours). CLI access uses aws sso login; console access uses the AWS access portal.

Why this beats individual IAM users with Access Keys:
- No long-lived Access Keys to rotate, lose, or have stolen
- Offboarding is instant: remove the user from the IdP, and all AWS access disappears immediately across every account
- Permission Sets define role templates centrally; you assign a person to an account with a role, not a pile of policies
- All session activity in CloudTrail is tied to the individual's identity, not to a shared role ARN

In a solo learning account, an IAM user with an Access Key is acceptable. In any team setting, IAM Identity Center is the correct architecture.

What does CloudTrail capture and how do you use it to investigate an incident?

CloudTrail captures every AWS API call: the caller's identity (userIdentity.arn), source IP (sourceIPAddress), timestamp, service and action (eventSource, eventName), request parameters, and the response (including error codes).

Investigation workflow:
1. Identify the affected resource by its ARN or ID
2. Filter CloudTrail for the relevant action (e.g., DeleteBucket, AuthorizeSecurityGroupIngress, PutBucketPolicy)
3. Read userIdentity.arn to identify the caller; read sourceIPAddress to see whether it came from a known IP or an unexpected one
4. Widen the time window and search all API calls by that role ARN to trace what else was accessed
5. Check errorCode fields for failed access attempts that preceded the successful one

Logs land in S3 with up to a 15-minute delay. For real-time alerting (e.g., notify on DeleteTrail or ConsoleLogin from an unknown IP), pipe the Trail to CloudWatch Logs and configure metric filters.

What is envelope encryption and why does KMS use it instead of encrypting data directly?

Envelope encryption is a two-layer scheme:
1. KMS generates a one-time symmetric data key (AES-256) via GenerateDataKey
2. Your application (or the AWS service) encrypts the data locally with the data key
3. KMS encrypts the data key itself with the CMK
4. The encrypted data key is stored alongside the ciphertext

To decrypt: call KMS to decrypt the data key, then use the plaintext data key in memory to decrypt the data, then discard the plaintext key.

Why not let KMS encrypt data directly? Three reasons:
- Size: KMS direct encryption handles only up to 4 KB. Database records and files are larger
- Performance: every read or write would make a network call to KMS (1-5ms each). Local AES-256 is nanoseconds
- Cost: KMS charges per API call; encrypting every database record individually would be expensive at scale

The rotation insight: rotating a CMK requires only re-wrapping the data keys (a KMS API call per data key), not re-encrypting the actual data. This is why key rotation is cheap even when you have terabytes of encrypted data.

My Notes

Saved to browser storage automatically as you type.

AWS Roadmap for Java Spring Boot Engineers

AWS Foundations

Deploy Spring Boot on EC2

Security, Encryption & Identity

RDS Integration

S3 Storage

Monitoring and Observability

Containers and ECS

Kubernetes with EKS

Serverless

Infrastructure as Code

ElastiCache / Redis

Prometheus + Grafana

ArgoCD / GitOps

Capstone Project