-
Notifications
You must be signed in to change notification settings - Fork 0
New issue
Have a question about this project? # for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “#”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? # to your account
Scale up Avni Cloud resources #56
Comments
Have one concern regarding going live with 11.0 release during this time period. |
Dry RunScaled DB from t3.medium -> t3.large; EC2 from t3.medium -> t3.large and Xmx2048m
Expected 10-15 mins downtime. PerformanceRan a few performance simulation runs using gatling with users from goonj and apfodisha and we get similar performance as current prod for 18 users syncing in parallel (peak: 18 req/sec (1000/min); avg: 10 req/sec) which is roughly 3-4x our prod load. (note: we will be using higher spec machines for prod so performance should scale accordingly). APIs taking longer: syncDetails, locationMapping (confirmed known bottlenecks). Also, Prerelease db was continuously under heavy load - diagnosed it to database sync activity from metabase green instance and ETL for rwb group orgs. These have been disabled. |
Prod scaled up as planned. Down time 22:04 to 22:12 |
Prod RDS downscaled back to db.t3.medium |
Need
New org with ~8000 users starts using Avni cloud on Jan 21. Need to scale resources to handle anticipated load.
Training period - 5 day period between Jan 21 - Feb 15
Regular usage - post Feb 15
Changes
AWS
Dry run
Jan 10 - Scale up on prerelease to confirm no/minimal outage and no other issues during scale up and then scale back down to current prerelease capacity.
Live
(Scheduling change ahead of time to allow CPU credits to accumulate prior to load and for the change to happen during low usage period.)
Jan 19 pm - Change Prod RDS read-write instance: db.t3.medium -> db.t3.2xlarge (8vCPU/32GB => 8x current)
Jan 19 pm - Change 'Prod machine' EC2 instance: t3.large -> t3.xlarge (4vCPU/16GB => 2x current)
Feb ? - Change Prod RDS read-write instance: db.t3.2xlarge -> db.t3.xlarge (4vCPU/16GB => 4x current)
Application
avni-server
Tune memory params to leverage available resources.
Observe application performance.
The text was updated successfully, but these errors were encountered: