GOV Video Flashcards
Intro
Cloud Integration I deliveered while working as a engineer for Thomson Reuters
Situation
The business needed to onboard the 1st level support group to monitor replication from the HVR tool
Task
The business required a tactical replication alerting solution for the HVR infrastructure until the strategic datadog solution was ready (6 months away)
Action
python script on EC2 rep hub - not cloud native and not utilizing HVR out of the box alerting also wanted a solution more eventually driven and micro service based. Have had a robust scheduler and didn’t want to rely on a logfile
Solution I came up with was to use a combination of SNS , Lambda and API gateway to send tickets to Snow
Initial issue was that the HVR tool required an SNMP endpoint with usernamd and password
Result
Business unit teams had control over their own monitoring
the support group received snow tickets
the business had an interim monitoring solution prior to datadog rollout
Learning
Improvements
Support did have a few issues with the lack of deduplication - this had to be managed in the replication tool
Would have been a nice addition to have handled this in the automation perhaps using something like SQS that has depulication capabilities or even dynamo DB
Bear in minf that the solution would have grown larger and larger and this was an interim solution
Also teh learning curve for support was quite steep and it would have been more beneficial to have onboarded them earlier