Infra questions Flashcards

potential infrastructure questions

1
Q

How long do we store metrics for?

A

We retain all of our metrics for 15 months

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Can I automate the build of dashboards and monitors?

A

Yes - you can use our API alongside tools like terraform to automate builds

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Do you deploy the agent as a sidecar for my Kubernetes Cluster?

A

No - it’s deployed as a daemonset, therefore one agent per host/node

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

How often do we receive metrics from API crawlers?

A

For cloud - around 10 minutes, which is the same interval you’d experience in cloud native tools.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Do we need the Agent?Or can we just configure through API

A

Since the APIs collect metrics on a 10-15 min interval, to get more granular metrics AND quicker metrics, you should get the Agent, because the agent collects every 15 seconds…Why would you NOT want the agent?

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Where is my data stored?

A

We are fully hosted in the cloud, so in multiple cloud instances (AWS, Azure, GCP, also in the EU. Mostly in USA.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Do you offer on-prem hostings?

A

No.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

How could I get more advanced metrics from RDS?

A

Deploy the agent to an ec2 instance on the same security group as the database - you can hook into the rds like a regular DB

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Can I automate setting up multiple AWS accounts rather than manually in the UI?

A

Yes, using terraform, API, or cloudformation

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q
A
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Any community dashboards?

A

No. But we have all types of dashboards and it is very easy to customize your own

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Can I convert grafana dashboards into DD so I dont have to recreate?

A

No

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Can I trigger changes on my infrastructure when an alert goes off?

A

Not directly, but we can trigger a webhook to allow a client to trigger a script based on the web hook. We also have AWS Automation and Kubernetes to autoscale

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Will I see Watchdog alerts right away?

A

No, it can take anywhere from 2-6 weeks. Watchdog needs to collect data in order to start making assumptions. The more data it has, the better

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Can I silence alerts periodically?

A

Yes, we can manage downtime in monitors easily

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

How long does it take to set up Communcation integrations?

A

With any supported tech stack, very quickly, but can depend

17
Q

What If I have to completely seperate my data based on teams?

A

We could spin up multi-org environments, or seperate DD instances

18
Q

Is RBAC supported?

A

Yes, for logs and dashboards

19
Q

Do you use monitor netflow? (e.g. monitoring the flow between devices such as routers/firewalls

A

Yes, in public beta

20
Q

Can you support hybrid envrionments?

A

Yes

21
Q

Support serverless?

A

Yes - theres a Serverless page under infrastructure to show AWS lambdas and AWS serverless

22
Q

How often do live containers get updated?

A

2 second buffer period

23
Q

Whats the agent overhead?

A

Depends… .08%-3% CPU image

24
Q

If a client is running AWS ECS on Fargate, do they deploy the agent as a sidecar using Docker within the task definition?

A

Yes

25
Q

Do I have to use Helm to deploy the agent in a Kubernetes environment?

A

Nope. We also provide YAML manifests for required resources

26
Q

Can I see container health information beyond the 36 hours available in Live view?

A

Yes. Docker, Kubernetes, Chhecks/Integrations retain container info for the standard 15 months

27
Q

My K8s is epehemeral

A

Datadog can provide insights into the entire lifecycle of pods, including observing when they terminate. We can also capture real time data on CPU and memory util. of pods

28
Q

My servers are ephemeral

A

You can use a metric monitor on the system uptime metric to address this. This metric is an ever increasing timer which resets to 0 when the host starts. You can use diff() function with the metric to distinguise between a new server and a rebooted server