general questions Flashcards

1
Q

What are the main differences between SQL and NoSQL databases?

A

SQL databases are relational, have predefined schemas, and use structured query language. NoSQL databases are non-relational, have dynamic schemas, and scale horizontally more easily.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Name three popular NoSQL databases.

A

MongoDB, Cassandra, and Redis.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What is PostgreSQL known for?

A

PostgreSQL is known for its reliability, feature robustness, and support for both relational and non-relational queries.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What are distributed SQL systems? Give two examples.

A

Distributed SQL systems are databases that appear as a single node but scale out horizontally. Examples include YugabyteDB and CockroachDB.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is Apache Kafka used for?

A

Apache Kafka is a distributed event streaming platform used for high-performance data pipelines, streaming analytics, and data integration.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is Apache Spark?

A

Apache Spark is an open-source unified analytics engine for large-scale data processing, capable of handling batch and stream processing.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Name two major cloud infrastructure providers.

A

Amazon Web Services (AWS) and Google Cloud Platform (GCP).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What are key considerations when designing high-performance, low-latency data systems?

A

Data locality, caching strategies, optimized data structures, efficient algorithms, and minimizing network calls.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is a microservices architecture?

A

A microservices architecture is an architectural style that structures an application as a collection of loosely coupled, independently deployable services.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

How would you approach managing distributed teams across London and Lisbon offices?

A

Establish clear communication channels, use collaboration tools, set regular check-ins, foster a unified team culture, and be mindful of time zone differences.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

How can you effectively collaborate with data scientists, software engineers, and risk management teams?

A

Regular cross-functional meetings, clear documentation, shared goals, and fostering mutual understanding of each team’s needs and constraints.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Name three techniques for monitoring system performance.

A

Use of monitoring tools (e.g., Prometheus, Grafana), log analysis, and performance profiling.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What is GDPR and why is it important in data management?

A

GDPR (General Data Protection Regulation) is an EU law on data protection and privacy. It’s important because it sets guidelines for collecting and processing personal information from EU citizens.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

How do you stay updated with new technologies in data engineering?

A

Following tech blogs, attending conferences, participating in online communities, and experimenting with new tools in personal projects.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What is ComplyAdvantage’s primary mission?

A

ComplyAdvantage aims to neutralize the risk of money laundering, terrorist financing, corruption, and other financial crimes.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Name three key technologies in ComplyAdvantage’s tech stack.

A

Kubernetes, Apache Kafka, and YugabyteDB.

17
Q

How would you handle a situation where team members disagree on a technical approach?

A

Facilitate open discussion, encourage evidence-based arguments, seek consensus, and if necessary, make a decision based on the best available information while explaining the rationale.

18
Q

What’s a good question to ask about the company’s future plans?

A

‘What are the biggest challenges and opportunities you see for ComplyAdvantage in the next 3-5 years, and how does this role contribute to addressing them?’