What Data Scientists Do Flashcards
What example did Dr. Murtaza Haider investigate to demonstrate the role of a data scientist?
Dr. Haider found a relationship between unexpected bad weather and the number of public transit complaints in Toronto.
How can data scientists help tackle environmental challenges like water toxicity?
By using artificial neural networks, data scientists can help predict algae blooms and safeguard ecosystems.
What did Norman White build that simplified intricate problems across departments?
He built a recommendation engine.
What educational tools does Dr. White use to teach future data scientists?
Python notebooks, Unix, Linux, relational databases, and tools like Pandas.
What educational backgrounds does Dr. Vincent Granville list as necessary for a data scientist?
Algebra, calculus, training in probability, and statistics.
What is the difference between a statistician and a data scientist according to Dr. Granville?
A data scientist uses statistics, but is not only a statistician.
What is statistical regression used for?
To show the probable relationship between two variables, such as distance driven and gas used.
What machine learning algorithm is mentioned in the text for processing big data?
Nearest neighbor.
Why should the term ‘big data’ be used with caution?
Because what was once considered big data is constantly evolving due to innovation.
What tools have expanded the possibilities for handling big data?
Tools like Hadoop and software advancements have expanded the limits for handling data.
What sets a data scientist apart according to Dr. Patel?
Their ability to unlock insights and convey compelling narratives to stakeholders.
What types of data do data scientists work with?
Data from a wide variety of sources, including video, audio, and text (structured and unstructured).
What are some common data formats used by data scientists?
Delimited text files, spreadsheets, XML, PDFs, and JSON.
What quality does Rachel Schutt highlight as making a data scientist exceptional?
Curiosity.
What skills and roles does a data scientist combine, according to Rachel Schutt?
A blend of computer scientist, software engineer, and statistician.