Introduction to Data Analytics | Week 3

Course Link: https://in.coursera.org/learn/introduction-to-data-analytics/

Gathering Data

Practice Quiz

1. What are the requirements in order for data to be reliable? (Select all that apply)

Data should be free of all errors 

Data should be easy to collect 

Data should be structured

Data should be relevant 

2. What type of data is produced by wearable devices, smart buildings, and medical devices?

Observation study data

Sensor data

Census data

Survey data

3. What type of data is semi-structured and has some organizational properties but not a rigid schema?

 Online forms 

 Web logs

 Emails

Data from OLTP systems

Graded Quiz

1. What are some of the steps in the process of “Identifying Data”? (Select all that apply)

Determine the visualization tools that you will use

 Define a plan for collecting data 

 Determine the information you want to collect 

 Define the checkpoints 

2. What type of data refers to information obtained directly from the source?

 Primary data 

 Sensor data 

 Third-party data 

 Secondary data 

3. Web scraping is used to extract what type of data? 

Data from news sites and NoSQL databases

Images, videos, and data from NoSQL databases

Text, videos, and data from relational databases

Text, videos, and images

4. Data obtained from an organization’s internal CRM, HR, and workflow applications is classified as:

Copyright-free data

Primary data

Third-party data

Secondary data

5. Which of the provided options offers simple commands to specify what is to be retrieved from a relational database?

Web Scraping

RSS Feed

SQL

API

Wrangling Data

Practice Quiz

1. What is one of the common structural transformations used for combining data from one or more tables?

Denormalization

Joins

Cleaning

 Normalization 

2. What tool allows you to discover, cleanse, and transform data with built-in operations?

Watson Studio Refinery

Google DataPrep

Trifacta Wrangler

OpenRefine

 3. What is data called that does not fit within the context of the use case?

 Duplicate data 

 Irrelevant data 

 Relevant data

 Missing data 

Graded Quiz

1. What does a typical data wrangling workflow include?

Using mathematical techniques to identify correlations in data

Predicting probabilities

Validating the quality of the transformed data 

Recognizing patterns

2. OpenRefine is an open-source tool that allows you to: 

Automatically detect schemas, data types, and anomalies

Use add-ins such as Microsoft Power Query to identify issues and clean data

Transform data into a variety of formats such as TSV, CSV, XLS, XML, and JSON

Enforces applicable data governance policies automatically

3. What is one of the steps in a typical data cleaning workflow? 

Inspecting data to detect issues and errors

Building classification models

Clustering data

Establishing relationships between data events

4. When you’re combining rows of data from multiple source tables into a single table, what kind of data transformation are you performing?

Denormalization

Unions

Normalization

Joins

5. When you detect a value in your data set that is vastly different from other observations in the same data set, what would you report that as?

Outlier

Syntax error

Missing value

Irrelevant data

* The material and content uploaded on this website are for general information and reference purposes only and don’t copy the answers of this website to any other domain without any permission or else copyright abuse will be in action.

Please do it by your own first!

DMCA.com Protection Status


More From Kuizzer


Introduction to Data Analytics | Week 1


Introduction to Data Analytics | Week 2


Introduction to Data Analytics | Week 3


Introduction to Data Analytics | Week 4


Introduction to Data Analytics | Week 5

0 0 votes
Article Rating
Subscribe
Notify of
guest

1 Comment
Oldest
Newest Most Voted
Inline Feedbacks
View all comments



1
0
Would love your thoughts, please comment.x
()
x