Download as txt, pdf, or txt
Download as txt, pdf, or txt
You are on page 1of 2

Which process of KDD aids in unifying data from different sources?

-> Data
Integration
Consider an example of an apartment: The number of bedrooms, bathrooms, and the
floor of an apartment determines its price. Which is the dependent variable in this
example?-> Price
__________ step of classification contributes to the construction of learning
model.->Learning Step

Consider an example of an apartment: The number of bedrooms, bathrooms, and the


floor of an apartment determines its price.

Which is/are the predictor variable(s) in this example?-. All the options(Number of
bedrooms,Floor in which the apartment is located & Number of bathrooms)
Response variable is a __________.->Dependent Variable
Explanatory variable is a __________.-> Predictor Variable
__________ is the problem of identifying a category to which a new observation
belongs to, based on a training set of data containing observations whose
categories are already known.-> Classification
__________ term portrays the process of discovering small pieces from a large
volume of raw material.-> Mining

Collective outlier significantly deviates from the entire dataset.-> False


Derived relationships in Association Rule Mining are represented in the form of
__________.-> Rules
Jacard Index distance measure is used on __________.-> Non-numeric dataset
Clustering process works on _________ measure.-> Distance
__________ step of KDD process helps in identifying valuable patterns.-> Pattern
Evaluation
Identify the Unsupervised Learning method.-> Clustering

Which of the following association measure helps in identifying how frequently the
item appears in a dataset?-> Support
__________association measure compares the confidence with the expected
confidence.-> Lift
Which of the following helps in measuring the central tendency of the dataset?->
All the options(Mode,Mean&Median)
Distance measure(s) used in clustering process of Numeric Dataset is/are
__________.-> All the options(Hamming,Minkowski&Manhattan Distance)
__________ aids in identifying associations, correlations, and frequent patterns in
data.-> Association Rule Mining
Which among the following is/are (an) Ensemble Classifier?-> All the options(Random
Forest,Boosting&Bagging)
__________ parameter of regression helps in identifying the direction of
relationship between variables.-> Regression Coefficient

Classification is a __________ task.-> Data Analysis


__________ stage of data science process helps in converting raw data into a
machine-readable format.-> Data Cleaning
__________ stage of data science process helps in exploring and determining the
patterns from the data.-> Exploratory Data Analysis
Identify the algorithm that works based on the concept of clustering.-> K-Means
Identify the algorithm that works based on the concept of classification.-> None of
the options(Decision Tree,Random Forest & SVM)

Descriptive statistics is used in __________ datasets.->Sample


__________statistics provides inferences on population.-> Inferential
__________statistics provides the summary statistics of the data.-> Descriptive
Inferential statistics is used in __________ datasets.->Population

Identify the application of the Outlier Detection Method.-> Intrusion Detection


Regression can be used in predicting/forecasting Applications.-> True
Classification predicts the value of __________ variable.-> Categorical
__________ outlier significantly deviates based on the context selected.->
Contextual Outlier
Which of the following helps in measuring the dispersion range of the data?-> All
the options(Standard Deviation,Range,Variance&interquartile range)
Which among the following is/are (an) outlier detection method(s)?-> All the
options(Proximity-based approach,Clustering-based approach,Statistical
approach&Classification approach)
The science of collecting, interpreting, and analyzing data is known as
__________.-> Statistics
In Association Rules, the Antecedent and Consequent form a disjoint set.-> True

You might also like