7. A researcher wants to study the association between gender and using a mobile phone. Data collected for this study will be ______
a) Qualitative data
b) Quantitative data
c) Continuous data
d) Classified data
8. Primary way to collect DATA (Data Gathering process)?
a) Experiment
b) Survey
c) Interview
d) Observation
9. The data scientist will use ______ for predictive modelling?
a) Artificial Intelligence
b) Machine Learning
c) Training Set
d) Deep Learning
10. Which one does NOT belong with Classification loss?
a) Log loss
b) Mean Absolute Error
c) Exponential Loss
d) Hinge Loss
11. Which process does NOT come under Capstone Project?
a) AI Model
b) AI Project Cycle
c) Deployment
d) Data Gathering
12. Which one does NOT belong with Regression loss?
a) Log Loss
b) Mean Absolute Error
c) Log cosh Loss
d) Quantile Loss
13. Choose Correct Option
a) Scope >> Acquire >> Explore >> Prepare >> Model >> Assess >> Deploy >> Batch
b) Scope >> Acquire >> Explore >> Prepare >> Model >> Deploy>> Real Time >> Batch
c. Scope >> Acquire >> Prepare >> Assess >> Deploy >> Batch >> Real Time >> Explore
d. Scope >> Acquire >> Explore >> Model >> Prepare >>Assess >> Deploy >> Batch
14. Adding a non-important feature to a linear regression model may result in.
(i) Increase in R-square
(ii) Decrease in R-square
a) Only (i) is correct
b) Only (ii) is correct
c) Either (i) or (ii)
d) Neither (i) nor (ii)
15. Which of the following options is/are true for K-fold cross-validation?
(i) Increase in K will result in higher time required to cross validate the result.
(ii) Higher values of K will result in higher confidence on the cross-validation result as compared to lower value of K.
(iii) If K=N, then it is called Leave one out cross validation, where N is the number of observations
a) (i) and (ii)
b) (ii) and (iii)
c) (i) and (iii)
d) (i), (ii) and (iii)
16. Which of these statements about deep learning programming frameworks are true?
(i) A programming framework allows you to code up deep learning algorithms with typically fewer lines of code than a lower-level language such as Python.
(ii) Even if a project is currently open source, good governance of the project helps ensure that the it remains open even in the long term, rather than become closed or modified to benefit only one company.
(iii) Deep learning programming frameworks require cloud-based machines to run.
a. (i)
b. (i) & (ii)
c. (i), (ii) & (iii)
d. (i) & (iii)
17. Choose Correct Option
a) Data Requirements >> Data Collection >> Data understanding >> Data preparation
b) Data Requirements >> Data understanding >> Data Collection >> Data preparation
c) Data Requirement >> Data Deployment >> Data Collection >> Data Gathering
d) Data Collection >> Data Request >> Data Filtering >> Data Evaluation
18. In AI development which framework is used?
a) Scikit-learn
b) Tkinter
c) PyCharm
d) Matplotlib
19. If your Neural Network model seems to have high variance, what of the following would be promising things to try?
a) Make the Neural Network deeper
b) Get more training data
c) Get more test data
d) Increase the number of units in each hidden layer
20. Which stage in Design Thinking missing [Prototype, Ideate, Test, Define]
a) Evaluation
b) Empathies
c) Evolution
d) Enrichment
21. Why do we normalize the inputs x?
a) Normalization is another word for regularization–It helps to reduce variance
b) It makes it easier to visualize the data
c) It makes the cost function faster to optimize
d) It makes the parameter initialization faster
22. Which language is Most suitable for developing AI?
a) Kotlin
b) Swift
c) Python
d) HTML
23. A random sample of n=6 taken from the population has the elements 6, 10, 13, 14, 18, 20. Then, which option is False?
a) Point estimate for population mean is 13.5
b) Point estimate for population standard deviation is 4.68
c) Point estimate for population standard deviation is 3.5
d) Point estimate for standard error of mean is 1.91
24. Which of the following statement is False in the case of the KNN Algorithm?
a) For a very large value of K, points from other classes may be included in the neighborhood.
b) For the very small value of K, the algorithm is very sensitive to noise.
c) KNN is used only for classification problem statements.
d) KNN is a lazy learner.
25. The following data is used to apply a linear regression algorithm with least squares regression line Y=a1X. Then, the approximate value of a1 is given by:(XIndependent variable, Y-Dependent variable)
| X |
1 |
20 |
30 |
40 |
| Y |
1 |
400 |
800 |
1300 |
SECTION C - Competency Based Questions
Answer any 5 questions out of the given 7 questions. (1X5=5 Marks)
27. Which of the following statements is/are INCORRECT
(i) The volume of test data can be large, which presents complexities
ii) Your testing team should test the AI and ML algorithms keeping model validation, successful learnability, and algorithm effectiveness in mind
(iii) Test data should include all irrelevant subsets of training data, i.e., the data you will use for training the AI system.
a) None of the Below
b) (ii)
c) (iii)
d) (i), (ii) & (iii)
28. Which of the following is FALSE about Correlation and Covariance?
a) A zero correlation does not necessarily imply independence between variables
b) Correlation and covariance values are the same
c) The covariance and correlation are always the same sign
d) Correlation is the standardized version of Covariance
29. Which of these is NOT analytic based on type of question?
a) Descriptive
b) Statistical Analysis
c) Forecasting
d) Data evaluation
30. Which of the following statements is/are INCORRECT:
(i) Different transforms of the data used to train the same machine learning model
(ii) Different machine learning models cannot be trained on the same data
(iii) Different configurations for a machine learning model trained on the same data
a) (i)
b) (ii)
c) Both (ii) & (iii)
d) Both (i) & (ii)
31. Which of the following is FALSE about Deep Learning and Machine Learning algorithms?
a) Deep Learning algorithms work efficiently on a high amount of data
b) Feature Extraction needs to be done manually in both ML and DL algorithms
c) Deep Learning algorithms are best suited for unstructured data
d) Deep Learning algorithms require high computational power
32. If the problem is based on probabilities of an action, then which analytic approach can
be used?
a) Predictive Model
b) Prescriptive
c) Diagnostic
d) Descriptive
33. Which of the following is FALSE for neural networks?
a) Artificial neurons are similar in operation to biological neurons
b) Training time for a neural network depends on network size
c) Neural networks can be simulated on conventional computers
d) The basic unit of neural networks are neurons