Special Offer - Enroll Now and Get 2 Course at ₹25000/- Only Explore Now!

All Courses
Machine Learning Interview Questions and Answers

Machine Learning Interview Questions and Answers

March 1st, 2019

Machine Learning Interview Questions and Answers

In case you’re searching for Machine Learning Interview Questions and Answers for Experienced or Freshers, you are at the correct place. Machine Learning Question and Answers provided here will help the candidates to land in Data Science jobs in top-rated companies. You can easily get through the interviews and crack the different rounds just because the questions are gathers and published by experts. Machine learning questions over here are designed as per the candidate requirements and has the capability to improve your technical and programming skills. It is quite simple to gain knowledge on topics like Deep Learning, Kernel methods, Statistics & probability, Machine Learning Algorithms, Docker and Containers, and many more. By going through these question and answers, professionals like Data Scientist, Data Engineer, Data Analyst and NLP Engineers will be able to apply machine learning concepts efficiently on many aspects.

There is parcel of chances from many presumed organizations on the planet. The Machine Learning advertise is relied upon to develop to more than $5 billion by 2021, from just $180 million, as per Machine Learning industry gauges. In this way, despite everything you have the chance to push forward in your vocation in Machine Learning Development. Gangboard offers Advanced Machine Learning Interview Questions and answers that assist you in splitting your AWS interview and procure dream vocation as Machine Learning Developer.

Best Machine Learning Interview Questions and Answers

Do you believe that you have the right stuff to be a section in the advancement of future Machine Learning, the GangBoard is here to control you to sustain your vocation. Various fortune 1000 organizations around the world are utilizing the innovation of Machine Learning to meet the necessities of their customers. Machine Learning is being utilized as a part of numerous businesses. To have a great development in Machine Learning work, our page furnishes you with nitty-gritty data as Machine Learning prospective employee meeting questions and answers. Machine Learning Interview Questions and answers are prepared by 10+ years experienced industry experts. Machine Learning Interview Questions and answers are very useful to the Fresher or Experienced person who is looking for the new challenging job from the reputed company. Our Machine Learning Questions and answers are very simple and have more examples for your better understanding.

By this Machine Learning Interview Questions and answers, many students are got placed in many reputed companies with high package salary. So utilize our Machine Learning Interview Questions and answers to grow in your career. 

Q1) What do you understand by the Machine Learning?

Answer: It is the application of artificial intelligence that can provides systems are the ability to automatically can learn and improve from the experience without being explicitly programmed. Machine  learning focuses on the development of  computer programs that can be access to data and use it’s learn for themselves.

Q2) What are the difference between supervised and unsupervised machine learning?

Answer: Supervised learning is requires training labeled datas. For example, in order to the classification (a supervised learning task), you’ll need to the first label the data you’ll use to the train the model to classify data into your labeled groups. Unsupervised learning, in contrast, does not a require labeling data explicitly.

Q3) What is the difference between the Type I and Type II error?

Answer: Don’t think this as something high level of stuff, interviewers ask questions in such terms just to the know that you have all the bases and you are on the top.
Type I error is the false positive, while Type II is the  false negative. Type I error is claiming on something has to happened when it hasn’t. For the instance, telling an man he is pregnant. On the other hand, Type II error means you claim nothing is happened but in the  fact something . To exemplify, you tell an pregnant lady she isn’t carrying baby.

Q4) Are expected value and mean value different?

Answer: They are not different but the terms are used in the different contexts. Mean are  generally referred  to when talking about an probability distribution or sample population whereas expected value is the generally referred in the random of variable context.

Q5) What does P-value signify about the statistical data?

Answer: P-value is used to the determine the significance of the results after a hypothesis test in statistics. P-value helps to the readers to draw conclusions and is always between 0 and 1.
P- Value > 0.05 denotes weak to evidence against the null hypothesis which are means the null hypothesis cannot be rejected.
P-value <= 0.05 denotes strong to evidence against of the null hypothesis which means the null hypothesis can be rejected.
P-value=0.05 is the marginal value are indicating it is possible to go either way.

Q6) Do gradient descent methods of always converge to same point?

Answer: No, they do not because in some cases it reaches an local minima or a local optima points. You don’t reach to the global optima point. It depends on the data and starting the conditions.

Q7) What is the goal of A/B Testing?

Answer: It is a statistical hypothesis testing for the randomized experiment with two variables to A and B. The goal of A/B Testing is to the identify any changes to the web page to maximize or increase the outcome of an interest. An example for this could be identifying for the click through rate for the banner ad.

Q8) What is Machine Learning?

Answer: The simplest way to the answer this question is – we give the data and equation to the machine. Ask to the machine  look at the data and identify to the coefficient values in an equations.
For example for the linear regression y=mx+c, we give the data for  variable x, y and the machine learns about to the values of m and c from to the data.

Q9) Python or R – Which one would you prefer for the text analytics?

Answer: The best possible answer for this would be Python because it has to Pandas library that provides easy to use data of structures and high performance of data analysis tools.

Q10) What is kernel SVM?

Answer: Kernel SVM is the abbreviated version of kernel support vector of machine. Kernel methods are a class of algorithms for pattern analysis and the most common one of the kernel SVM.

Q11) What kind of error will be solved by organizing?

Answer: In mechanical learning, regulation is the process of introducing additional information as a result of an incorrect phenomenon or to avoid additional material. It is basically a reuse form, which evaluates or controls the value for zero. The regulating technique prevents the complexity or the flexible model to avoid the inappropriate risk.

Q12) What is data science?

Answer: Data Science uses automated methods to analyze and retrieve large quantities of data. By combining features of statistics, computer science, application mathematics and visualization, data science can alter the wide range of data generated by the new digital intelligence and new knowledge of digital age.

Q13) What is Logistic Recession? An example of when you recently used the logistic backlash?

Answer: The Logistic Recreation is often referred to as the Registration Model is a technique to predict binary effects predictive variables from a linear combination.
For example, if you want to predict whether a particular political leader should succeed or not. In this case, the end of the forecast is binary ie 0 or 1 (success / loss). Here the predictive variables are the amount spent for a particular candidate’s election campaign, the amount of time spent on the campaign, etc.

Q14) What are Recommended Systems?

Answer: Recommended Systems is a sub directory of information filtering systems, which predicts the preference or rankings offered by a user to a product. Recommendations are widely used in movies, news, research articles, products, social tips, music, etc.

Q15) What is the difference between the rule of governance and the character of the fruit trees?

Answer: The difference is that the research on decision making trees assesses the quality of a certain number of intermediate set standards, while evaluating only the value of the evaluators.

Q16) What is Periberan in machine stones?

Answer: In machine learning, Perception is a step in the classification of an input supervision in many potential non-binary releases.

Q17) Explain Two Parts of the Bayes on Logic Plan?

Answer: There are two elements in the Bayesian logic project. The first component is a logical one; It is a collection of the Bayesian Klaus package, which captures the domain’s characteristic structure. The second component is a criterion, which marks the amount of information about the domain.

Q18) What is Paysyni Networks (BN)?

Answer: Answer: The Poison Network is used to represent a graphical model for the probability relationship under the Variables.

Q19) Why is learning algorithm sometimes referred to as a laser learning algorithm?

Answer: The learning algorithm, based on music, is also referred to as a laser learning algorithm until they are aggravated by the stimulation or generalization process.

Q20) What are two types of methods that can handle SVM (support vector machine)?


  • Connecting binary classifier
  • Binary replacement with multiple courses

Q21) What to learn in computer?

Answer: To solve a particular computing plan, many models, such as classifiers or technicians, are strategically developed and connected. This process is known as group learning.

Q22) Why is learning in alcohol use?

Answer: Integration learning is used to improve the classification of a sample, prediction, and functional approximation.

Q23) When to use group learning?

Answer: Ensemble learning is used when you create a more accurate and independent component classifier for each other.

Q24) What are two forms of group systems?

There are two forms of group systems

  • Continuous Group Methods
  • co-operative systems

Q25) What is the general principle of a group system, what damage and inclusion?

Answer: The general principle of a group is to combine the computations of multiple models built with learning methodology to improve the weakness of a model. Group is a group to promote illegal assessment or classification. Increasing the method for reducing the essence of the integrated model is used continuously. Error and decreasing firing errors by reducing time varies.

Q26) What is the difference between taxonomy errors in regular order?

Answer: A learning algorithm can be distinguished into an expected error function and variation. Measuring a dependent period, comparing the average classroom prepared by the learning algorithm to the target dependence. The calculation of the duration of the varying time learning method provides a compatibility rate for a variety of exercises.

Q27) What is a Development Learning Algorithm in the Group?

Answer: The Advanced Learning System is an algorithmic ability to learn from the new data available since it has already created a database that has already been exported from the database.

Q28) What is PCA, KPCA and ICA?

Answer: The key feature is the extraction techniques used for the dimensional reduction of PCA (Primary Components Analysis), KPCA (Kernel-based Primary Component Analysis) and ICA (Independent Component Analysis).

Q29) What is the dimensional reduction in machine learning?

Answer: In mechanical learning and statistics, the transfer reduction is a process of reducing random variables in calculations, and the feature feature and feature extraction

Q30) What are Supplement Vector Machines?

Answer: Learning methods used for classification and recession analysis of vector machines.

Q31) What are the elements of relevant assessment strategies?

Answer: Key elements of the relevant assessment strategies

  • Data acquisition
  • Ground Trude Acquisition
  • cross-estimate technique
  • question type
  • metric marks
  • a significant test

Q32) What are the different mechanisms for series monitoring surveys?

Answer: There are different methods to solve continuous supervision learning problems

  • sliding window modes
  • Repeat sliding windows
  • hidden marco samples
  • Maximum eighty Marco models
  • Conditional random fields
  • graphic transformer networks

Q33) Robotics and Information Processing Areas Continuous Computational Problem?

Answer: Robotics and information processing areas are in places where there is a constant computation problem

  • fantasy learning
  • Computed computation
  • model-based reinforcement learning

Q34) What is statistical study?

Answer: Statistical learning techniques allow a function or predict from a set of permitted data that can make predictions about the future or future data. These techniques confirm the effectiveness of a learning perspective on future unobtrusive data based on the statistical assumption of data creation process.

Q35) What is PAC learning?

Answer: BAA (perhaps approximate) Learning Learning algorithm has been introduced to introduce learning methods and their statistical capabilities.

Q36) Are you different categories that you can classify the sequence learning process?


  • sequence
  • line generation
  • Row recognition
  • continuous conclusion

Q37) What are two techniques of machine learning?

There are two techniques for machine learning

  • Genetic programming
  • Learning stimulation

Q38) Give a popular use of machine learning that you see on a daily basis?

Answer: The machine uses machine learning that is implemented by major eCommerce websites.

Q39) Please explain trade-off of Bias & variance.

Answer: All we wanted to have is low bias & low variance. But in reality, both is actually a trade-off. We can simply term it as the error introduced between the bias and variance.

Q40) What is Gradient descent means?

Answer: GD is one of the first order optimization algorithm that works better for minimum function to take steps propositional to the negative gradient functions.

Q41) What is Overfitting? Please explain in laymen term

Answer: Overfitting is a problem occurred when we have low error in the training set. But produces high error in test or unseen data.

Q42) What is Underfitting? Please explain in laymen term

Answer: Underfitting is a problem, when we have low error in both training set and the testing set. Few algorithms works better for interpretations. But fails for better predictions.

Q43) What is Curse of dimensionality?

Answer: COD is state which is commonly referred to lack of intuitive understanding of multiple dimensions. If a user wants to produce better understanding on data COD will make limitations.

Q44) What are all the methods for standardization?


  • Range level standardization
  • Standard deviation level standardization

Q45) What is data normalization?

Answer: Data normalization is a common practice to get the data features weighted equally. It causes to lose data interpretability.

Q46) Is missing data is just blanks?

Answer: No. Not only the blanks, data points which has NA, NULL and also sometimes the corrupted data that has been recorded by mistake or given improper data by purpose.

Q47) List of few clustering algorithms you are familiar with.


  • K-means
  • K-means++
  • Hierarchical clustering

Q48) What is clustering?

Answer: Clustering technique is a segmentation process. It works whenever we don’t have the target variable and still wanted to have a groups created.

Q49) What is EDA to you?

Answer: EDA which refers to Exploratory Data Analysis is a process to understand the data prior getting it into machine learning pipeline.

Q50) Can you give some wise advice for selecting algorithms?

Answer: Selecting the algorithms required as per our problem is always tricky. But it is always good to start with linear regression for Regression and Logistic regression for Classification problems.

Q51) What is class imbalance?

Answer: Class imbalance is something which most of all the classification problem falls on. It is always good to check the number of observations for each target variable. To be precise, it is something like we get 990 cancer free patients and 10 cancer patients in the data set. While machine will learn a lot about those 990 cancer free patients. But high importance is for those 10 predictions.

Q52) When we can do predictive analysis?

Answer: Predictive and Prescriptive analytics comes into picture only when descriptive and diagnostic analytics is successful and provide some value to the business.

Q53) When we can provide insights in a project?

Answer: Once the affecting factors are found, let’s make some prediction with machine learning algorithms. Once we feel the model is making sense out of our data. We will prescribe useful insight.

Q54) Name any industry that is drastically affected by Data science?

Answer: The retail industry is one among the few which is drastically impacted by data science and business analytics.

Q55) How do DS help in retailers?

Answer: Data Science helps retailers stay ahead in competition or at least on par with their competitors on selling goods to customers and also predictive analytics help them solve problems like never before.

Q56) What is the best career advice can be given to fresher?

Answer: My advice would be to select a company where you can learn something new every single day. Like literally every single day should be a battle to learn something exciting and work on a problem that can transform the business. You should always find a trade-off in life for multiple things but don’t compromise on this.

Q57) What is SVM and how do we create a portfolio with it?

Answer: SVMs are used for classification problems, and they are quite interesting as well. Get a classification dataset from UCI ML repo and start working on your portfolio.

Q58) What is data visualization?

Answer: Data Visualization doesn’t mean you can only use bar charts and line charts to display everything. There are many unconventional charts to display data. The ultimate flexibility is the ability to change the backend data structure based on our front end requirements.

Q59) How to choose right chart type?

Answer: At times there won’t be much of freedom to create different charts because of backend data architecture or just because of business stakeholders stubborn affinity towards a chart type.

Q60) What is right to do with data visualization tools?

Answer: Recreating an excel table in tableau or any data visualization is an absolute waste of the tool’s capability. Instead, try finding a reason to highlight specific rows or for example, calculate the difference in % and color rows based on it to show a highlight table.

Q61) How to avoid Bias?

Answer: Bias can cause to feel or show inclination or prejudice for or against someone or something. Avoiding bias in machine learning is very important, and the last thing we would want is to create a model which will most of the times/always classify a non-defective product as a defective one.

Q62) What are the important outcomes of DS?


  • Applications, whereby we use the model to perform a task, ideally as accurately and effectively as possible.
  • Interpretation, whereby we use the model to gain insight into our data via the learned relationship between independent and dependent variables.

Q63) What is the trade-off between accuracy and interpretability?

Answer: There needs to be a trade-off between accuracy and interpretability. Neural networks spit out the best possible result, and we can’t ignore that just because we don’t understand the internal functioning of the model.

Q64) What is important? Accuracy or interpretability?

Answer: We can’t solve every business problem with an interpretable model and at the same time vice versa holds good as well.

Q65) How much domain knowledge is required to do DS?

Answer: Domain knowledge and model building experience comes handy in this kind of situations. I worked in a sales driver model and only when I understood the business value point, feature engineering became effective.

Q66) How much design will affect your work in DS?


  • Design thinking matters a lot in the business analytics space. There needs to be a purpose for any visualisation we as professionals create.
  • Business stakeholders will be fine with dashboards with only bar and line charts. Like seriously, you can use them to answer most of the questions, and they look familiar to users as well.

Q67) What is dumbbell chart?

Answer: Business Stakeholders won’t be even aware of dumbbell chart. To show the performance of a product between two years with contrasting colours will immediately grasp users attention than a regular bar chart.

Q68) What is the goal of a Data science dashboards?

Answer: The goal is always to provide easy and user friendly visualization to end users and for that we need to understand the end users requirements and how they are friendly with charts and graphs and overall dashboards and accordingly we have to deliver results and insights.

Q69) What is needed most? BI or DS?

Answer: Most of the companies need business intelligence, data analyst, data engineers and analysts more than data scientists at this point. Only when the infrastructure is built with known KPIs and the trends in years, someone can come in and work on the unknown variables to push the business in the right direction to make critical decisions.

Q70) What are all the important R packages?

Answer: Tidyverse, broom, and lubridate for most of my work in data wrangling phase. At times once the data wrangling is done, I have also moved the machine learning part to python for leveraging sckit-learn package.

Q71) Where do predictions depends on?

Answer: The insights/predictive results should not wholly depend on the beta coefficients of the model. It should be backed something more – Business and Statistics.

Q72) When do Simpson’s paradox occurs?

Answer: Simpson’s paradox occur while working on marketing problems with 100s of features impacting the sales unit. Just believing in the beta values might lead us to the wrong conclusion which can potentially cost the business to spend millions on different channels than the right ones.

Q73) How long will it take to build ML model?

Answer: Building a model doesn’t take much of your time but evaluating it and making it the right suitable model takes time and other elements as mentioned earlier.

Q74) Where can R complement R?

Answer: R will complement your learning from the stats book, and you can play with sample datasets like iris, mtcars to check out the importance of descriptive statistics

Q75) Is DS is actually business of science?

Answer: Data science is more “Business” than “Science”. Don’t emphasis on tools and technologies more than the problem itself. Understanding the problem requires a bit of business context. If not it will be like shooting arrows in the dark.

Q76) Where do most of DS project falls in?

Answer: Project with both technical feasibility and data availability but less business impact. – Most of the data science projects fall under this category.

Q77) How do a DS project go in poor data sourcing?

Answer: Project with high business impact but less or no availability to required data. – Poor data collection and management. With little guidance, these projects can answer essential questions.

Q78) Will DS help to make crucial business solutions?

Answer: Only a handful of data science projects have required technical feasibility, data availability and high business impact. Those are the projects that help the business make crucial decisions

Q79) What is extension in Jupyter notebook does?

Answer: You can add extensions to jupyter notebooks to prevent yourself from distractions. One of my favorite option/extension is the Zen mode. It hides the menu bar and makes us focus on the code itself. Plus knowing a few of the essential shortcuts can make us work more efficiently.

Q80) How would you prioritize your work as DS?

Answer: DS helps one to do the predictions based on existing data. So, it will help in various aspects like knowing the nature of business, helps in growing the business, can know customer needs based of past data, any kind of recommendations. Based on all this one can prioritize their work/business.

Q81) Will DS improve business all alone?

Answer: Data professionals should never work in silos. Our job might be the sexiest job of this century, but it indeed depends on a lot of business teams and technical teams in the organization. We are not master of everything to change things in a day. We need help from others, and for that, we need to ask them the right questions.

Q82) Is DS only to build and implement algorithms?

Answer: Data science is not only to build, test and implement models but most importantly, it is solving business challenges through data science. Need all the soft skills mentioned above.

Q83) How to start with Tableau in DS?

Answer: Tableau as a data visualization tool is easy to learn and takes time to master. All you need is Tableau Public version or Desktop trial version and a couple of Excel/CSV files.

Q84) Will creating visualizations using scripts in tableau supported?


  • creating visualizations using scripting languages when I used to extensively work on R. When I started moving my data visualization part of work to Tableau.
  • Tableau is not just a drag and drop play around to figure out all options tool. When it is used to its utmost potential can deliver better data visualization reports than any other tool.

Q85) What kind of understanding is important in DS?


  • Understanding the need to use mean, median and mode.
  • Understanding the need to use Inter-quartile ranges and not normal ranges.
  • Understanding the use of a line chart instead of a bar chart.

Q86) How to generate random number in scripts like R/PY?

Answer: Using any scripting language like R/Python, you can generate random values for attributes to analyze them. Again, if you’re looking to make business decisions out of a dataset, then it should be reliable and should also contain relevant values to make such decisions.

Q87) Will all DS projects result in a viable product?

Answer: Most importantly, not all data science projects will become a viable product which can support the business. So the lead should exactly know when to pull the plug on a project and when not to if project management for a data science project is not effective, high chances that the project will not yield the desired output.

Q88) What is inspection stage in a DS pipeline?

Answer: Inspection stage is where you can find the abnormalities in data, the inconsistencies, incompleteness, outliers .etc. Here it can be done using any scripting language or a tool like Tableau to quickly understand what is present and what is not in the backend data.

Q89) Why are the innocent demons ‘innocent’?

Answer: Since innocent ghosts are very ‘naïve’, all aspects of the data set are equally important and independent. As we know, this assumption is rare in the real world situation.

Q90) What is Z Score?

Answer: The z-score is the standard distortion count from a data point on average. But technically this is a source of how many constant changes are above or above the population. A z-score is known as a fixed value
and can be placed in a normal distribution ramp. It eliminates values from the database that are lower than Z times 3 times.

Q91) What is the remainder?

Answer: In the review analysis, the difference between the estimated value of the dependent variable (y) and the calculated value (ŷ) is called the remainder (d). Every data point is a remainder.
Remaining = Value Value – Estimated value e = y – ŷ
The total and the remaining remaining are equal to zero. Σ e = 0 and e = 0.

Q92) What are the major opinions of Linear regression?


  • A linear relationship is a Restricted Multi-collinearity value and then
  • Homoscedasticity dependence
  • Firstly, there must be a linear relationship between the dependent also independent To verify this relationship, a separate plot proves to be useful.
  • Secondly, there need no or very few multi-collinearity between the autonomous variables in The value must be restricted, which depends on the field requirement.
  • The third is that It is a unity of the most important suspicions
  • which asserts that the errors are uniformly

Q93) What means by heteroscedasticity?

AnswerHeteroscedasticity is specifically the contrast of homoscedasticity, which indicates that the error terms are not uniformly distributed. To change this phenomenon, normally, a log function is used.

Q94) What are the reasonable ways of increasing the accuracy of a linear regression model?

Answer: There could be many ways of developing the accuracy of linear regression, most commonly related ways are as follows:
Outlier Treatment:
Regression is on sensitive to outliers, so it becomes very essential to treat the outliers with proper values. Replacing the importance with mean, median, mode or percentile depending on the distribution can show to be useful.

Q95) What is means by odds ratio?

Answer: The odds ratio is the odds within two groups. For example, let’s pretend that we are trying to determine the effectiveness of medicine. We administered the medication to the ‘intervention’ organization and a position to the ‘control’ group.

Q96) What is the value of a baseline in a classification problem?

Answer: Most classification difficulties deal with imbalanced datasets. the number of certain types will be very low when connected to the removed species. In some cases, it is normal to have positive classes that are less than 1% of the entire sample. In such cases, an efficiency of 99% may appear very good but, in reality, it may not be.

Q97) Different methods of MLE also when are any method preferred?

Answer: The unconditional method is preferred to a number of parameters dataset value is below related to the number of instances. If the number of parameters is extremely correlated to the number of cases when reduced MLE is to be preferred. Statisticians are supported that restricted MLE is to be performed when in doubt. the Conditional MLE
is always providing that the results.

Q98) Why is accuracy not a good model for classification problems?

Answer: Accuracy is not a good basis for distribution problems because it provides equal significant value to both false positives and false negatives dataset value. Accuracy provides equal quality to both cases and
cannot distinguish between them.

Q99) What does mean by p-value?

Answer: When you make a hypothesis analysis in statistics, a p-value can help you discover this strength of your results. In a p-value is a number between 0 and 1. Based on that value it will register the intensity of the results. The part which is before trial is called the Null Hypothesis.

Q100) Explain about from the regularization is and why it is useful?

Answer: Regularization is the method of calculating a tuning parameter upon a method to produce a system in order to prevent overfitting. This appears multiple often made by combining a fixed multiple on an actual weight vector. The model predictions should later overcome the loss function determined on the regularized training set.

Q101) What is the forward selection of data pre-processing?

Answer: The perspective option is a rectification method that begins without any aspect of the model. In each iteration, we will add a better way to improve our model until we add a new variable to improve the performance of the model.

Q102) What is the removal of the recursive feature in data pre-processing?

Answer: This is a greedy optimization algorithm that finds a good style feature subset. It creates repetitive models and each reboot keeps aside the best or worse performance feature. This creates the next model with the left features until all features are exhausted. Then there will be elements based on the order to remove them.

Q103) What are the three stages for creating a model in machine learning?


  • Model building
  • Model test
  • Applying the model

Q104) Keep in mind that you are working in a data system, and explain whether you choose key variables.

Answer: Some methods are used to select the following critical variables:

  • Using the loso regression system.
  • Using the Random Forest, the plot variable imprtance chart.
  • Using linear lag.

Q105) How is KI different?

Answer: K-Recent neighboring countries have a classification algorithm, while k-object is an uncontrolled clustering algorithm. Although the mechanisms seem to look the same, you need data that you need to classify an unnamed point (neighboring area) to work with neighboring countries. K-material clustering requires only a single point of reference and a starting point: Algorithm can learn how to group the group into groups by taking unstoppable points and calculating the gap between different points.
The significant difference here is that the KNN has to be named for points, which require supervised learning, while the k-object does not – there is no supervision.

Q106) Is It the Most Important For You Model Model Accuracy or Model Performance?

Answer: This question tests your grip on the machine learning model performance nuances! Machine Learning Interview Questions are often headed towards the details. There are models with greater accuracy, which advance the power of the advance – how is it realized?
Well, model accuracy model performance is only a subset of how to do it, sometimes it’s a misguided guide. For example, if you find millions of models in a large database, if only a very small number of fraud cases, the most accurate model does not contradict any fraud. However, it will be ineffective in advance – insisting that there is no fraud on a model designed to detect fraud! Questions like these help you to demonstrate that you need to understand the model’s accuracy.

Q107) When Should You Use Taxonomy on Retreat?

Answer: Sorting creates a database for distinct values and strict categories, while you record the conclusions that allow you to distinguish the difference between individual points. You can categorize the consequences if you want to reflect the combination of data points in your database for certain specific sections. (For example, female names, when compared to male, female, male and female).

Q108) What are the main guidelines to avoid excesses?


  • Simplify the sample: You can reduce the transition by lower variables and parameters, thus eliminating some of the noise in training data.
  • Use k-folds cross-validation for cross-checking techniques.
  • Use 3- regulatory techniques such as LASOO, which are some sample parameters to be punished if they make the tablet.

Q109) How to handle unbalanced databases?

Answer: When you have an unbalanced database, for example, a classification test and 90% of data is in a class. This leads to problems: if there is no computing power in the other section of data data, 90%

Q110) What is the central trend?

Answer: The central trend is a value that attempts to describe the data set by identifying the position of the central within a set of measurement data. Therefore, the activities of the central tendencies are sometimes called central location operations. They are categorized as abstract statistics.
Example: average, average, pattern

Q111) When we use Pearson’s relationship co-efficient method?

Answer: Pearson communicates the linear relationship between two consecutive variables involved. Relationship linear is when the change in a variable is related to a proportional change in the other variable.
For example, a Pearson contact can be used to assess whether the increase in the temperature of your production facilities is associated with lower thickness of your chocolate coatings.

Q112) What is the standard deviation, how is it calculated?

Answer: Standard Disadvantage (SD) is a statistical measure, which captures the meanings of the meanings and rankings.
Step 1: Find the average.
Step 2: Find the average square of its distance for each data point.
Step 3: A total of values from step 2.
Step 4: Separate the number of data points.
Step 5: Take a square hunt.

Q113) Defined reinforcement learning?

Answer: Reinforcement Learning is ended effect is to maximize the differential reward signal. Reinforcement learning is stimulated by the experience of personal beings, it is based on any reward/penalty mechanism.

Q114) Explain Supervised Machinelearning?


  • Supervised learning it’s requirs training labeled data.
  • Supervised learning its handled regression and classification problems.
  • Regression problem to Predect the result with in continous output.
  • Classification Problem to predict results in a discrete output.
  • Suprvised learning Algorithm : SVM, Navie bayes, Decision tree, KNN Algorithm and Neural Network.

Q115) Explain Unsupervised Machine learning?

Answer: Unsupervised learning is consisting of input data without labeled responses.
Algorithm: Clustering, Aprior.

 Q116) What is a confusion matrix?

Answer: The confusion matrix contains 4 output providers by the binary classifier. Various measure, such as error rate, accuracy, precision and recall are derived from it confusion matrix.

Q117) What is linear regression?


  • Linear regression is modeled using a straight line.
  • Its used with continous variable and the output prediction value of the variable.
  • Accuracy: It Measured by loss, R squared, Adjusted R squared.

Q118) How can I retrieve an important part of data collection?

Answer: The distance from the remaining studies is limited to the limited violations. As a result, they can be flexible or disagreeable for any analysis in any analysis in the database. It is therefore important to detect and be harmful enough.When a 100% reassurance is due to a test/transcription/ etc error, they should only be rejected if they are exited. Otherwise, the removal of the outlines would have been underestimated.

Q119) What is underfitting?

Answer: Underfitting occurs when a statistical model or machine learning algorithm does not catch the basic trend of data. Instinctively, if the sample or algorithm does not match the data correctly, it shows the high independence, especially if it has shown a sample or algorithmic variance. The foundation is often a very simple model result.

Q120) When we use Pearson’s relationship co-efficient method?

Answer: Pearson communicates the linear relationship between two consecutive variables involved. Relationship linear is when the change in a variable is related to a proportional change in the other variable.
For example, a Pearson contact can be used to assess whether the increase in the temperature of your production facilities is associated with lower thickness of your chocolate coatings.

Q121) What is Z Score?

Answer: The z-score is the standard distortion count from a data point on average. But technically this is a source of how many constant changes are above or above the population. A z-score is known as a fixed value and can be placed in a normal distribution ramp. It eliminates values from the database that are lower than Z times 3 times.

Q122) What is the remainder?

Answer: In the review analysis, the difference between the estimated value of the dependent variable (y) and the calculated value (ŷ) is called the remainder (d). Every data point is a remainder.
Remaining = Value Value – Estimated value e = y – ŷ
The total and the remaining are equal to zero. Σ e = 0 and e = 0.

Q123) What is a Sample Model Test?

Answer: A sample T-test is used to check whether the population mean is significantly different from the value of some hypotheses.

Q124) What is F Statistics?

Answer: If you have a significant difference in the way between the two people you will find an FO point of value when you are running an ANOVA test or a regression analysis. This is just like a T-test a D statistic; If the A-T test is a variable statistically significant and will tell you if a F test variable is of significant significance.

Q125) What is Anuava?

Answer: ANOVA is used for comparison with three or more models.

  • One way is ANOVA (which is an independent variable).
  • Two way ANOVA (there are two distinct variables)

 Q126) Some important Measures of Skewness


  • Karl-Pearson coefficient of skewness
  • Bowley’s coefficient of skewness
  • Coefficient of skewness based on moments

Q127) Scatter is a good action trait.

Answer: An excellent measure of decay is satisfied with the following characteristics.

  • It should be well defined without the ambiguity.
  • It should be based on all observations of the data set.
  • Easy to understand and compute.
  • To pursue math treatment.
  • It should not be affected by fluctuations in the model.
  • It should not be affected by serious surveillance.

Q128) Difference between supervised and unsupervised machine learning?

Answer: Supervised learning is required to be labeled data. For example, in the system to do classification, you’ll require to first design the data content and you’ll relate to range train the model to create the data value process. Unsupervised learning, if it’s in contrast, does not need labeling data explicitly.

Q129) Difference between L1 and L2 regularization.

Answer: L2 regularization serves to increase error with all the terms, while L1 is also binary/sparse, including several variables specific being selected a 1 or 0 in weighting. L1 compares before installing some Laplacian earlier at the terms, while L2 agrees to each Gaussian prior.

Q130) Describe a hash table

Answer: A hash table is a data structure input value that returns an associative array. Key movements mapped to specific conditions through this application of a hash function. They continue normally done during tasks such as database indexing.

Q131) How would you evaluate a logistic regression model?

Answer: you have to demonstrate an understanding of something the typical purposes of logistic regression and bring up a few examples and use cases.

Q132) How would you handle an imbalanced dataset?


  • To Collect more data to even that imbalances in a specific dataset.
  • To Resample the dataset utility to adjust for imbalances dataset value.
  • Try a modified algorithm collectively on your dataset.

Q133) Why does overfitting happen?

Answer: The possibility of overfitting lives as specific criteria used over the model remains no choice the same as the criteria applied to decide the efficacy of a model.

Q134) What is inductive machine learning?

Answer: The inductive machine learning means the process of knowledge by examples, where a system, from a data set of identified situations, tries to convince a general rule.

Q135) Popular algorithms of Machine Learning?


  • Decision Trees
  • Neural Networks (back propagation)
  • Probabilistic networks
  • Nearest Neighbor
  • Support vector machines

Q136) Which algorithm uses sigmoid function?

Answer: Logisitc Regression

Q137) What is precision?

Answer: True positive/true positive +

Q138) How to check the model accuracy?

Ans: Using the evaluation metrics like accuracy, precision, recall, f1-score etc.,

Q139) Is linear regression and logistic regression belongs to same category?

Ans: No

Q140) What is the objective function for Knn?

Answer: Knn abbreviation: K-nearest neighbour.

Q141) Is clustering a supervised algorithm?

Answer: No

Q142) Is logistic regression a regression or classification

Answer: Classification

Q143) Does bias and variance trade off is important factor to check in the data?

Answer: Yes

Q144) Is L1 and L2 regularization are same?

Answer: No they are different because of their objective function.

Q145) Why do we use import statement?

Answer: It is used to import the in built functions.

Q146) Is Random Forest a tree based algorithm?

Answer: Yes

Q147) Do we call Knn a lazy algorithm?

Answer: Yes

Q148) What are the three main divisions in datascience?


  1. Machine Learning
  2. Deep Learning
  3. Artificial Intelligence

Q149) Does ID3 uses Entropy?

Answer: Yes

Q150) Does Radial basis kernel function is there in SVM?

Answer: yes

Q151) Does both bagging and boosting belongs to Ensemble Learning?

Answer: Yes

Q152) Can we randomly pick the number of clusters in clustering?

Answer: No, we have to chose the optimum number of clusters by ploting the “Elbow Curve”

Q153) Does Correlation and Causation is Same?

Answer: No its not same

Q154) What do we call for manipulating the features of the data ?

Answer: Feature Engineering

Q155) Do we always need more data for better result?

Answer: No

Q156) How we deal with missing value?

Answer: By doing missing value imputation

Q157) How can we deal with outliers?

Answer: By plotting IQR and then deleting the values which are away from the range.

Q158) Can we determine the most important feature in our data?

Answer: Yes

Q159) Can we find the variable importance using Random Forest algorithm?

Answer: Yes

Q160) Does PCA works on the logic of variance ?

Answer: Yes

Q161) Best pictorial graphs to show Correlation plot?

Answer: Heat Map

Q162) Which algorithm uses margin to classify the classes?

Answer: SVM

Q163) Which algorithm takes the data to the next dimension and then classify?

Answer: SVM

Q164) Which algorithm is used for reducing the feature ?

Answer: PCA

Q165) Does PCA a Supervised algorithm?

Answer: No

Q166) What is the major preprocessing step in distance based algorithm?

Answer: Normalization

Q167) Name any one distance based algorithm?

Answer: Clustering

Q168) Name any one Penalty based algorithm?

Answer: SVM

Q169) Random forest is bagging or boosting algorithm?

Answer: Bagging Algorithm

Q170) Xgboost is bagging or boosting algorithm?

Answer: Boosting Algorithm

Q171) For high variance present in data  which algorithm to use?

Answer: Bagging Algorithm

Q172) For High Bias present in data algorithm to use?

Answer: Boosting Algortihm

Q173) What leads to Underfitting in linear regression?

Answer: Poor Line of fit.

Q174) What is the Main objective of any data science problems?

Answer: To minimize the error.

Q175) What is next level of machine learning?

Answer: Deep learning and artificial intelligence.

Q176) Does very less data lead to best model?

Answer: No, it leads to underfitting.

Q177) Is doing mean imputation for missing value is always a best method?

Answer: No, its not a best method because mean can mislead if outliers are present.

Q178) Is it mandatory for the data to  always follow normal distribution?

Answer: Not exactly, but if it is in normal distribution the results will be better.

Q179) CLT full form:


Q180) RBF full form:

Answer: Radial Basis Kernel Function.

Q181) Does Kmeans and Kmeans++ is same?

Answer: No, Kmeanas++ uses different initation to calculate the centroid.

Q182) Name any one hyper parameter of Decision tree?

Answer: Number of tree, no.of.nodes etc.,

Q183) Is pruning always a good method to construct a tree?

Answer: No it depends on the problem and data.

Q184) Does Tree Based algorithm Overfits?

Answer: Yes

Q185) What do you understand by machine learning?

Answer: A branch of computer science that involves system programming to enhance and increase user experience is known as Machine Learning.

Q186) Give an example of Machine Learning?

Answer: A good example of Machine Learning would be in the case of Robots. Robots are able to perform and complete their tasks based on the information they accumulate from their sensors. Thus they automatically learn from the data provided.

Q187) What is the difference between data mining and machine learning?

Answer: Data mining is the basic process of getting information from unstructured data without any patterns assigned to them. Machine learning is the process which assigns algorithms and specifications in terms of programming to develop and design systems. These systems are meant to enhance learning and utilitarian purposes.

Q188) In the case of machine learning what is the meaning of “Overfitting”?

Answer: When there is a random error or noise produced due to excessive information overload, it is known as “Overfitting”.

Q189) When does overfitting usually occur?

Answer: Overfitting in machine learning usually occurs when the model is too complex or there are too many parameters included to keep track of.

Q190) Why does one see the occurrence of overfitting?

Answer: The occurrence of Overfitting is seen when there are different parameters used for training the model and different parameters used for gauging the efficiency of the same.

Q191) In order to avoid Overfitting what needs to be done?

Answer: As Overfitting usually occurs due to large & complex data models, the main idea is to use a smaller dataset.

Q192) What do you understand by inductive machine learning?

Answer: This kind of learning is learning by examples. A general instruction or rule is introduced by virtue of observation of situations.

Q193) State the five algorithms of machine learning?

Answer: The five algorithms of machine learning are as follows Decision Trees, Neutral Networks, Probabilistic Networks, Nearest Neighbor, Support Vector Machines.

Q194) State some of the algorithm techniques for Machine learning?

Answer: Some of the algorithm techniques for machine learning are as follows supervised learning, unsupervised learning, semi-supervised learning, reinforcement learning, transduction, learning to learn.

Q195) State the three stages which are necessary to make the model for machine learning?

Answer: The three stages which are required to build the model for machine learning are as follows…model building, model testing, applying the model.

Q196) How is supervised learning generally conducted?

Answer: The best way to acclimatise one to supervised learning is to divide the information into the training piece and the assessment piece.

Q197) What do you understand by the Training set and the Test Set?

Answer: A training set or an information set refers to examples given to the learner. The test set or assessment set is the method used to decipher how correctly the user has comprehended the information provided.

Q198) What are the various approaches for machine learning?

Answer: Some of the approaches of machine learning are as follows concept learning and classification learning, symbolic learning and statistical learning, inductive learning and analytical learning.

Q199) Which are the two branches of computer technology which are not classified as machine learning?

Answer: The two branches of computer technology which are not classified as machine learning are Artificial Intelligence and Rule-Based Inference.

Q200) What function does unsupervised learning have?

Answer: Unsupervised learning conducts the following function finding clusters of data, finding interesting directional patterns in data, cleaning up the existing database, finding new observations and finding new and different coordinates and correlated concepts.

Q201) What function does supervised learning have?

Answer: Supervised learning has the following functions… it has classifications, it has speech recognition, it involves regression and it shows time prediction series.

Q202) What do you understand by machine learning which is independent of algorithms?

Answer: This is a type of machine learning which is independent of any classification series, markers or categories.

Q203) What is the main difference between artificial intelligence and machine learning?

Answer: Machine learning is primarily based in algorithms which are designed strictly on information given by empirical data. Artificial learning encompasses machine learning, however in addition to that it also includes non empirical data like natural language processors, robotics, etc.

Q204) What do you understand by the classifier in machine learning?

Answer: A system that helps to input information, values, features and assimilates all to give one single value known as the class.

Q205) State some of the areas in which Pattern Recognition is used?

Answer: Some of the areas which uses Pattern Recognition are … computer vision, speech recognition, data mining, statistics, informal retrieval, bio-informatics.

Q206) What do you understand by genetic programming?

Answer: Genetic programming is the name given to the technique which is dependent on assessing and choosing the most prime choice amongst all the results provided.

Q207) State the meaning of Inductive Logic Programming in machine learning?

Answer: A subfield of machine learning which uses logic to represent background knowledge and its examples, is known as Inductive Logic Programming.

Q208) In supervised learning, state the methods used for calibration?

Answer: In supervised learning the two methods which are used for calibration are known as Platt Calibration and Isotonic Regression.

Q209) To prevent Overfitting, which of these two methods is usually chosen?

Answer: In order to prevent Overfitting the method that is usually preferred is Isotonic Regression.

Q210) What do you understand by the term Perceptron in Machine Learning?

Answer: When an algorithm needs to be placed into a nonbinary output, Perceptron is that algorithm which is used in supervised classification.

Q211) A support vector machine (SVM) handles which two classification methods?

Answer: The two classification methods are as follows… combining binary classifies and modification of binary for the inclusion of multi class learning.

Q212) What do you understand by ensemble learning?

Answer: When multiple models, classifiers, experts are combined or specifically generated to solve complex programs, it is known as ensemble learning.

Q213) When is ensemble learning generally utilized?

Answer: When each component classifier is more precise and completely independent from each other, that is when ensemble learning is used.

Q214) State the two types of ensemble learning methodologies?

Answer: The two types of ensemble learning methodologies are sequential ensemble method and parallel ensemble method.

Q215) State some of the key components of relational evaluation technique?

Answer: Some of the key components of relational evaluation techniques are as follows data acquisition, ground truth acquisition, cross validation technique, query type, scoring metrics and significance test.

Q216) State some of the methods of sequential supervised learning?

Answer: Some of the methods of sequential supervised learning are as follows…sliding-window methods, recurrent sliding windows, hidden marrow models, conditional random fields and graph transformer methods.

Q217) In robotics where does the problem of sequential prediction arise?

Answer: The areas in robotics where the problem of sequential prediction arises are as follows structured prediction, imitation learning and model based reinforcement learning.