The following topics are from: WeChat public account (artificial intelligence headlines)
- You mentioned in your resume that you have built a document mining system. What work have you done? Is it possible to implement document clustering using LDA technique in topic modeling?
- Suppose you have hundreds of megabytes of data files, including PDF files, text files, images, scanned PDF files, etc., please give a classification scheme.
- How do you read the content of scanned pdf files or written documents in image format?
- Why is Naive Bayes called "naive"?
- Please describe the Naive Bayes classifier in detail.
- What is deep learning? What is the difference between deep learning and machine learning? In unsupervised learning, how to do document clustering?
- How to find files related to certain query statements/searches?
- Explain TF-IDF technology.
- From my experience, the TF-IDF technique doesn't work well for document classification or clustering, how would you improve it?
- What is a Long Short Term Memory (LSTM) neural network? Explain how it works.
- What is a word2vec model?
- Explain mutable and immutable objects in python.
- What data structures have you used in python?
- How to handle multi-class classification problems with unbalanced datasets?
- How do you perform language recognition from a text sentence?
- How to represent hieroglyphs in Chinese or Japanese?
- How to design a chatbot? (I'm out of ideas, but I try to answer this with intent and feedback based on TF-IDF similarity.)
- Can a chatbot be designed using recurrent neural networks to respond to incoming questions with intent and answers .
- Suppose you design a chatbot using recurrent neural network or long short-term memory neural network on the Reddit dataset, it can provide 10 possible replies, how to choose the best reply, or how to delete other replies?
- Explain how support vector machines (SVMs) learn nonlinear boundaries.
- What is precision and recall? In medical diagnosis, which do you think is more important?
- Explain precision and recall.
- How to draw receiver operating characteristic curve (ROC curve)? What does the area under the ROC curve mean?
- How to plot ROC curve for multi-class classification task?
- List other metrics for multiclass classification tasks.
- What is sensitivity and specificity?
- What does "random" in random forest mean?
- How to do text classification?
- How to be sure that a text has been learned? Is it impossible to achieve without TF-IDF technology? (I replied to use an n-gram model (n=1, 2, 3, 4), and use the TF-IDF technique to create a long vector of counts)
- What else can you do with machine learning? (I suggest a combination of long short-term memory neural network and word2vec, or a 1D recurrent neural network combined with word2vec, for classification. But the interviewer wants to improve the machine learning based algorithm.)
- How does a neural network learn nonlinear shapes when it consists of linear nodes? What is the reason it learns nonlinear bounds?
- When training a decision tree, what are its parameters?
- To split at a certain node of the decision tree, what is the split standard?
- What is the formula for calculating the Gini coefficient?
- What is the formula for calculating entropy?
- How does a decision tree decide at which feature a split must be made?
- How to use the information gathered by mathematical calculations?
- Briefly describe the advantages of random forests.
- Briefly describe the boosting algorithm.
- How does gradient boosting work?
- Briefly describe the working principle of AdaBoost algorithm.
- Which kernels are used in SVM? What are the optimization techniques of SVM?
- How does SVM learn hyperplanes? Discuss the details of its mathematical operations.
- Talk about unsupervised learning? What algorithms are there?
- How to define the value of K in K-Means clustering algorithm?
- List at least 3 ways to define K in the K-Means clustering algorithm.
- Other than that, what clustering algorithms do you know?
- Introduce the DB-SCAM algorithm.
- Briefly describe the working principle of Hierarchical Agglomerative clustering.
- Explain the principal component analysis algorithm (PCA), and briefly describe the mathematical steps of using the PCA algorithm.
- 20. What are the disadvantages of using the PCA algorithm?
- Talk about how convolutional neural networks work? Details of its implementation are specified.
- Explain backpropagation in Convolutional Neural Networks.
- How do you deploy machine learning models?
- Most of the time we have to use C++ to build a machine learning model from scratch, can you do this?
- What is the scope of the sigmoid function?
- Name the package in scikit-learn that implements logistic regression.
- What is the mean and variance of the standard normal distribution?
- What data structures do you use in Python?
- What are the methods of text classification? How would you do the classification?
- Explain TF-IDF technology and its shortcomings, how to overcome the shortcomings of TF-IDF?
- What are Bigrams and Trigrams? Explain the TF-IDF technique of two-word collocation and three-word collocation with a text sentence.
- Give an example to illustrate the applications of word2vec.
- How to design a neural network? How to achieve "depth"? This is a basic neural network problem.
- Briefly describe how LSTM works. How does it remember text?
- What is Naive Bayes Classifier?
- What is the probability of tossing a coin 10 times and getting heads 4 times?
- How to get the index of an element in a Python list?
- How to merge two pandas datasets?
- From user behavior, you need to simulate a fraudulent activity, how would you solve this problem? This could be an anomaly detection problem or a classification problem!
- Decision tree or random forest, which one do you prefer?
- What is the difference between logistic regression and random forest?
- Would you use decision trees or random forests to solve classification problems? What are the advantages of random forests?
- In an imbalanced dataset, what model would you choose: Random Forest or Boosting? Why?
- What Boosting technologies do you know?
- Using supervised learning to solve a classification problem, which model would you choose? Let's say there are 40-50 categories!
- How do you use the Ensemble technique?
- Briefly describe how support vector machines (SVMs) work.
- What is Kernel? Briefly.
- How to implement nonlinear regression?
- What is Lasso Regression and Ridge Regression?
- You mentioned on your resume that you have done speech recognition in speeches. Specifically, what is your implementation method?
- What are Mel Frequency Cepstrums (MFCCs)?
- What is a Gaussian mixture model and how does it accomplish clustering?
- How to maximize expectations? Talk about its implementation steps.
- How are the probabilities in the GMM model calculated?
- How did you perform MAP adjustment for the GMM-UBM technique when doing pronunciation recognition?
- Talk about the I-vector technique you use.
- When analyzing context, what are the main factors?
- What is the difference between JFA and I-vector? Why choose I-vector over JFA?
- Have you ever used PLDA I-vector technology?
- Have you read Baidu's Deep Speaker paper?
- If you have two models to choose from, what is your basis for choosing? (Exploring techniques for model selection)
- Briefly describe the mathematical working principle of Bayesian Information Metric (BIC) and Akaike Information Quantity (AIC).
- How do Bayesian Information Metrics and Akaike Information Metrics work?
- What should I do if the data in the MFCC eigenvector matrix is missing?
- How to do speech recognition? What are the characteristics?
- Is your classifier a classifier for speech and music, or a classifier for speech and non-speech?
- How are deep neural networks used in speech analysis?