Summary of Open Source Libraries for Machine Learning


Machine Learning  2013-03-28 17:22:50 Released

  I have been studying data mining and machine learning for a while. On the data mining side, commercial software includes SAS, Clementine, Oracle's data mining components, and so on. For personal study, and because of licensing costs and the need to customize algorithms, open source data mining and machine learning software (libraries) is also very much needed. Below I introduce the more popular and commonly used open source machine learning libraries.

  I used MATLAB at school and, to be honest, it is really convenient: a model usually takes only a dozen to a few dozen lines of code. However, a licensed copy of MATLAB is expensive and poorly suited to commercial development, so it is not widely used in industry (it is more common in the communications industry and in research institutes). Instead, most experimental work in machine learning and data mining in industry is done in Java and Python. Python is closely related to C/C++, which makes it easy to extend. Java machine learning libraries are also very popular, mainly because of the JVM's role in big data and its advantages for parallelizing algorithms. The following libraries are recommended:

 

1. mloss.org, Machine Learning Open Source Software (a collection of academic and commercial open source machine learning software in various programming languages)

http://mloss.org

2. A machine learning resource directory I came across by chance (also very complete; between them, 1 and 2 cover essentially all the classic open source ML software)

http://www.dmoz.org/Computers/Artificial_Intelligence/Machine_Learning/Software/

3. libsvm (the leading library in the support vector machine world, needless to say; the work of Professor Lin of National Taiwan University)

http://www.csie.ntu.edu.tw/~cjlin/libsvm/

4. WEKA (the most comprehensive and easy-to-use Java-based open source software for machine learning algorithms)

http://www.cs.waikato.ac.nz/ml/weka/

5. scikit-learn (my favorite Python-based machine learning library: the code is very well written, the official documentation is thorough and full of examples, the algorithm coverage is broad, and development is active; strongly recommended)

https://pypi.python.org/pypi/scikit-learn/

6. OpenCV (the most powerful open source computer vision library, with a bright future; a must for image processing and pattern recognition. You can't just run experiments in MATLAB all day and lose touch with industry. It is hard to learn, though)

http://opencv.willowgarage.com/wiki/

7. Orange (machine learning software written in C++ with a Python interface; it has an attractive GUI, is easy to call, offers visualization, and lets you learn C++ and Python at the same time)

http://orange.biolab.si/

8. Mallet (a Java-based machine learning library, mainly for natural language processing; its Markov models and random fields are particularly well done, and it complements WEKA)

http://mallet.cs.umass.edu/

9. NLTK (an open source natural language processing library for Python; very easy to use yet powerful, with several classic O'Reilly tutorials)

http://nltk.org/

10. Lucene (Java-based; together with the related projects Nutch, Solr, Hadoop, Mahout, and so on, it is open source software that anyone working on information retrieval or search engines must learn, which also means learning Java)

http://lucene.apache.org/
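
For readers new to the information retrieval tools in item 10: the core data structure behind a full-text search engine is the inverted index, which maps each term to the documents that contain it. Below is a minimal pure-Python sketch of that idea. It is not Lucene's API; the function names `build_index` and `search` and the sample documents are made up for illustration.

```python
from collections import defaultdict


def build_index(documents):
    """Map each lowercase term to the set of document ids that contain it."""
    index = defaultdict(set)
    for doc_id, text in documents.items():
        for term in text.lower().split():
            index[term].add(doc_id)
    return index


def search(index, query):
    """Return ids of documents containing every term in the query (AND search)."""
    terms = query.lower().split()
    if not terms:
        return set()
    # Start from the first term's postings and intersect with the rest.
    result = set(index.get(terms[0], set()))
    for term in terms[1:]:
        result &= index.get(term, set())
    return result


# Hypothetical sample documents, keyed by an integer id.
docs = {
    1: "open source machine learning library",
    2: "open source search engine",
    3: "machine learning for search ranking",
}
index = build_index(docs)
print(sorted(search(index, "open source")))       # [1, 2]
print(sorted(search(index, "machine learning")))  # [1, 3]
```

A real engine adds tokenization, ranking, and on-disk storage on top of this, which is why a library is worth learning instead of writing your own.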

Additional:

1. mlpy (a Python module for machine learning; supports SVM, kNN, k-means, etc.)

http://mlpy.sourceforge.net/

2. Mahout (an Apache Software Foundation project that integrates naturally with Hadoop to run in parallel, and is very robust)

http://mahout.apache.org/

3. milk (a machine learning toolkit for Python, mainly for supervised learning, including SVM, kNN, and decision trees)

http://pypi.python.org/pypi/milk/

4. Octave (recommended in Andrew Ng's course; similar to MATLAB)

http://www.gnu.org/software/octave/
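
Several of the Python libraries above list k-nearest neighbors (kNN) among their algorithms. To give a feel for how short such a model can be, here is a minimal sketch using only the Python standard library (Python 3.8+ for `math.dist`). It does not use any of the listed libraries' APIs; the function name `knn_predict` and the toy data are assumptions made for illustration.

```python
import math
from collections import Counter


def knn_predict(train_points, train_labels, query, k=3):
    """Classify `query` by majority vote among its k nearest training points."""
    # Pair each training point's Euclidean distance to the query with its label.
    distances = [
        (math.dist(point, query), label)
        for point, label in zip(train_points, train_labels)
    ]
    # Keep the k closest neighbours.
    nearest = sorted(distances, key=lambda pair: pair[0])[:k]
    # Majority vote over the neighbours' labels.
    votes = Counter(label for _, label in nearest)
    return votes.most_common(1)[0][0]


# Toy data (made up): two well-separated clusters in the plane.
points = [(1.0, 1.0), (1.2, 0.8), (0.9, 1.1), (5.0, 5.0), (5.2, 4.9), (4.8, 5.1)]
labels = ["a", "a", "a", "b", "b", "b"]

print(knn_predict(points, labels, (1.1, 1.0)))  # a
print(knn_predict(points, labels, (5.1, 5.0)))  # b
```

In practice a library implementation is preferable: it handles larger data efficiently and offers distance weighting and other options this sketch leaves out.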

From: http://www.cnblogs.com/wuren/archive/2013/03/27/2985352.html

