Orange, RapidMiner, Weka, JHepWork, KNIM, five free and open source data mining software

Orange

Orange is a component-based data mining and machine learning software suite, which is friendly and powerful, fast and versatile visual programming front-end for browsing data analysis and visualization, base binding Python for script development . It contains a complete set of components for data preprocessing and provides functions for data accounting, transition, modeling, schema evaluation and exploration. It is developed by C++ and Python, and its graphics library is developed by the cross-platform Qt framework.

RapidMiner

RapidMiner , formerly known as YALE (Yet Another Learning Environment), which is an experimental environment for machine learning and data mining and analysis, is also used to study real-world data mining. The experiments it provides are composed of a large number of operators, and these operators are recorded by detailed XML files and displayed by RapidMiner's graphical user interface. RapidMiner provides more than 500 operators for the main machine learning process, and it combines the learning scheme and the property evaluator of the Weka learning environment. It is an independent tool that can be used for data analysis, and it is also a data mining engine that can be integrated into your product.

Weka

Weka (Waikato Environment for Knowledge Analysis) developed by Java is a well-known machine learning machine software that supports several classic data mining tasks, notably data preprocessing, clustering, classification, regression, virtualization, and feature selection. Its technology is based on the assumption that data is in a single file or association, where each data point is labeled with a number of attributes. Weka uses Java's database connection capability to access SQL databases and process the query results of a database. Its main user interface is Explorer, which also supports the command line with the same function, or a component-based knowledge flow interface.

JHepWork

jHepWork is a free open source data analysis framework designed for scientists, engineers and students. It mainly uses open source libraries to create a data analysis environment and provides a rich user interface to compete with those paid software . It is primarily intended for 2D and 3D graphics for scientific computing, and includes Java implementations of mathematical science libraries, random numbers, and other data mining algorithms. jHepWork is based on a high-level programming language Jython, of course, Java code can also be used to call jHepWork's mathematics and graphics library.

KNIME

KNIME (Konstanz Information Miner) is a user-friendly, intelligent, and rich open source data integration, data processing, data analysis and data mining platform. It gives users the ability to visually create data flows or data pipelines, optionally run some or all of the analysis steps, and later explore the results, models, and interactive views. KNIME is written in Java, which is based on Eclipse and provides more functions through plug-ins. Through the plugin file, users can add processing modules for files, images, and time series, and can be integrated into other various open source projects, such as: R language, Weka, Chemistry Development Kit, and LibSVM.





RapidMiner is the world's leading data mining solution that is technologically advanced to a very large extent. It covers a wide range of data mining tasks, including various data arts, and can simplify the design and evaluation of data mining processes.

Functions and Features
Data mining technology and libraries are provided free of charge
100% in Java code (running on the operating system)
The data mining process is simple, powerful and intuitive
Internal XML ensures a standardized format for expressing and exchanging the data mining process
Can automated with a simple scripting language Large-scale process
Multi-level data view to ensure valid and transparent data
Interactive prototyping of GUI Command line
(batch mode) Automatic large-scale application
Java API (Application Programming Interface)
Simple plug-in and promotion mechanism
Powerful visualization engine, The visual modeling of many cutting-edge high-dimensional data
supported by more than 400 data mining operators
has been successfully applied in many different application areas, including text mining, multimedia mining, functional design, data flow mining, integrated development methods and distributed data mining

http://www.rapidminerchina.com/products/

Guess you like

Origin blog.csdn.net/bruce__ray/article/details/49699461