16 articles in this selection
| 2009/08/07 Weka 3
Weka is a collection of machine learning algorithms for data mining tasks. The algorithms can either be applied directly to a dataset or called from your own Java code. Weka contains tools for data pre-processing, classification, regression, clustering, association rules, and visualization. It is also well-suited for developing new machine learning schemes....
| |
|
|
| 2009/07/27 Wikipedia on Hadoop
Apache Hadoop is a free Java software framework that supports data intensive distributed applications. It enables applications to work with thousands of nodes and petabytes of data. Hadoop was inspired by Google's MapReduce and Google File System (GFS) papers....
| |
|
|
|
|
| 2009/07/14 Regain
Regain is a search engine similar to web search engines like Google, with the difference that you don't search the web, but your own files and documents. Using regain you can search through large portions of data (several gigabytes!) in split seconds!
| |
|
|
| 2009/07/05 Quint-2 Browser
The ISO 9126 (1991) standard defines six quality characteristics for software products, being Functionality, Usability, Efficiency, Reliability, Maintainability en Portability. The appendix of the standard refines each of these characteristics into a number of sub characteristics. Extended ISO 9126 or Quint is an extension of the ISO 9126 standard for product quality. It adds 11 sub characteristics to the 21 of ISO 9126 that are often used in daily practice. Quint - and its origin and intended use - are described in the book 'Kwaliteit van Softwareprodukten - Ervaringen met een kwaliteitsmodel' from 1996. The model from the book elaborates work from the Quint project and is therefore also known as Quint-2....
| |
|
|
|
|
| 2009/06/30 Manning: Collective Intelligence in Action
Collective Intelligence in Action is a hands-on guidebook for implementing collective-intelligence concepts using Java. It is the first Java-based book to emphasize the underlying algorithms and technical implementation of vital data gathering and mining techniques like analyzing trends, discovering relationships, and making predictions. It provides a pragmatic approach to personalization by combining content-based analysis with collaborative approaches....
| |
|
|
| 2009/06/30 Apache Lucene
Apache Lucene is a high-performance, full-featured text search engine library written entirely in Java. It is a technology suitable for nearly any application that requires full-text search, especially cross-platform.
| |
|
|
| 2009/06/16 Weka 3 - Data Mining with Open Source Machine Learning Software in Java
Weka is a collection of machine learning algorithms for data mining tasks. The algorithms can either be applied directly to a dataset or called from your own Java code. Weka contains tools for data pre-processing, classification, regression, clustering, association rules, and visualization. It is also well-suited for developing new machine learning schemes. Weka is open source software issued under the GNU General Public License....
| |
|
|
| 2008/05/02 RapidMiner
RapidMiner (formerly YALE) is the world-wide leading open-source data mining solution due to the combination of its leading-edge technologies and its functional range. Applications of RapidMiner cover a wide range of real-world data mining tasks.
| |
|
|
|
|
|
|
| 2008/01/10 Eclipse BIRT Home
BIRT is an open source Eclipse-based reporting system that integrates with your Java/J2EE application to produce compelling reports.
| |
|
|
| 2008/01/10 JasperSoft - Open Source Business Intelligence
Standalone and Operational BI for reporting, insight and analysis of daily business operations. Share, schedule and distribute critical data - Empowering users without relying on IT.
| |
|
|
| 2008/01/05 Clover.ETL open source data integration tool
clover.ETL and clover.GUI are ETL tools meant for data transformation and data integration. They are based on Java technology and therefore platform independent and resource-efficient. clover.ETL is an open source project, released under LGPL License. clo...
| |
|
|
| 2008/01/05 Talend open source data integration software
Talend, the first provider of open source data integration software, leverages the open source model to make data integration available to all types of organizations, regardless of their size, level of expertise or budgetary constraints. Talend’s soluti...
| |
|
|