35 articles in this selection
|
|
| 2010/11/13 Data Discovery: Tell Me Something I Don’t Know by Neil Mason – Part I
“Data: Discovery: Tell me something that I don’t know” is the definition of data mining – discovering unexpected patterns and relationships in data. In this session Neil explores the approach to insight generation through data mining and predictive analytical technologies. Using real world case studies he covers the ins and outs of data mining analytics on digital data, which types of techniques can be used to solve which kinds of problems and some of the challenges that you will inevitable face along the way. Discover what your data can tell you if you ask it the right questions....
| |
|
|
| 2009/12/22 KNIME - Konstanz Information Miner
KNIME, pronounced [naim], is a modular data exploration platform that enables the user to visually create data flows (often referred to as pipelines), selectively execute some or all analysis steps, and later investigate the results through interactive views on data and models....
| |
|
|
| 2009/10/22 5 Ways to Cut Costs with Predictive Analytics
From a Predictive Analytics World keynote by Eric Siegel. Siegel's presentation offered a primer on five popular forms of predictive analytics: response modeling, response uplift modeling, churn modeling, churn uplift modeling and risk modeling. In the process of describing each approach for segmenting customers and improving marketing performance, Siegel offered the following tips....
| |
|
|
|
|
| 2009/08/18 Sentiment Analysis - Text Technologies
Discussion of sentiment analysis, which is the extraction of indicators of a writer's (or speaker's) opinions and emotional reactions (e.g., about a product feature or brand).
| |
|
|
|
|
| 2009/08/18 Orange - Data Mining Fruitful & Fun
Orange is a component-based framework for machine learning and data mining, intended for both experienced users and researchers in machine learning who want to develop and test their own algorithms while reusing as much of the code as possible, and for those just entering who can enjoy in powerful while easy-to-use visual programming environment....
| |
|
|
| 2009/08/17 The Future Of Work - It's Data, Baby
Consider this: IBM is preparing to expand its data analysis employee base from 200 to 4,000 - a staggering twenty-fold increase. You can be certain that a significant portion of this new work force will be untethered, distributed widely across the globe, implying that one of the core skills for a new generation of web workers will be analysis. So, if you're looking to sharpen up your data analysis skills, where do you start?...
| |
|
|
| 2009/08/10 The top 10 data mining mistakes
This list of John Elder, Ph.D is not so recent any more, but is worth looking at every once in a while in order not to forget them.
| |
|
|
| 2009/08/07 Weka 3
Weka is a collection of machine learning algorithms for data mining tasks. The algorithms can either be applied directly to a dataset or called from your own Java code. Weka contains tools for data pre-processing, classification, regression, clustering, association rules, and visualization. It is also well-suited for developing new machine learning schemes....
| |
|
|
| 2009/08/07 Teradata 13 focuses on advanced analytic performance
Last October I wrote about the Teradata 13 release of Teradata’s database management software. Teradata 13, which will be used across the various Teradata product lines, has now been announced for GCA (General Customer Availability). So far as I can tell, there were two main points of emphasis for Teradata 13: Performance and User Defined Functions...
| |
|
|
| 2009/07/28 IBM to Acquire SPSS Inc. to Provide Clients Predictive Analytics Capabilities
ARMONK, N.Y. and CHICAGO, July 28 /PRNewswire-FirstCall/ -- IBM (NYSE: IBM) and SPSS Inc. (Nasdaq: SPSS) today announced that the two companies have entered into a definitive merger agreement for IBM to acquire SPSS, a publicly-held company headquartered in Chicago, in an all cash transaction at a price of $50/share, resulting in a total cash consideration in the merger of approximately $1.2 billion. The acquisition is subject to SPSS shareholder approval, applicable regulatory clearances and other customary closing conditions. It is expected to close later in the second half of 2009....
| |
|
|
| 2009/07/28 GGobi data visualization system.
GGobi is an open source visualization program for exploring high-dimensional data. It provides highly dynamic and interactive graphics such as tours, as well as familiar graphics such as the scatterplot, barchart and parallel coordinates plots. Plots are interactive and linked with brushing and identification....
| |
|
|
| 2009/07/27 Why MapReduce matters to SQL data warehousing
Greenplum and Aster Data have both just announced the integration of MapReduce into their SQL MPP data warehouse products. So why do I think this could be a big deal? The short answer is "Because MapReduce offers dramatic performance gains in analytic application areas that still need great performance speed-up." Read on for the long answer....
| |
|
|
| 2009/07/23 Rise of the Data Scientist
As we've all read by now, Google's chief economist Hal Varian commented in January that the next sexy job in the next 10 years would be statisticians. Obviously, I whole-heartedly agree. Heck, I'd go a step further and say they're sexy now - mentally and physically. However, if you went on to read the rest of Varian's interview, you'd know that by statisticians, he actually meant it as a general title for someone who is able to extract information from large datasets and then present something of use to non-data experts....
| |
|
|
| 2009/07/17 And the Winner of the $1 Million Netflix Prize (Probably) Is ...
After nearly three years and entries from more than 50,000 contestants, a multinational team says that it has met the requirements to win the million-dollar Netflix Prize: It developed powerful algorithms that improve the movie recommendations made by Netflix's existing software by more than 10 percent....
| |
|
|
| 2009/07/16 SQL Server Data Mining Add-ins for Office 2007
Microsoft SQL Server 2005 Data Mining Add-ins for Microsoft Office 2007 (Data Mining Add-ins) allow you take advantage of SQL Server 2005 predictive analytics in Office Excel 2007 and Office Visio 2007.
| |
|
|
|
|
|
|
| 2009/06/19 Open Source Text Analytics
Open source is a great choice for many text analytics users, especially folks who have programming skills, who need custom capabilities or who are trying to get a feel for possibilities before committing themselves. Excellent options are available for all these users. Tools such as Gate, NLTK, R and RapidMiner share the low cost, power, flexibility and community that have driven adoption of open-source software by individual users and enterprises alike. RapidMiner even combines text processing with business intelligence (BI) and visualization functions....
| |
|
|
| 2009/06/16 Google Searches for Staffing Answers
Concerned a brain drain could hurt its long-term ability to compete, Google Inc. is tackling the problem with its typical tool: an algorithm. The Internet search giant recently began crunching data from employee reviews and promotion and pay histories in a mathematical formula Google says can identify which of its 20,000 employees are most likely to quit....
| |
|
|
| 2009/06/16 Weka 3 - Data Mining with Open Source Machine Learning Software in Java
Weka is a collection of machine learning algorithms for data mining tasks. The algorithms can either be applied directly to a dataset or called from your own Java code. Weka contains tools for data pre-processing, classification, regression, clustering, association rules, and visualization. It is also well-suited for developing new machine learning schemes. Weka is open source software issued under the GNU General Public License....
| |
|
|
|
|
| 2008/05/02 RapidMiner
RapidMiner (formerly YALE) is the world-wide leading open-source data mining solution due to the combination of its leading-edge technologies and its functional range. Applications of RapidMiner cover a wide range of real-world data mining tasks.
| |
|
|
|
|
| 2008/04/28 Complex Event Processing
A source of industry neutral information on applications, research, usecases, reference architectures, and developments in event processing, run by Prof David Luckham.
| |
|
|
| 2008/04/28 Coral8, Inc.
Coral8 provides an industry–leading Complex Event Processing platform that is: Easy to Develop: The robust SQL–based language enables rapid development of CEP applications. Easy to Deploy: The Coral8 server allows fast, low-cost deployment and rapid c...
| |
|
|
| 2008/03/20 First-Person Intelligence
Few people call themselves “consumers.” Consumers buy or use a product, service or solution. Period. The word connotes a one-way relationship between seller and buyer that fits poorly in today’s connected marketplace. “Customers,” however, do fa...
| |
|
|
| 2008/02/04 Ahead-of-the-Curve Careers
Cutting-edge careers are often exciting, and they offer a strong job market. Alas, the cutting edge too often turns out to be the bleeding edge, so here are some careers that, while relatively new, are already viable and promise further growth. They emerg...
| |
|
|
| 2008/01/28 Data Protection Day
The aim of the Data Protection Day is to give European citizens the chance to understand what personal data is collected and processed about them and why, and what their rights are with respect to this processing.
| |
|
|
| 2008/01/08 Business Intelligence That Works!
Dit is een themasite over het onderwerp business intelligence. Op deze site vindt u onder meer verwijzingen naar interessante nieuwsfeiten, artikelen en sites. De vraag die op deze site centraal staat is: Hoe behaal je zoveel mogelijk waarde uit business...
| |
|
|
|
|
|
|
| 2008/01/07 Echelon » Visualization of Social Network
What I did is write a program that is able to log in to a very popular German Social Networking website and grab some data from it. I grabbed the friends of my profile (only 2) their friends (about 100) and the friends of their friends (about 5000). I use...
| |
|
|