30 articles in this selection
| 2009/12/22 KNIME - Konstanz Information Miner
KNIME, pronounced [naim], is a modular data exploration platform that enables the user to visually create data flows (often referred to as pipelines), selectively execute some or all analysis steps, and later investigate the results through interactive views on data and models....
| 2009/12/09 How to understand risk in 13 clicks
What are we to make of all those stories that warn of lifestyle dangers and slap a giant percentage sign in the headline? Michael Blastland introduces the Risk-o-meter to his regular column.
| 2009/12/08 Leverage Statistical Control Limits
One factor that is not appreciated enough is that every metric that you report tends to have a natural "biorhythm", i.e. those metrics will fluctuate up or down and change due to "natural occurrences" that just happen (I can see some of you cringe! :)). These biorhythms are hard to understand, harder still to predict, and since many of us live in the Puzzle world rather than the Mystery world, we spin our cycles like crazy to understand the numbers and "explain" them to management so that they can take some action. Imagine getting a daily / weekly trend that goes up and down, and you have no idea what the heck is causing it, even after you have done your damnedest to isolate all the variables. The result of these natural biorhythms is that Analysts and Marketers do analysis and deep dives where none is necessary, some of us look "bad" because we can't explain the data, and there is a lack of faith in the ability of data to provide insights....
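A minimal sketch of the idea behind control limits, assuming the common convention of a center line at the mean with limits three standard deviations either side (the excerpt itself does not give a formula). The function names and the daily visit counts are hypothetical, chosen only for illustration.

```python
from statistics import mean, pstdev


def control_limits(baseline, sigmas=3.0):
    """Return (lower, center, upper) limits computed from a baseline series."""
    center = mean(baseline)
    spread = pstdev(baseline)  # population standard deviation of the baseline
    return center - sigmas * spread, center, center + sigmas * spread


def out_of_control(values, baseline, sigmas=3.0):
    """Return (index, value) pairs that fall outside the baseline's limits."""
    lower, _, upper = control_limits(baseline, sigmas)
    return [(i, v) for i, v in enumerate(values) if v < lower or v > upper]


# Hypothetical daily visit counts: a stable baseline period, then new readings.
baseline = [100, 104, 98, 101, 97, 103, 99, 102]
new_points = [101, 96, 130, 100]

lower, center, upper = control_limits(baseline)
flagged = out_of_control(new_points, baseline)
```

Readings that stay inside the limits are treated as natural fluctuation and need no explanation; only the flagged points would justify a deep dive.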
| 2009/08/18 Google Insights for Search
With Google Insights for Search, you can compare search volume patterns across specific regions, categories, time frames and properties.
| 2009/08/18 Orange - Data Mining Fruitful & Fun
Orange is a component-based framework for machine learning and data mining, intended both for experienced users and researchers in machine learning who want to develop and test their own algorithms while reusing as much of the code as possible, and for those just entering the field, who can enjoy a powerful yet easy-to-use visual programming environment....
| 2009/08/17 The Future Of Work - It's Data, Baby
Consider this: IBM is preparing to expand its data analysis employee base from 200 to 4,000 - a staggering twenty-fold increase. You can be certain that a significant portion of this new work force will be untethered, distributed widely across the globe, implying that one of the core skills for a new generation of web workers will be analysis. So, if you're looking to sharpen up your data analysis skills, where do you start?...
| 2009/08/07 For Today's Graduate, Just One Word - Statistics
The rising stature of statisticians, who can earn $125,000 at top companies in their first year after getting a doctorate, is a byproduct of the recent explosion of digital data. In field after field, computing and the Web are creating new realms of data to explore - sensor signals, surveillance tapes, social network chatter, public records and more. And the digital data surge only promises to accelerate, rising fivefold by 2012, according to a projection by IDC, a research firm....
| 2009/07/31 Microsoft Excel Tutorials
Microsoft Excel is very powerful software. The tutorials below introduce some of Excel's power to solve problems numerically, with or without programming, including the application of statistical methods in Excel.
| 2009/07/28 IBM to Acquire SPSS Inc. to Provide Clients Predictive Analytics Capabilities
ARMONK, N.Y. and CHICAGO, July 28 /PRNewswire-FirstCall/ -- IBM (NYSE: IBM) and SPSS Inc. (Nasdaq: SPSS) today announced that the two companies have entered into a definitive merger agreement for IBM to acquire SPSS, a publicly-held company headquartered in Chicago, in an all cash transaction at a price of $50/share, resulting in a total cash consideration in the merger of approximately $1.2 billion. The acquisition is subject to SPSS shareholder approval, applicable regulatory clearances and other customary closing conditions. It is expected to close later in the second half of 2009....
| 2009/07/28 GGobi data visualization system.
GGobi is an open source visualization program for exploring high-dimensional data. It provides highly dynamic and interactive graphics such as tours, as well as familiar graphics such as the scatterplot, barchart and parallel coordinates plots. Plots are interactive and linked with brushing and identification....
| 2009/07/23 Rise of the Data Scientist
As we've all read by now, Google's chief economist Hal Varian commented in January that the next sexy job in the next 10 years would be statisticians. Obviously, I whole-heartedly agree. Heck, I'd go a step further and say they're sexy now - mentally and physically. However, if you went on to read the rest of Varian's interview, you'd know that by statisticians, he actually meant it as a general title for someone who is able to extract information from large datasets and then present something of use to non-data experts....
| 2009/07/17 And the Winner of the $1 Million Netflix Prize (Probably) Is ...
After nearly three years and entries from more than 50,000 contestants, a multinational team says that it has met the requirements to win the million-dollar Netflix Prize: It developed powerful algorithms that improve the movie recommendations made by Netflix's existing software by more than 10 percent....
| 2009/07/14 Can Data Revitalize Journalism?
The ability to tap into big databases is an essential journalistic tool. It undoubtedly helped Bloomberg to reach its status in the financial information sector. Access to a world of cross-referenced historical data dramatically improves the journalist's ability to put events in perspective, quickly and accurately....
| 2009/07/08 Federal IT Dashboard
The IT Dashboard provides the public with an online window into the details of Federal information technology investments and provides users with the ability to track the progress of investments over time. The IT Dashboard displays data received from agency reports to the Office of Management and Budget (OMB), including general information on over 7,000 Federal IT investments and detailed data for nearly 800 of those investments that agencies classify as 'major.' The performance data used to track the 800 major IT investments is based on milestone information displayed in agency reports to OMB called 'Exhibit 300s.' Agency CIOs are responsible for evaluating and updating select data on a monthly basis, which is accomplished through interfaces provided on the website....
| 2009/07/06 The Nike Experiment: How the Shoe Giant Unleashed the Power of Personal Metrics
Few things illustrate the power and promise of Living by Numbers quite as clearly as the Nike+ system. By combining a dead-simple way to amass data with tools to use and share it, Nike has attracted the largest community of runners ever assembled - more than 1.2 million runners who have collectively tracked more than 130 million miles and burned more than 13 billion calories....
| 2009/06/17 Jeff Veen Talk: Designing for "Big Data"
A 20-minute talk by Jeff Veen from Small Batch, Inc., also known from WikiRank, which was originally given at the Web2.0 Expo in San Francisco a couple of weeks ago. During the talk, he focuses on some of the classic examples of information visualization (John Snow pump, Minard's map, the tube map, and so on), the issue of "decorating" data versus making it accessible, and the emerging challenge to empower lay people to participate in visualizing and analyzing their own data....
| 2009/06/16 Google Searches for Staffing Answers
Concerned a brain drain could hurt its long-term ability to compete, Google Inc. is tackling the problem with its typical tool: an algorithm. The Internet search giant recently began crunching data from employee reviews and promotion and pay histories in a mathematical formula Google says can identify which of its 20,000 employees are most likely to quit....
| 2009/06/16 Monte Carlo Simulation: Random Number Generation
Our example of Monte Carlo simulation in Excel will be a simplified sales forecast model. Each step of the analysis will be described in detail. The Scenario: Company XYZ wants to know how profitable it will be to market their new gadget, realizing there are many uncertainties associated with market size, expenses, and revenue. The Method: Use a Monte Carlo Simulation to estimate profit and evaluate risk....
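The method can be sketched outside Excel as well. The Python version below is an illustration only, not the tutorial's spreadsheet: the distributions, the figures for market size, price, cost and fixed expenses, and the function names are all hypothetical assumptions.

```python
import random
from statistics import mean


def simulate_profit(rng):
    """Draw one random scenario and return its profit."""
    # Hypothetical uncertain inputs; every figure here is an assumption.
    units_sold = rng.triangular(5_000, 20_000, 12_000)  # low, high, most likely
    unit_price = rng.uniform(18.0, 22.0)
    unit_cost = rng.uniform(9.0, 13.0)
    fixed_costs = 60_000.0
    return units_sold * (unit_price - unit_cost) - fixed_costs


def run_simulation(trials=10_000, seed=42):
    """Return (average profit, estimated probability of a loss)."""
    rng = random.Random(seed)  # seeded so repeated runs give the same result
    profits = [simulate_profit(rng) for _ in range(trials)]
    loss_probability = sum(p < 0 for p in profits) / trials
    return mean(profits), loss_probability


expected_profit, loss_probability = run_simulation()
```

The average over many trials estimates profit, while the share of trials ending below zero gives a simple measure of risk.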
| 2009/04/30 BeyeNETWORK: Bayes and Business Intelligence, Part 1
From a business intelligence perspective, a major advantage to the Bayesian approach is the learning that accumulates over time. Indeed from a BI perspective, the Bayesian model is best viewed as a systematic learning approach for intelligence and decision-making for business. The posterior probabilities for analysis one become the prior probabilities for analysis two; the posteriors for analysis two become the priors for analysis three, etc. Thus the Bayesian orientation promotes learning and better decision-making, all the while refining priors to be more reliable and less “subjective” over time....
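The chaining of posteriors into priors can be illustrated with a small sketch. It assumes a Beta-Binomial model of a conversion rate, which the article does not specify; the function names and the counts for each analysis are hypothetical.

```python
def update_beta(prior, successes, failures):
    """Return the Beta posterior (alpha, beta) after observing new counts."""
    alpha, beta = prior
    return alpha + successes, beta + failures


def beta_mean(params):
    """Mean of a Beta(alpha, beta) distribution."""
    alpha, beta = params
    return alpha / (alpha + beta)


# Beta(1, 1) is a flat starting prior; each tuple is (conversions, non-conversions)
# from one hypothetical analysis.
belief = (1, 1)
analyses = [(12, 88), (20, 80), (15, 85)]

for successes, failures in analyses:
    # The posterior from this analysis becomes the prior for the next one.
    belief = update_beta(belief, successes, failures)

estimated_rate = beta_mean(belief)
```

Under this model, updating one analysis at a time gives the same final belief as pooling all the data at once, which is why learning can accumulate incrementally.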
| 2008/03/05 UNdata
The innovative design allows a user to access a large number of UN databases either by browsing the data series or through a keyword search.
| 2008/01/07 Google Analytics
Google Analytics helps you find out what keywords attract your most desirable prospects, what advertising copy pulled the most responses, and what landing pages and content make the most money for you.
| 2008/01/05 Many Eyes Data Visualization
Many Eyes is a bet on the power of human visual intelligence to find patterns. Our goal is to "democratize" visualization and to enable a new social kind of data analysis. Jump right to our visualizations now, take a tour, or read on for a leisurely expla...