Saturday, July 14, 2007

dataminingblog.com: List of data mining blogs

Data Mining Research - www.dataminingblog.com: List of data mining blogs

When Data and Decisions Don't Match--Little League Baseball

"Maybe it's because I used to pitch in Little League when I was a kid,
but this article in the July 1 Union Tribune really struck me. It
describes how injuries to Little League pitchers has increased
significantly over the past 10 years from one a week to 3-4 a day with
elbow and/or shoulder injuries from baseball. What's the cause?
Apparently, as the article indicates, it is from "overuse" (i.e.,
pitchers pitching too much). And here is the key statistic:...."

http://abbottanalytics.blogspot.com/

Data Mining as a Service: The Prediction is Not in the Box - DMReview

"Why were there so many failed enterprise customer relationship
management (CRM) implementations? Everyone from executive management
teams to database administrators have their own point of view on where
the failure occurred: upper management didn't buy in; the software
promised to do more than it could; the implementation took too long;
the hidden costs were too high; and countless other reasons...."

http://www.dmreview.com/article_sub.cfm?articleId=1087703

www.dataminingblog.com: Why is Matlab the best language for data mining?

"While starting a new project a few days ago, I had to answer the
recurrent question: What language do I choose? In research, we have
the opportunity of choosing any language, free or not. This is usually
not the case in industry where the language can be fixed for many
reasons (price, customer choice, boss choice, same as existing system,
etc.)...."

http://dataminingresearch.blogspot.com/2007/07/why-is-matlab-best-language-for-data.html

Thursday, July 12, 2007

WSDM 2008

WSDM (pronounced "wisdom") is a brand new ACM conference intended to
be complementary to the World Wide Web Conference tracks in search and
data mining. The pace of innovation in these areas has reached a
level that requires more than one premier annual venue. WSDM invites
original, high quality submissions related to search and data mining
on the Web, with an emphasis on practical but principled novel models,
algorithm design and analysis, economics implications, and in-depth
experimental analysis of accuracy and performance. The goal is to make
WSDM a focused meeting with a single research paper session through
2-3 days. WSDM will also invite keynote talks from some of the best
minds from industrial and academic research.

http://wsdm2008.org/

What Data Mining Can and Can't Do

Peter Fader, Wharton's quantitative marketing wizard, has a message
for CIOs: Stop collecting so much customer data, and stop misusing
data mining.

http://www.cioinsight.com/article2/0,1540,2146294,00.asp

Statistical Aspects of Data Mining (Stats 202) Day 1

This is the Google campus version of Stats 202 which is being taught
at Stanford this summer. I will follow the material from the Stanford
class very closely. That material can be found at www.stats202.com.
The main topics are exploring and visualizing data, association
analysis, classification, and clustering. The textbook is Introduction
to Data Mining by Tan, Steinbach and Kumar. Googlers are welcome to attend any classes which they think might be of interest to them.

http://www.bestechvideos.com/2007/07/06/statistical-aspects-of-data-mining-stats-202-day-1