
Drilling Down With A Data Mining Pioneer

Author: Nathan Segal | Date: 2007-02-07
Dr. Usama Fayyad is a data mining pioneer who began working in the field in 1989. He got his start at NASA's Jet Propulsion Laboratory, compiling data on astronomical phenomena such as volcanoes and star systems. From there he went on to Microsoft Research. Frustrated by problems he saw in the data mining industry, he left Microsoft and founded digiMine to tackle data mining and data warehousing. In this article, he shares his thoughts about the industry and how to get the most out of your data.

"There are two sides to data mining, descriptive and predictive," says Dr. Fayyad. "Descriptive data mining reorganizes the data, digging deeper into it and pulling out patterns, such as customer similarity, which allows you to create a short description about that group of customers.


"Predictive data mining looks for the best prediction, such as the best product to pitch to a customer. You won't get much insight, but it increases the performance, the ROI. Using both techniques will give you the best results.
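To make the distinction concrete, here is a minimal sketch of both sides on a toy data set. Everything in it is an assumption for illustration: the customer figures, the feature choice (monthly spend and visits), and the use of plain k-means for the descriptive side and a k-nearest-neighbour vote for the predictive side. Fayyad does not name specific algorithms in the interview.

```python
from collections import Counter
from math import dist

# Hypothetical toy data: (monthly_spend, visits_per_month) -> product last bought.
CUSTOMERS = [
    ((20.0, 1.0), "basic"),
    ((25.0, 2.0), "basic"),
    ((22.0, 1.0), "basic"),
    ((90.0, 8.0), "premium"),
    ((95.0, 9.0), "premium"),
    ((85.0, 7.0), "premium"),
]


def kmeans(points, centroids, rounds=10):
    """Descriptive: group similar points around the given starting centroids."""
    groups = [[] for _ in centroids]
    for _ in range(rounds):
        groups = [[] for _ in centroids]
        for p in points:
            nearest = min(range(len(centroids)), key=lambda i: dist(p, centroids[i]))
            groups[nearest].append(p)
        # Move each centroid to the mean of its group; keep it if the group is empty.
        centroids = [
            tuple(sum(axis) / len(g) for axis in zip(*g)) if g else centroids[i]
            for i, g in enumerate(groups)
        ]
    return centroids, groups


def predict_product(features, labelled, k=3):
    """Predictive: majority vote among the k nearest labelled customers."""
    nearest = sorted(labelled, key=lambda row: dist(features, row[0]))[:k]
    return Counter(label for _, label in nearest).most_common(1)[0][0]


points = [features for features, _ in CUSTOMERS]
centroids, groups = kmeans(points, centroids=[points[0], points[3]])

# Descriptive output: a short profile of each customer segment.
for centre, group in zip(centroids, groups):
    print(f"segment of {len(group)} customers, average profile {centre}")

# Predictive output: the product to pitch to a new, unseen customer.
print(predict_product((88.0, 8.0), CUSTOMERS))
```

The clustering step yields a short description of each group (its average profile), while the prediction step returns only an answer with no explanation attached, which mirrors the trade-off between insight and performance described above.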


"An important issue today is SQL, the standard interface for databases, which has proven to be the wrong interface," Fayyad says. "As an example, let's say you worked for a telecommunications company, and you want to find records about cell phone fraud. Well, guess what? These naturally asked questions cannot be answered by today's databases, because the interface was designed to address problems where you know the target and you want the database to quickly retrieve the result. If you don't have an exact description of the target, you're lost with a database today. This is why data mining is seeing a lot of demand.
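The contrast between a known target and an unknown one can be sketched in a few lines. The example below is illustrative only: the `calls` table, its columns, and the data are hypothetical, and the z-score ranking is a simple stand-in for a data mining step, not a real fraud detector and not a method attributed to Fayyad.

```python
import sqlite3
from statistics import mean, pstdev

# Hypothetical call records held in an in-memory SQLite database.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE calls (caller TEXT, minutes REAL)")
conn.executemany(
    "INSERT INTO calls VALUES (?, ?)",
    [("alice", 3.0), ("alice", 4.0), ("bob", 5.0), ("bob", 4.0),
     ("carol", 3.0), ("carol", 5.0), ("dave", 240.0)],
)

# Known target: the WHERE clause describes exactly which rows to retrieve.
known = conn.execute(
    "SELECT caller, minutes FROM calls WHERE caller = ? ORDER BY minutes",
    ("alice",),
).fetchall()


def unusual_calls(rows, threshold=2.0):
    """Unknown target: flag rows that deviate strongly from the typical call.

    There is no exact description of "fraud" to put in a WHERE clause, so
    this ranks rows by z-score of call length instead (an assumed heuristic).
    """
    durations = [minutes for _, minutes in rows]
    mu, sigma = mean(durations), pstdev(durations)
    if sigma == 0:
        return []
    return [(c, m) for c, m in rows if abs(m - mu) / sigma > threshold]


rows = conn.execute("SELECT caller, minutes FROM calls").fetchall()
print(known)
print(unusual_calls(rows))
```

The first query works because the target is fully specified. The second step has to infer what "unusual" means from the data itself, which is the kind of question the quote says the query interface was not designed for.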

"When I started in this field back in 1989, there were many people in large corporations struggling with large data sets. And even though there's a lot of data out there, it's not necessarily the right kind. Also, there's a big difference between the ability to store data and the ability to access it in a useful way.
