What Is Data Warehouse: 1) Predictive Tasks
What Is Data Warehouse: 1) Predictive Tasks
2) Data mart
A data mart contains a subset of corporate-wide data that is of value to a specific
group of users.
The scope is confined to specific selected subjects.
The data contained in data marts tend to be summarized. Data marts are usually
implemented on servers that are Unix/Linux or Windows based.
Depending on the source of data, data marts can be categorized as independent or
dependent.
3) Virtual warehouse
A virtual warehouse is a set of views over operational databases.
For efficient query processing, only some of the possible summary views may be
materialized. A virtual warehouse is easy to build but requires excess capacity on
operational database servers.
Predictive Modeling: It refers to the task of building a model for the target variable as a
function of the explanatory variable. There is two types of predictive modeling tasks:
Classification: It is used for discrete target variables.
Regression: It is used for continuous target variables.
Association Analysis: it is used to find group of data that have related functionality. And its
Goal is to extract the most of interesting patterns in an efficient manner.
Cluster Analysis: Clustering has been used to group sets of related customers.
Anomaly Detection: It is the task of identifying observations whose characteristics are
significantly different from the rest of the data. Such observations are known as anomalies or
outliers.
3) Telecommunication Industry
Today the telecommunication industry is one of the most emerging industries providing
various services such as fax, pager, cellular phone, internet messenger, images, e- mail, web
data transmission, etc. Due to the development of new computer and communication
technologies, the telecommunication industry is rapidly expanding in business field.
For example:
Data mining in telecommunication industry helps in identifying the telecommunication
patterns, catch fraudulent activities and improve quality of service.
4) Biological Data Analysis
In recent times, we have seen a tremendous growth in the field of biology such as genomics,
proteomics, functional Genomics and biomedical research. Biological data mining is a very
important part of Bioinformatics.
For example: