Data warehouse is the database on which we apply data mining.
Data Mining
There is only a slight difference between discrimination and classification in data mining. Discrimination can be negative and classification is generally just factual.
The data model identifies the objects/entities involved in the application and the relationships between them. From that, the ER diagram is solidified and this forms the basis of the database design.
ETL or Dataware Housing Testing:- ETL stands for Extract-Transform-Load and it is a process of how data is loaded from the source system to the data warehouse. Data is extracted from an OLTP database, transformed to match the data warehouse schema and loaded into the data warehouse database. Many data warehouses also incorporate data from non-OLTP systems such as text files, legacy systems and spreadsheets. Traditional (old) way of software testing: -Requirement -Design -Code & Build -Testing -Maintenance Unfortunately that is an erroneous methodology because the earlier you find an error - the more funds you can save. For example, fixing an error in maintenance is ten times more expensive than fixing it during execution. But many organizations have improved this way of thinking and choose modern way of software testing. In this philosophy testing should take place in every stage.
You can get the Java source code for the BIDE data mining algorithm here : (link moved to link section) It is an open-source data mining framework that includes the BIDE algorithm
it's data warehouse....data warehouse: it is a collection of multiple databases or it it is repository of data.data mining it is the process of extracting data from data warehouse.
Data warehouse is a technology that aggregates structured data from one or more sources so that it can be compared and analyzed rather than transaction processing, whereas Data mining is the process of analyzing unknown patterns of data.
catch important data from data warehouse.
mining the data is called data mining. Mining the text is called text mining
pages 471-473 C. data mining.
difference between Data Mining and OLAP
A data warehouse functions as a repository for all the data held by an organisation. The main functions are to reduce cost of data storage, facilitate data mining, and facilitate ability to back up data at an organisational level.
One of the biggest benefits is that you can archive your data to a data warehouse. This can keep your main "production" database smaller which can provide some performance benefits. Also you can use the data warehouse to run complex queries and data-mining without adverse effects on the performance of your "production" application.
data warehouse A data warehouse is the main repository of an organization's historical data, its corporate memory. It contains the raw material for management's decision support system. The critical factor leading to the use of a data warehouse is that a data analyst can perform complex queries and analysis, such as data mining, on the information without slowing down the operational systems. data mining The development of computational algorithms for the identification or extraction of structure from data. This is done in order to help reduce, model, understand, or analyze the data. Tasks supported by data mining include prediction, segmentation, dependency modeling, summarization, and change and deviation detection. Database systems have brought digital data capture and storage to the mainstream of data processing, leading to the creation of large data warehouses.
Data about other data is metadata.
Directed data mining involves using predefined goals or objectives to guide the analysis and modeling of data. In contrast, undirected data mining aims to discover patterns or relationships in data without specifying a particular outcome in advance. Directed data mining is typically used for tasks such as classification and regression, while undirected data mining techniques include clustering and anomaly detection.
The process of extracting high-quality data from unstructured text is known as text mining. Text mining, in its most basic form, seeks out facts, relationships, and affirmation from large amounts of unstructured textual data.