answersLogoWhite

0


Best Answer

Dirty data is incorrect or incomplete input and can originate in many ways. However, we should also consider the method through which data is determined to be, in fact, dirty. For example, a record may have the incorrect address for a customer but that isn't to say that when the record was created the address was incorrect, it may have changed. Timing and maintenance also play a role.

Generally speaking, data is moved through a three step process known as E.T.L. (Extract, Transform & Load). Extraction is the process of sourcing the data; data entry or from a device or location. Transformation is the process used to validate and coerce the data into predetermined formats. Loading is the process by which the formated data is moved to it's destination.

Most dirty data originates from processes involving data entry and is a result of human err, poorly defined or understood process requirements and/or inadequate validation/error handling methods.

The first step is to completely and accurately define requirements. What data is being collected, where does it come from and how often (Extraction)? How will the data be validated and errors reported (Transformation)? How will it be stored (Loading)?

Only after these questions have been answered can the E.T.L. process can be designed. Once designed, the process requirements are disseminated to the extraction source.

Within user based extraction processes there are many common methods to reduce dirty data. Some of these are; spell checks, data type validations, required fields and value limited input controls such as check, combo and list boxes. When designing the extraction method (or front end for user based systems) the general rule of thumb is the less data entry the better. Other common methods include tool tips (floating help boxes), top to bottom, left to right, tab ordered fields (for heads down data entry) and visual/audible cues for data validation exceptions.

User Avatar

Wiki User

13y ago
This answer is:
User Avatar

Add your answer:

Earn +20 pts
Q: What are the ways to avoid dirty data?
Write your answer...
Submit
Still have questions?
magnify glass
imp
Related questions

When does dirty data occur?

Data are considered dirty when they are modified, and they have not yet been written to backing storage.


When configured properly what can avoid data leakage?

A firewall can avoid data leakage.


What is dirty page in operating system?

A data can be retrieved from the secondary storage for use in main memory, but if we edit the data and not save the data in to the secondary storage, it is termed as a dirty page.


How can you avoid corruption of data?

How can you avoid corruption?


how avoid making assumptions while conducting a research help you?

There are ways, however, to try to maintain objectivity and avoid bias with qualitative data analysis: Use multiple people to code the data. ... Have participants review your results. ... Verify with more data sources. ... Check for alternative explanations. ... Review findings with peers.


What do you call incomplete outdated or otherwise inaccurate data is a type of?

dirty data


When was Ways to Avoid the Sun created?

Ways to Avoid the Sun was created in 2003.


What is the dirty data in dbms?

Dirty data in a database management system (DBMS) refers to data that is inaccurate, incomplete, or inconsistent. This can include missing values, duplicate records, formatting errors, or outdated information. Dirty data can lead to mistakes in decision-making and analysis, so it's important to regularly clean and maintain the data in a database.


How can you protect your sense of touch?

avoid touch a dirty thing


3 ways of presenting data in statistics?

ways of presenting data in statistics


How do you avoid thrush?

To avoid thrush, keep the horse in a clean environment, and clean his feet thoroughly every day. Avoid wet and dirty bedding.


What are ways of presenting experimental data?

wat are the two ways of presenting experimental data