answersLogoWhite

0


Best Answer

An itemset is closed if none of its immediate supersets has the same support as the itemset.

So for example, if {Bread, Milk} is an itemset that has support=4, and all of its supersets has support<4, then {Bread, Milk} is a closed itemset.

Counter e.g.: If, let's say, {Bread, Milk, Sugar} has support=4, then {Bread, Milk} is not a closed itemset anymore.

Note: The definition states "the same" and doesn't say "the same or more" because it's impossible for a superset to have a support greater than one of its subsets.

User Avatar

Wiki User

13y ago
This answer is:
User Avatar

Add your answer:

Earn +20 pts
Q: What is closed frequent itemset in datamining?
Write your answer...
Submit
Still have questions?
magnify glass
imp
Related questions

What is downward closure property?

Every subset of a frequent itemset is also frequent. Also known as Apriori Property or Downward Closure Property, this rule essentially says that we don't need to find the count of an itemset, if all its subsets are not frequent. This is made possible because of the anti-monotone property of support measure - the support for an itemset never exceeds the support for its subsets. Stay tuned for this.


What is maximal frequent itemset in datamining?

MAFIA: MAximal Frequent Itemset AlgorithmMAFIA is a new algorithm for mining maximal frequent itemsets from a transactional database. Our algorithm is especially efficient when the itemsets in the database are very long. The search strategy of our algorithm integrates a depth-first traversal of the itemset lattice with effective pruning mechanisms.Our implementation of the search strategy combines a vertical bitmap representation of the database with an efficient relative bitmap compression schema. In a thorough experimental analysis of our algorithm on real data, we isolate the effect of the individual components of the algorithm. Our performance numbers show that our algorithm outperforms previous work by up to an order of magnitude.An animated gif demonstrates the MAFIA algorithm here.Candidate Itemset TreeThe process of generating candidate itemsets is done using a depth-first search, and the process can be represented as a candidate itemset tree. With each step down the tree, a single item is extended onto an itemset. As the itemsets grow larger and larger, the percentage of customers who have the itemset, or the support %, will grow smaller and smaller. Eventually, this support value will go below the minimum support required for an itemset to be deemed frequent. When looking at the lexicographic tree, it is possible to draw a line that crosses all points at which an occurrence of an itemset being extended goes from frequent to infrequent. All itemsets directly above this line are termed the maximal frequent itemsets. By the Apriori principle, no itemset extensions below this line can be frequent since they all contain other itemsets within them that were found to be infrequent.Search Space PruningWe have found that in certain cases, branches of the candidate itemset tree can be "pruned" away, leading to fewer itemsets that need to be checked, and therefore a faster running time. This section explains what each of these pruning steps do. Parent Equivalence Pruning - If an itemset in the tree has the same support as one of its candidate extensions, then it can be pruned from the tree because it must only occur in the database as part of that candidate extension.HUTMFI Superset Pruning - If the union of an itemset and its leftmost tail on the ordered subtree is frequent then the entire subtree can be pruned away. This process checks the current list of maximal frequent itemsets to see if this head-union-tail is already on this list.FHUT - Frequent Head-Union-Tail - This pruning method is identical to HUTMFI except it actually checks the support of the HUT rather than searching to see if it is already in the MFI list. FHUT has been found to yield fewer performance increases than HUTMFI.Vertical Bitmap RepresentationMAFIA efficiently stores the transactional database as a series of vertical bitmaps, where each bitmap represents an itemset in the database and a bit in each bitmap represents whether or not a given customer has the corresponding itemset. Initially, each bitmap corresponds to a 1-itemset, or a single item. The itemsets that are checked for frequency in the database become recursively longer and longer, and the vertical bitmap representation works perfectly in conjunction with this itemset extension. For example, the bitmap for the itemset (a,b) can be constructed simply by performing an AND operation on all of the bits in the bitmaps for (a) and (b). Then, to count the number of customers that have (a,b), all that needs to be done is count the number of one bits in the (a,b) bitmap equals the number of customers who have (a,b). Clearly, the bitmap structure is ideal for both candidate itemset generation and support counting.Source Code DownloadThe SourceForge download page has instructions on downloading the last stable version of the code. You can also download the datasets used for testing. CVS access is also available:Browse the source tree hosted at SourceForge: CVS TreeType 'cvs -d:pserver:anonymous@cvs.sourceforge.net:/cvsroot/himalaya-tools login'Press Enter when prompted for a password.Type 'cvs -z3 -d:pserver:anonymous@cvs.sourceforge.net:/cvsroot/himalaya-tools co Mafia'A source tree rooted in a directory called Mafia will be created.ContactPlease send email to or contact the authors directly: Manuel CalimlimJohannes GehrkePublicationsDoug Burdick, Manuel Calimlim and Johannes Gehrke. MAFIA: A Maximal Frequent Itemset Algorithm for Transactional Databases.In Proceedings of the 17th International Conference on Data Engineering.Heidelberg, Germany, April 2001.


ER diagram of telephone billing system?

how to draw a telephone billing system in huge and sparse datacubes in datamining


Was the Suez canal closed to civilian traffic in WW2?

The Suez Canal was closed to civilian traffic for 76 days during the Second World War because of frequent yet unpredictable air raids.


What happens to dividend miles after cancelling US Airways mastercard?

The miles should still be in the account as it is the credit card account and not the Frequent Flyer account that has been closed.


Difference between discrimination and classification in datamining?

There is only a slight difference between discrimination and classification in data mining. Discrimination can be negative and classification is generally just factual.


Is frequent a noun?

No, the word 'frequent' is a verb (frequent, frequents, frequenting, frequented) and an adjective (frequent, more frequent, most frequent).Examples:Sometimes we go to the big supermarket but we frequentthe store on our corner. (verb)In the summer we make frequent trips to the beach. (adjective)The noun forms of the verb to 'frequent' are frequency and the gerund, frequenting.The noun form of the adjective 'frequent' is frequentness.


What is frequent the same as?

Frequent is the same as often.


How many syllables does frequent have?

their are two syllables in frequent


What is the best definition of frequent?

what is the defenition of frequent


How many syllables in frequent?

There are 2 syllables in frequent


Is 'frequent this establishment' correct?

If used thusly, yes: "Do you frequent this establishment often?" "Yes, I often frequent this establishment!"