Synthesizing high-frequency rules from different data sources

UTSePress Research/Manakin Repository

Search UTSePress Research


Advanced Search

Browse

My Account

Show simple item record

dc.contributor.author Wu Xindong en_US
dc.contributor.author Zhang Shichao en_US
dc.date.accessioned 2009-12-21T03:52:29Z
dc.date.available 2009-12-21T03:52:29Z
dc.date.issued 2003 en_US
dc.identifier 2003000487 en_US
dc.identifier.citation Wu Xindong and Zhang Shichao 2003, 'Synthesizing high-frequency rules from different data sources', IEEE Computer Soc, vol. 15, no. 2, pp. 353-367. en_US
dc.identifier.issn 1041-4347 en_US
dc.identifier.other C1 en_US
dc.identifier.uri http://hdl.handle.net/10453/5742
dc.description.abstract Many large organizations have multiple data sources, such as different branches of an interstate company. While putting all data together from different sources might amass a huge database for centralized processing, mining association rules at different data sources and forwarding the rules (rather than the original raw data) to the centralized company headquarter provides a feasible way to deal with multiple data source problems. In the meanwhile, the association rules at each data source may be required for that data source in the first instance, so association analysis at each data source is also important and useful. However, the forwarded rules from different data sources may be too many for the centralized company headquarter to use. This paper presents a weighting model for synthesizing high-frequency association rules from different data sources. There are two reasons to focus on high-frequency rules. First, a centralized company headquarter is interested in high-frequency rules because they are supported by most of its branches for corporate profitability. Second, high-frequency rules have larger chances to become valid rules in the union of all data sources. In order to extract high-frequency rules efficiently, a procedure of rule selection is also constructed to enhance the weighting model by coping with low-frequency rules. Experimental results show that our proposed weighting model is efficient and effective. en_US
dc.publisher Institute of Electrical and Electronics Engineers en_US
dc.relation.isbasedon http://dx.doi.org/10.1109/TKDE.2003.1185839 en_US
dc.title Synthesizing high-frequency rules from different data sources en_US
dc.parent IEEE Transactions On Knowledge And Data Engineering en_US
dc.journal.volume 15 en_US
dc.journal.number 2 en_US
dc.publocation Piscataway, USA en_US
dc.identifier.startpage 353 en_US
dc.identifier.endpage 367 en_US
dc.cauo.name Software Engineering en_US


Files in this item

This item appears in the following Collection(s)

Show simple item record