skip to main content
article

Efficient mining of both positive and negative association rules

Authors Info & Claims
Published:01 July 2004Publication History
Skip Abstract Section

Abstract

This paper presents an efficient method for mining both positive and negative association rules in databases. The method extends traditional associations to include association rules of forms A ⇒ ¬ B, ¬ AB, and ¬ A ⇒ ¬ B, which indicate negative associations between itemsets. With a pruning strategy and an interestingness measure, our method scales to large databases. The method has been evaluated using both synthetic and real-world databases, and our experimental results demonstrate its effectiveness and efficiency.

References

  1. Aggarawal, C. and Yu, P. 1998. A new framework for itemset generation. In Proceedings of the Seventeenth ACM SIGACT-SIGMOD-SIGART Symposium on Principles of Database Systems. ACM, Seattle, Washington, 18--24.]] Google ScholarGoogle Scholar
  2. Agrawal, R., Imielinski, T., and Swami, A. 1993a. Database mining: A performance perspective. IEEE Trans. Knowledge and Data Eng. 5, 6 (Nov.), 914--925.]] Google ScholarGoogle Scholar
  3. Agrawal, R., Imielinski, T., and Swami, A. 1993b. Mining association rules between sets of items in massive databases. In Proceedings of the 1993 ACM SIGMOD International Conference on Management of Data. ACM, Washington D.C., 207--216.]] Google ScholarGoogle Scholar
  4. Bayardo, B. 1998. Efficiently mining long patterns from databases. In Proceedings of the 1998 ACM SIGMOD International Conference on Management of Data. ACM, Seattle, Washington, 85--93.]] Google ScholarGoogle Scholar
  5. Brin, S., Motwani, R., and Silverstein, C. 1997. Beyond market baskets: Generalizing association rules to correlations. In Proceedings of the 1997 ACM SIGMOD International Conference on Management of Data. ACM, Tucson, Arizona, 265--276.]] Google ScholarGoogle Scholar
  6. Carter, C., Hamilton, H., and Cercone, N. 1997. Share based measures for itemsets. In Principles of Data Mining and Knowledge Discovery. Springer, Trondheim, Norway, 14--24.]] Google ScholarGoogle Scholar
  7. Chen, M., Han, J., and Yu, P. 1996. Data mining: An overview from a database perspective. IEEE Trans. Knowledge and Data Eng. 8, 6 (Nov.), 866--881.]] Google ScholarGoogle Scholar
  8. Han, J., Pei, J., and Yin, Y. 2000. Mining frequent patterns without candidate generation. In Proceedings of the 2000 ACM SIGMOD International Conference on Management of Data. ACM, Dallas, Texas, 1--12.]] Google ScholarGoogle Scholar
  9. Hussain, F., Liu, H., Suzuki, E., and Lu, H. 2000. Exception rule mining with a relative interestingness measure. In Proceedings of The Third Pacific Asia Conference on Knowledge Discovery and Data Mining, PADKK 2000. Springer, Kyoto, Japan, 86--97.]] Google ScholarGoogle Scholar
  10. Hwang, S., Ho, S., and Tang, J. 1999. Mining exception instances to facilitate workflow exception handling. In Proceedings of the Sixth International Conference on Database Systems for Advanced Applications (DASFAA). IEEE Computer Society, Hsinchu, Taiwan, 45--52.]] Google ScholarGoogle Scholar
  11. Liu, H., Lu, H., Feng, L., and Hussain, F. 1999. Efficient search of reliable exceptions. In Proceedings of The Third Pacific Asia Conference on Knowledge Discovery and Data Mining, PADKK 1999. Springer, Beijing, China, 194--204.]] Google ScholarGoogle Scholar
  12. Padmanabhan, B. and Tuzhilin, A. 1998. A belief-driven method for discovering unexpected patterns. In Proceedings of the Fourth International Conference on Knowledge Discovery and Data Mining (KDD-98). AAAI, Newport Beach, California, USA, 94--100.]]Google ScholarGoogle Scholar
  13. Padmanabhan, B. and Tuzhilin, A. 2000. Small is beautiful: discovering the minimal set of unexpected patterns. In Proceedings of the Sixth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. ACM, Boston, MA, USA, 54--63.]] Google ScholarGoogle Scholar
  14. Park, J., Chen, M., and Yu, P. 1997. Using a hash-based method with transaction trimming for mining association rules. IEEE Trans. Knowl. Data Eng. 9, 5 (Sept.), 813--824.]] Google ScholarGoogle Scholar
  15. Piatetsky-Shapiro, G. 1991. Discovery, analysis, and presentation of strong rules. In Knowledge discovery in Databases. AAAI/MIT, Menlo Park, Calif., USA, 229--248.]]Google ScholarGoogle Scholar
  16. Savasere, A., Omiecinski, E., and Navathe, S. 1998. Mining for strong negative associations in a large database of customer transactions. In Proceedings of the Fourteenth International Conference on Data Engineering. IEEE Computer Society, Orlando, Florida, 494--502.]] Google ScholarGoogle Scholar
  17. Shintani, T. and Kitsuregawa, M. 1998. Parallel mining algorithms for generalized association rules with classification hierarchy. In Proceedings of the 1998 ACM SIGMOD International Conference on Management of Data. ACM, Seattle, Washington, 25--36.]] Google ScholarGoogle Scholar
  18. Shortliffe, E. 1976. Computer Based Medical Consultations: MYCIN. Elsevier, New York.]]Google ScholarGoogle Scholar
  19. Srikant, R. and Agrawal, R. 1996. Mining quantitative association rules in large relational tables. In Proceedings of the 1996 ACM SIGMOD International Conference on Management of Data. ACM, Montreal, Quebec, Canada, 1--12.]] Google ScholarGoogle Scholar
  20. Srikant, R. and Agrawal, R. 1997. Mining generalized association rules. Future Generation Computer Systems 13, 2-3 (Nov.), 161--180.]] Google ScholarGoogle Scholar
  21. Suzuki, E. 1997. Autonomous discovery of reliable exception rules. In Proceedings of the Third International Conference on Knowledge Discovery and Data Mining (KDD-97). AAAI, Newport Beach, California, USA, 259--262.]]Google ScholarGoogle Scholar
  22. Suzuki, E. and Shimura, M. 1996. Exceptional knowledge discovery in databases based on information theory. In Proceedings of the Second International Conference on Knowledge Discovery and Data Mining (KDD-96). AAAI, Portland, Oregon, USA, 275--278.]]Google ScholarGoogle Scholar
  23. Tan, P., Kumar, V., and Srivastava, J. 2000. Indirect association: Mining higher order dependencies in data. In Principles of Data Mining and Knowledge Discovery. Springer, Lyon, France, 632--637.]] Google ScholarGoogle Scholar
  24. Tan, P. and Kumar, V. 2002. Mining indirect associations in web data. In WEBKDD 2001---Mining Web Log Data Across All Customers Touch Points, Third International Workshop, San Francisco, CA, USA, August 26, 2001, Revised Papers. Springer, San Francisco, CA, 145--166.]] Google ScholarGoogle Scholar
  25. Tsur, D., Ullman, J., Abiteboul, S., Clifton, C., Motwani, R., Nestorov, S., and Rosenthal, A. 1998. Query flocks: A generalization of association-rule mining. In Proceedings of the 1998 ACM SIGMOD International Conference on Management of Data. ACM, Seattle, Washington, USA, 1--12.]] Google ScholarGoogle Scholar

Index Terms

  1. Efficient mining of both positive and negative association rules

      Recommendations

      Reviews

      Ming-Yen Lin

      Association rule mining is used to discover the relationships between items in a large database. The relationship, in general, shows that the occurrence of an item set would imply the occurrence of another item set. The wide applicability of association rules has motivated extensive research to improve the efficiency of the mining process, and the effectiveness of the mining results. This paper presents an efficient method for mining association rules, including negative associations between sets of items (named itemsets). Three forms of negative associations are discovered, such as A __?__ B, A __?__ B, and A __?__ B, where A and B are disjoint itemsets. While most of the papers in the literature describe the mining of either traditional associations or negative associations, the authors discover both associations at the same time. The proposed approach identifies all itemsets of interest, and then generates valid combinations of associations. The first step divides possible itemsets into frequent sets and negative sets, and prunes the uninteresting ones. The second step uses conditional-probability increment ratios to extract support-and-confidence valid rules. The main contribution of the paper is its presentation of a technique to mine both positive and negative associations concurrently. The authors define association rules of interest, and systematically shrink the search space of all combinations. A second contribution is the authors' application of conditional probability in the computation of rule generation; such a technique avoids the inefficiency of enumerating all positive and negative association rules. The paper efficiently mines associations, and helps decision making. Online Computing Reviews Service

      Access critical reviews of Computing literature here

      Become a reviewer for Computing Reviews.

      Comments

      Login options

      Check if you have access through your login credentials or your institution to get full access on this article.

      Sign in

      Full Access

      • Published in

        cover image ACM Transactions on Information Systems
        ACM Transactions on Information Systems  Volume 22, Issue 3
        July 2004
        145 pages
        ISSN:1046-8188
        EISSN:1558-2868
        DOI:10.1145/1010614
        Issue’s Table of Contents

        Copyright © 2004 ACM

        Publisher

        Association for Computing Machinery

        New York, NY, United States

        Publication History

        • Published: 1 July 2004
        Published in tois Volume 22, Issue 3

        Permissions

        Request permissions about this article.

        Request Permissions

        Check for updates

        Qualifiers

        • article

      PDF Format

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader