Selected and/or recent papers by William W. Cohen

[RSS feed]

Recent papers: 2009

  1. William W. Cohen, Natalie Glance, Charles Schafer, Roy Tromble, Yuk Wah Wong (2009): Data Integration for Many Data Sources using Context-Sensitive Similarity Metrics in preparation.
  2. Vitor R. Carvalho, Ramnath Balasubramanyan and William W. Cohen (2009): Information Leaks and Suggestions: A Case Study using Mozilla Thunderbird in CEAS-2009.
  3. Richard Wang and William W. Cohen (2009): Character-level Analysis of Semi-Structured Documents for Set Expansion in EMNLP 2009.
  4. Ramnath Balasubramanyan and William W. Cohen and Matthew Hurst (2009): Modeling corpora of timestamped documents using semisupervised nonparametric topic models in preparation.
  5. Frank Lin and William W. Cohen (2009): Power Iteration Clustering in preparation.
  6. Richard Wang and William W. Cohen (2009): Automatic Set Instance Extraction using the Web in ACL-IJNLP 2009.
  7. Noboru Matsuda, Andrew Lee, William W. Cohen, and Ken Koedinger (2009): A Computational Model of How Learner Errors Arise from Weak Prior Knowledge in CogSci-2009.
  8. Amr Ahmed, Andrew O. Arnold, Luis Pedro Coelho, Joshua Kangas, Abdul-Saboor Sheikk, Eric P. Xing, William W. Cohen, and Robert F. Murphy (2009): Structured Literature Image Finder in Biolink-2009.
  9. Amr Ahmed, Eric P. Xing, William W. Cohen, and Robert F. Murphy (2009): Structured Correspondence Topic Models for Mining Captioned Figures in Biological Literature in KDD-2009.
  10. Tae Yano, Noah A. Smith, and William W. Cohen (2009): Predicting Response to Political Blog Posts with Topic Models in NAACL-2009.
  11. Ramnath Balasubramanyan, Frank Lin, William W. Cohen, Noah A. Smith, and Matthew Hurst (2009): From Episodes to Sagas: Understanding the News by Identifying Temporally Related Story Sequences in ICWSM-2009 (poster).
  12. Andrew Arnold and William W. Cohen (2009): Information Extraction as Link Prediction: Using Curated Citation Networks to Improve Gene Detection in WASA-2009.
  13. Andrew Arnold and William W. Cohen (2009): Information Extraction as Link Prediction: Using Curated Citation Networks to Improve Gene Detection in ICWSM-2009 (poster).

Recent papers: 2008

  1. Richard Wang and William W. Cohen (2008): Iterative Set Expansion of Named Entities Using the Web in ICDM-2008.
  2. Andrew Arnold and William W. Cohen (2008): Intra-document Structural Frequency Features for Semi-Supervised Domain Adaptation in CIKM-2008.
  3. Vitor R. Carvalho, Jonathan L. Elsas, William W. Cohen, and Jaime G. Carbonell (2008): Suppressing Outliers in Pairwise Preference Ranking in CIKM-2008.
  4. Richard Wang, Nico Schlaefer, William W. Cohen, and Eric Nyberg (2008): Automatic Set Expansion for List Question Answering in EMNLP-2008.
  5. Einat Minkov and William W. Cohen (2008): Learning Graph Walk Based Similarity Measures for Parsed Text in EMNLP-2008.
  6. Andrew Arnold, Ramesh Nallapati and William W. Cohen (2008): Exploiting Feature Hierarchy for Transfer Learning in Named Entity Recognition in ACL-2008.
  7. Ramnath Balasubramanyan, Vitor Carvalho, and William W. Cohen (2008): CutOnce - Recipient Recommendation and Leak Detection in Action in AAAI-2008 Workshop on Enhanced Messaging.
  8. Einat Minkov, Ramnath Balasubramanyan, and William W. Cohen (2008): Activity-centric Search in Email in AAAI-2008 Workshop on Enhanced Messaging.
  9. Noboru Matsuda, William W. Cohen, Jonathan Sewall, Gustavo Lacerda, Kenneth Koedinger (2008): SimStudent: Building an Intelligent Tutoring System by Tutoring a Synthetic Student in preparation.
  10. Einat Minkov and William W. Cohen (2008): Learning to Walk Structured Text Networks in CMU SCS Technical Report Series (CMU-LTI-08-02).
  11. Ramesh Nallapati, Amr Ahmed, Eric Xing, and William W. Cohen (2008): Joint Latent Topic Models for Text and Citations in KDD-2008.
  12. Frank Lin and William W. Cohen (2008): Accurate Semi-supervised Classification for Graph Data in preparation.
  13. Ramesh Nallapati and William W. Cohen (2008): Link-PLSA-LDA: A New Unsupervised Model for Topics and Influence of Blogs in ICWSM-2008.
  14. Yi-Chia Wang, Mahesh Joshi, William Cohen, and Carolyn Rose (2008): Recovering Implicit Thread Structure in Newsgroup Style Conversations in ICWSM-2008.
  15. Frank Lin and William W. Cohen (2008): The MultiRank Bootstrap Algorithm: SemiSupervised Political Blog Classification and Ranking Using SemiSupervised Link Classification in ICWSM-2008 (poster).
  16. Frank Lin and William W. Cohen (2008): The MultiRank Bootstrap Algorithm: SemiSupervised Political Blog Classification and Ranking Using SemiSupervised Link Classification in CMU SCS Technical Report Series (CMU-LTI-08-03).
  17. Noboru Matsuda, William W. Cohen, Jonathan Sewall, Gustavo Lacerda, and Kenneth R. Koedinger (2008): Why Tutored Problem Solving may be better than Example Study: Theoretical Implications from a Simulated-Student Study in ITS-2008.
  18. Vitor Carvalho and William W. Cohen (2008): Ranking Users for Intelligent Message Addressing in ECIR-2008.

Recent papers: 2007

  1. Andrew Arnold, Ramesh Nallapati and William W. Cohen (2007): A Comparative Study of Methods for Transductive Transfer Learning in ICDM Workshop on Mining and Management of Biological Data.
  2. Ramesh Nallapati, William W. Cohen, and John Lafferty (2007): Parallelized Variational EM for Latent Dirichlet Allocation: An Experimental Evaluation of Speed and Scalability in ICDM Workshop on High Performance Data Mining.
  3. Ramesh Nallapati, Amr Ahmed, William Cohen and Eric Xing (2007): Sparse Word Graphs: A Scalable Algorithm for Capturing Word Correlations in Topic Models in ICDM Workshop on High Performance Data Mining.
  4. William Cohen (2007): Graph Walks and Graphical Models in preparation.
  5. Richard Wang and William Cohen (2007): Language-Independent Set Expansion of Named Entities using the Web in ICDM-2007.
  6. Einat Minkov and William Cohen (2007): Learning to Rank Typed Graph Walks: Local and Global Approaches in WebKDD-2007.
  7. Sarah Zelikovitz, William Cohen, and Haym Hirsh (2007): Extending WHIRL with background knowledge for improved text classification in Information Retrieval 10(1) pp 35-67.
  8. Vitor Carvalho, Wen Wu and William Cohen (2007): Discovering Leadership Roles in Email Workgroups in CEAS-2007.
  9. Zhenzhen Kou, Vitor Carvalho and William Cohen (2007): Online Stacked Graphical Learning in NIPS-07 Workshop on Efficient Machine Learning .
  10. Vitor Carvalho and William Cohen (2007): Recommending Recipients in the Enron Corpus in preparation.
  11. Ramesh Nallapati, William Cohen, Susan Ditmore, John Lafferty and Kin Ung (2007): Multiscale Topic Tomography in KDD-2007.
  12. Noboru Matsuda, William Cohen, Jonathan Sewall, Gustavo Lacerda and Ken Koedinger (2007): Predicting students performance with a SimStudent that learns cognitive skills from observation in AIED-2007.
  13. Noboru Matsuda, William Cohen, Jonathan Sewall, Gustavo Lacerda and Ken Koedinger (2007): Evaluating a simulated student using real students data for training and testing in UM-2007.
  14. Juchang Hua, Orhan Ayasli, William Cohen and Robert Murphy (2007): Identifying Fluorescence Microscope Images in Online Journal Publications using Both Image and Text Features in ISBI-2007.
  15. Vitor Carvalho and William W. Cohen (2007): Preventing Information Leaks in Email in SDM-2007.
  16. Zhenzhen Kou and William W. Cohen (2007): Stacked Graphical Models for Efficient Inference in Markov Random Fields in SDM-2007.
  17. Zhenzhen Kou, William W. Cohen, and Robert F. Murphy (2007): A Stacked Graphical Model for Associating Information from Text And Images In Figures in PSB-2007.

Selected other papers

  1. Einat Minkov, Andrew Ng and William W. Cohen (2006): Contextual Search and Name Disambiguation in Email using Graphs in SIGIR-2006.
  2. William W. Cohen & Einat Minkov (2006): A Graph-Search Framework for Associating Gene Identifiers with Documents in BMC Bioinformatics.
  3. William W. Cohen & Vitor Carvalho (2005): Stacked Sequential Learning in IJCAI-2005.
  4. Vitor Carvalho & William W. Cohen (2005): On the Collective Classification of Email Speech Acts in SIGIR 2005.
  5. Zhenzhen Kou, William W. Cohen & Robert F. Murphy (2005): High-Recall Protein Entity Recognition Using a Dictionary in ISMB-2005.
  6. Sunita Sarawagi & William W. Cohen (2004): Semi-Markov Conditional Random Fields for Information Extraction in NIPS 2004.
  7. William W. Cohen, Vitor R. Carvalho & Tom Mitchell (2004): Learning to Classify Email into "Speech Acts" in EMNLP 2004.
  8. Pradeep Ravikumar & William W. Cohen (2004): A Hierarchical Graphical Model for Record Linkage in UAI 2004.
  9. William W. Cohen & Sunita Sarawagi (2004): Exploiting Dictionaries in Named Entity Extraction: Combining Semi-Markov Extraction Processes and Data Integration Methods in KDD 2004: 89-98.
  10. William W. Cohen (2003): Learning and Discovering Structure in Web Pages in IEEE Data Eng. Bull. 26(3): 3-10 (2003).
  11. Mikael Bilenko, Ray Mooney, William W. Cohen, Pradeep Ravikumar & Steve Fienberg (2003): Adaptive Name-Matching in Information Integration in IEEE Intelligent Systems 18(5): 16-23 (2003).
  12. William W. Cohen (2003): Infrastructure Components for Large-Scale Information Extraction Systems in IAAI 2003: 71-78.
  13. Cheng Zhai, William W. Cohen & John Lafferty (2003): Beyond Independent Topical Relevance: Methods and Evaluation Metrics for Subtopic Retrieval in SIGIR 2003: 10-17.
  14. William W. Cohen, Matthew Hurst & Lee S. Jensen (2003): A Flexible Learning System for Wrapping Tables and Lists in HTML Documents in Web Document Analysis: Challenges and Opportunities, ed. Antonacopoulos & Hu, Word Scientific Publishing. (Originally published as: William W. Cohen, Matthew Hurst & Lee S. Jensen (2002): A Flexible Learning System for Wrapping Tables and Lists in HTML Documents in WWW 2002: 232-241; Lee S. Jensen & William W. Cohen (2001): A Structured Wrapper Induction System for Extracting Information from Semi-Structured Documents in Proc. of the IJCAI-2001 Workshop on Adaptive Text Extraction and Mining).
  15. Chumki Basu, Haym Hirsh, William W. Cohen & Craig Neville-Manning (2001): Technical Paper Recommendation: A Study in Combining Multiple Information Sources in J. Artif. Intell. Res. (JAIR) 14: 231-252 (2001). (Originally published as: Chumki Basu, Haym Hirsh, William W. Cohen (1998): Recommendation as Classification: Using Social and Content-Based Information in Recommendation. in AAAI/IAAI 1998: 714-720).
  16. William W. Cohen, David McAllester, and Henry Kautz (2000): Hardening Soft Information Sources in KDD 2000: 255-259.
  17. William W. Cohen (2000): Automatically extracting features for concept learning from the Web in ICML 2000: 159-166.
  18. William W. Cohen and Wei Fan (2000): Web-Collaborative Filtering: Recommending Music by Crawling The Web in Computer Networks 33(1-6): 685-698 (2000). (Originally published as: William W. Cohen and Wei Fan (2000): Web-Collaborative Filtering: Recommending Music by Crawling The Web in WWW 2000).
  19. William W. Cohen (2000): Data Integration using Similarity Joins and a Word-based Information Representation Language in ACM Trans. Inf. Syst. 18(3): 288-321 (2000). (Originally published as: William W. Cohen (1998): Integration of Heterogeneous Databases Without Common Domains Using Queries Based on Textual Similarity in SIGMOD Conference 1998: 201-212; William W. Cohen (1997): Knowledge Integration for Structured Information Sources Containing Text (Extended Abstract) in SIGIR Workshop on Networked IR (informal proceedings)).
  20. William W. Cohen and Yoram Singer (1999): Simple, Fast, and Effective Rule Learner in AAAI/IAAI 1999: 335-342.
  21. William W. Cohen, Rob Schapire, Yoram Singer (1999): Learning to Order Things in J. Artif. Intell. Res. (JAIR) 10: 243-270 (1999). (Originally published as: William W. Cohen, Robert E. Schapire, Yoram Singer (1997): Learning to Order Things in NIPS 1997).
  22. William W. Cohen (1996): Learning Trees and Rules with Set-valued Features in AAAI/IAAI, Vol. 1 1996: 709-716.
  23. William W. Cohen (1996): Learning Rules that Classify E-Mail in AAAI Spring Symposium on ML and IR 1996.
  24. William W. Cohen (1995): Fast effective rule induction in ICML 1995: 115-123.
  25. William W. Cohen and Haym Hirsh (1994): Learning the CLASSIC description logic: Theoretical and experimental results in KR 1994: 121-133.

[Selected papers| By topic: Matching/Data Integration| Text Categorization| Rule Learning| Explanation-Based Learning| Formal Results| Inductive Logic Programming| Information Extraction| Collaborative Filtering| Applications| By year: All papers| RSS]