CSE 597I Reading List
Most topics have only one primary paper; a few have two. These are usually
shorter papers. If any of the links to papers are missing, please try the
Citeseer
database. Please let me know of any
missing links and, if you find the paper elsewhere, where it is located.
Reader: You must read the "Primary" papers; others are for background
and more information and are recommended reading.
Presenter: Your presentation is about a topic and must discuss
the primary paper(s) in detail; you must also read "Supplemental"
papers for background. Presenters should spend most of their presentation
on the papers assigned and not on introductory material. They should also
briefly cover the supplemental papers in their presentation. The sections marked
"assigned" have been taken. All papers should be available electronically.
If a topic you are interested in is not on the list, please consult with
the instructor.
Please see the timetable for date assignments.
Adaptive Web Sites (assigned)
-
Primary: M. Perkowitz, O. Etzioni, "Toward
Adaptive Web Sites: Concept and Case Study," Artificial Intelligence 118(1-2)
2000. PDF
Web Networks (assigned)
-
Primary: R. Albert, H. Jeong, A. Barabasi,
"The Internet's Achilles' Heel: Error and Attack Tolerance of Complex Networks,"
Nature, 2000. Postscript
-
Supplemental: B. Huberman, L. Adamic, "Evolutionary
Dynamics of the World Wide Web." (A shorter version appeared in Nature,
1999). HTML
Web Growth Dynamics
-
Primary: B. Huberman, L. Adamic, "Evolutionary
Dynamics of the World Wide Web." (A shorter version appeared in Nature,
1999) HTML
-
Supplemental: R. Albert, H. Jeong, A. Barabasi,
"The Internet's Achilles' Heel: Error and Attack Tolerance of Complex Networks,"
Nature, 2000. Postscript
Link Structure & Prediction
-
Primary: R. Sarukkai, "Link Prediction and
Path Analysis Using Markov Chains," 9th World Wide Web Conference, May,
2000. HTML
-
Supplemental: R. Lempel, S. Moran, "The Stochastic
Approach for Link-Structure Analysis (SALSA) and the TKC Effect," 9th World
Wide Web Conference, May, 2000. HTML
Web Analysis/Search Engine Comparison
-
Primary: S. Lawrence and C.L. Giles, "Accessibility
of Information on the Web," Nature, 400, 6740, pp. 107-109, 1999. PDF.
-
Primary: S. Lawrence and C.L. Giles, "Searching
the World Wide Web," Science, 280, 5360, pp. 98--100. PDF
-
Supplemental: Krishna Bharat and Andrei Broder.
A technique for measuring the relative size and overlap of public web search
engines. In Proceedings of the 7th International World Wide Web Conference,
Brisbane, Australia, pages 379-388. Elsevier Science, April 1998.
HTML
Web Page Dynamics (assigned)
-
Primary: B.E. Brewington, G. Cybenko, "How
Dynamic Is the Web?" 9th World Wide Web Conference, May, 2000. HTML
-
Supplemental: E. G. Coffman, Z. Liu, R. Weber,
"Optimal Robot Scheduling for Web Search Engines," INRIA RR, No. 3317,
1997. To appear in Journal of Scheduling. Postscript.gz
Focused Crawling (assigned)
-
Primary: Soumen Chakrabarti, Martin van den
Berg, and Byron Dom, "Focused Crawling: A New Approach to Topic-Specific
Web Resource Discovery," Proceedings of the 8th World Wide Web conference,
May 1999. HTML
-
Supplemental: M. Diligenti, F.M. Coetzee,
S. Lawrence, C.L. Giles, M. Gori, "Focused Crawling Using Context Graphs,"
26th International Conference on Very Large Databases (VLDB 2000), September
2000. PDF
URL Crawling (Google) (assigned)
-
Primary: J. Cho, H. Garcia-Molina, L. Page,
"Efficient Crawling Through URL Ordering," Proceedings of the 7th World
Wide Web conference, April 1998. HTML
-
Supplemental: S. Brin, L. Page, "The Anatomy
of a Large-Scale Hypertextual Web Search Engine," Proceedings of the 7th
World Wide Web conference, April 1998. HTML
Web Partitioning (assigned)
-
Primary: G. Flake,
S. Lawrence, C.L. Giles, "Efficient Identification of Web Communities,"
In the Proceedings of the Sixth International Conference on Knowledge Discovery
and Data Mining (ACM SIGKDD-2000), Boston, MA, USA. August 20-23, 2000.
Postscript
-
Supplemental: J. Kleinberg, "Authoritative
Sources in a Hyperlinked Environment," Proceedings ACM-SIAM Symposium on
Discrete Algorithms, San Francisco, California, 25-27 January, 668-677,
1998. Postscript
Document Similarity / Plagiarism Detection
-
Primary: N. Shivakumar, H. Garcia-Molina.
"SCAM: A Copy Detection Mechanism for Digital Documents," Proceedings of
the 2nd International Conference on Theory and Practice of Digital Libraries,
Austin, Texas, 1995. Postscript
-
Supplemental: Andrei Broder, Steve Glassman,
Mark Manasse, Geoffrey Zweig, "Syntactic Clustering of the Web," Sixth
International World Wide Web Conference, 391--404, 1997. SRC Technical
Note, 1997-015. HTML
Web Usage Patterns
-
Primary: Bernardo A. Huberman, Peter L. T.
Pirolli, James E. Pitkow, Rajan M. Lukose, "Strong Regularities in World
Wide Web Surfing," Science, 280, 95-97, April 3, 1998.
PDF
-
Primary: Bernardo A. Huberman, Rajan M. Lukose,
"Social Dilemmas and Internet Congestion," Science, 277, 535--537, July
25, 1997.
HTML
-
Supplemental: Rajan M. Lukose and Bernardo
A. Huberman, "Surfing as a Real Option."
HTML
Web Hyperlink Ranking
-
Primary: J. Kleinberg, "Authoritative Sources
in a Hyperlinked Environment," Proceedings ACM-SIAM Symposium on Discrete
Algorithms, San Francisco, California, 25-27 January, 668-677, 1998
Postscript
-
Primary: L. Page and S. Brin and R. Motwani
and T. Winograd, The PageRank Citation Ranking: Bringing Order to the Web,
Manuscript in progress.
Postscript
Recommender Systems (assigned)
-
Primary: D.M. Pennock, E. Horvitz, C.L.
Giles. "Social choice theory and recommender systems: Analysis of the axiomatic
foundations of collaborative filtering," Proceedings of the 17th National
Conference on Artificial Intelligence (AAAI
2000), Austin, TX, July 2000. PDF
-
Supplemental: J Ben Schafer, Joseph Konstan,
John Riedl, Recommender Systems in E-Commerce. ACM Conference on
Electronic Commerce (EC-99), pp 158-166, 1999. Postscript
Clustering
-
Primary: Oren Zamir and Oren Etzioni. Web
Document Clustering: A Feasibility Demonstration, ACM SIGIR'98, 1998. Postscript
-
Primary: D. Gibson, J. Kleinberg, P. Raghavan.
Clustering categorical data: An approach based on dynamical systems. Proc.
24th Intl. Conference on Very Large Databases, 1998. HTML
Scientific Literature Digital Libraries
-
Primary: S. Lawrence, C.L. Giles and K. Bollacker.
"Digital Libraries and Autonomous Citation Indexing," 32, 6, 67-71, IEEE
Computer, 1999. HTML, Postscript, Postscript.Z
-
Primary: A.M. Odlyzko, "Tragic Loss or Good
Riddance? The Impending Demise of Traditional Scholarly Journals," International
Journal of Human-Computer Studies, 42, 71--122, 1995. HTML, HTML
-
Supplemental: S.L. Esler and M.L. Nelson,
"Evolution of Scientific-Technical Information Distribution," Journal of
the American Society for Information Science, Volume 49, Number 1, pp.
82--91, 1998.
PDF
Web Citation Databases
-
Primary: C.L. Giles, K. Bollacker, S. Lawrence,
"CiteSeer: An Automatic Citation Indexing System," Digital Libraries 98
- The Third ACM Conference on Digital Libraries, (eds) Ian Witten and Rob
Akscyn and Frank M. Shipman III, ACM Press, pp. 89--98, 1998
HTML
-
Primary: Robert D. Cameron, "A Universal Citation
Database as a Catalyst for Reform in Scholarly Communication," First Monday,
2(4), 1997. HTML
-
Supplemental: Steve Hitchcock and Les Carr
and Wendy Hall and Steve Harris and Steve Probets and David Evans and David
Brailsford, Linking Electronic Journals: Lessons From the Open Journal
Project, D-Lib Magazine, December, 1998
HTML
Metasearch Engines (assigned)
-
Primary: Erik Selberg and Oren Etzioni, "The
MetaCrawler Architecture for Resource Aggregation on the Web." HTML
-
Primary: S. Lawrence and C.L. Giles, "Context
and Page Analysis for Improved Web Search," IEEE Internet Computing, July/August
1998, Volume 2, Number 4, pp. 38-46, 1998. HTML
-
Supplemental: Adele Howe, Daniel Dreilinger,
"Experiences with Selecting Search Engines using Meta-Search," ACM Transactions
on Information Systems. Postscript.gz
Web Auction Parameters (assigned)
-
Primary: P.R. Wurman, M.P. Wellman, W.E. Walsh,
"A Parametrization of the Auction Design Space," Games and Economic Behavior
(to appear). PDF
-
Supplemental: M. Kumar and S. Feldman, "Internet
Auctions," IBM Institute for Advanced Commerce TR, Nov. 1998. PDF
Web storage: permanence of information
-
Primary: A. G. Goldberg and P. N. Yianilos,
"Towards an Archival Intermemory," Proc. IEEE Advances in Digital Libraries
(ADL), 1998. HTML
-
Primary: Keith Shafer, Stuart Weibel, Erik
Jul, Jon Fausey, "Introduction to Persistent Uniform Resource Locators,"
HTML
Search source selection
-
Primary: L. Gravano, H. Garcia-Molina, A.
Tomasic, "GlOSS: Text-Source Discovery over the Internet," to appear in
ACM Transactions on Database Systems, 1999.
Postscript
-
Supplemental: Adele Howe, Daniel Dreilinger,
"Experiences with Selecting Search Engines using Meta-Search," ACM Transactions
on Information Systems. Postscript.gz
Multimedia Search and Retrieval
-
Primary: S.-F. Chang, Q. Huang, T. Huang,
A. Puri, B. Shahraray, "Multimedia Search and Retrieval," Advances in Multimedia:
Systems, Standards and Networks, eds. A. Puri, T. Chen, Marcel Dekker,
NY. 1999. PDF
-
Supplemental: H. Stone, "Image Libraries and
the Internet," IEEE Communications Magazine, 37(1), pp. 110-115, 1999.
Postscript.gz
Video Information Retrieval
-
Primary: Christel, M., Kanade, T., Mauldin,
M., Reddy, R., Stevens, S., Wactlar, H., "Techniques for the Creation and
Exploration of Digital Video Libraries," Multimedia Tools and Applications
(Volume 2), Ch. 8. Borko Furht, editor. Boston, MA: Kluwer Academic Publishers,
1996. PDF
Privacy (assigned)
-
Primary: Bernardo A.
Huberman, Matt Franklin and Tad Hogg, "Enhancing Privacy and Trust in Electronic
Communities", Proceedings of the ACM Conference on Electronic Commerce
(EC99), pp 78-86, 1999. Postscript.gz
-
Supplemental: Jim Omura, "Digital Signatures
and Certificates", Cylink white paper. HTML
Query Expansion
-
Primary: Mitra, Singhal and Buckley, "Improving
Automatic Query Expansion", Proceedings of the 21st Annual International
ACM-SIGIR Conference on Research and Development in Information Retrieval,
pp. 206-214, 1998. Postscript
Formal Models of Web Queries
-
Primary: Alberto Mendelzon and Tova Milo,
"Formal Models of Web Queries," Proc. PODS'97, Tucson, May 1997. Full version
to appear in Information Systems. Postscript.Z
Web Symbolic Knowledge Extraction (assigned)
-
Primary: Craven, M., DiPasquo, D., Freitag,
D., McCallum, A., Mitchell, T., Nigam, K., & Slattery, S., "Learning
to extract symbolic knowledge from the World Wide Web," Proceedings of
the Fifteenth National Conference on Artificial Intelligence (AAAI-98),
pp. 509--516, 1998. HTML
-
Supplemental: Riloff,
E., & Jones, R., "Learning dictionaries for information extraction
using multi-level bootstrapping." Proceedings of the Sixteenth National
Conference on Artificial Intelligence (AAAI-99), 1999. HTML