Nick Craswell
Research Overview

I am interested in Web search evaluation, mostly on enterprise-scale webs but also on the World Wide Web. I built the VLC, VLC2, WT2g and .GOV test collections, which have been made available to research groups around the world. David Hawking and I coordinated the TREC Web Track experiments. I am currently involved in the TREC Terabyte Track and Enterprise Track. Some publications: Book chapter preprint (pdf), IR'01 (citeseer) and CSIRO'01 (pdf).

I also work on effective Web search, which means making use of information in pages, link structure and URL structure to generate more useful Web search results. Some papers: SIGIR'05 (pdf), SIGIR'01 (pdf), TOIS'03 (pdf) (copying is by permission of ACM, Inc.) and ADCS'03 (pdf).

My PhD (thesis pdf) was in distributed information retrieval (DIR), which means building a system on top of multiple engines/databases that already exist. My recent work in the area has considered whether (or when) DIR is really practical. Some papers: ADC'99 (ps), DL'00 (pdf), ADC'03 (pdf) and ADC'04 (pdf).
(Numbers in square brackets are citation counts from Google Scholar, including self citations, at August 2005.)

2005 Relevance weighting for query independent evidence (pdf)
Focused crawling for both topical relevance and quality of medical information (pdf)
Quality and Relevance of Domain-specific Search: A Case Study in Mental Health (pdf)
Very Large Scale Retrieval and Web Search (pdf)
2004 Toward Better Weighting of Anchors (pdf)
Testbed for Information Extraction from Deep Web (pdf)
How Valuable is External Link Evidence when Searching Enterprise Webs? (pdf)
Overview of the TREC-2004 Web Track (pdf)
Performance and Cost Tradeoffs in Web Search (pdf)
2003 [56] Engineering a multi-purpose test collection for Web retrieval experiments (doi)
[10] Query-independent evidence in home page finding (pdf) (copying is by permission of ACM, Inc.)
[6] Automated Discovery of Search Interfaces on the Web (pdf)
Overview of the TREC-2003 Web Track (pdf)
Predicting Fame and Fortune: PageRank or Indegree? (pdf)
TREC12 Web Track at CSIRO (pdf)
2002 [37] Overview of the TREC-2002 Web Track (pdf)
Buying bestsellers online: A case study in Search & Searchability (pdf)
CSIRO INEX experiments: XML search using PADRE (pdf)
Enterprise search: What works and what doesn't (pdf)
TREC11 Web and Interactive Tracks at CSIRO (pdf)
XML Document Retrieval with PADRE (pdf)
2001 [64] Effective site finding using link anchor information (pdf)
[52] Overview of the TREC-2001 Web Track (pdf)
[31] Measuring search engine quality (citeseer)
[8] Which search engine is best at finding online services? (pdf)
Visual Clustering of Image Search Results (citeseer)
Panoptic Expert: Searching for experts not just for documents (pdf)
TREC10 Web and Interactive Tracks at CSIRO (pdf)
Which search engine is best at finding airline site home pages? (pdf)
2000 [41] Server Selection on the World Wide Web (pdf)
[9] Dark matter on the Web (pdf)
Methods for Distributed Information Retrieval (pdf)
An intranet reality check for TREC ad hoc (pdf)
Chart of darkness: Mapping a large intranet (pdf)
Efficient and flexible search using text and metadata (pdf)
1999 [79] Results and challenges in Web search evaluation (pdf)
[30] Merging Results from Isolated Search Engines (ps)
Is it fair to evaluate Web systems using TREC ad hoc methods? (pdf)
ACSys TREC-8 experiments (pdf)
Overview of TREC-8 Web track (ps)
1998 [59] Overview of TREC-7 Very Large Collection Track (pdf)
[11] ACSys TREC-7 experiments (pdf)
1997 ANU/ACSys TREC-6 experiments (pdf)
Aglets: A good idea for spidering? (pdf)


