Browse by Current Projects:
TREC | VCoB | DGov | JiTTDL | CSKD | WIDIT
TREC : There are multiple Tracks under the TREC (Text REtrieval Conference) domain:
 
  • SPAM Track: 2005

    The SPAM track was new in TREC 2005. Its goal is to provide a standard evaluation of current and proposed spam filtering approaches, thereby laying the foundation for the evaluation of more general email filtering and retrieval tasks. The task of the SPAM track is to create an automatic spam filter that classifies a chronological sequence of email messages as SPAM or HAM (non-spam). The spam filter is run on several email sequences, some public and some private. The performance of the filter is measured against gold standard judgements made by a human assessor.
 
  • Genomics Track: 2004

    Due to the explosion of new data-intensive technologies for sequencing and examining genomes and proteomes during the '90s, the TREC Genomics Track was started in 2003. Its goal is to explore new methods for efficiently discovering and retrieving documents associated with the function of various genes and proteins within a given biology domain or 'sub-area'.
 
  • HARD Track: 2004 - Present

    The goal of HARD (High Accuracy Retrieval from Documents) is to achieve high-accuracy retrieval by leveraging additional information about the searcher and/or the search context, using techniques such as passage retrieval and highly targeted interaction with the searcher.
 
  • Robust Track: 2003 - Present

    The Robust Retrieval Track was introduced in TREC 2003. The goal of the track is to improve the consistency of retrieval technology by focusing on poorly performing topics. In addition, the track brings back a classic ad hoc retrieval task to TREC, which provides a natural home for new participants.
 
  • Web Track: 2003 - 2004

    The goals of the Web Track are:
    1. To investigate methods for effective topic distillation: Finding a set of the best home pages, given a broad query.
    2. To investigate methods for effective navigational search, with a mixture of home page and named page queries: Finding a particular page desired by the user.
    3. To increase the available queries/judgments for the .GOV test collection.
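
The SPAM track task described above (classify messages in chronological order, then score the filter against gold standard judgements) can be sketched as follows. This is only an illustration under stated assumptions: the `KeywordFilter` class, the `evaluate` function, the sample messages, and the choice of per-class misclassification rates as the measure are all hypothetical and are not taken from the track's actual evaluation toolkit.

```python
from collections import Counter


class KeywordFilter:
    """Toy filter (hypothetical): calls a message spam when its tokens were
    seen more often in earlier spam than in earlier ham."""

    def __init__(self):
        self.spam_tokens = Counter()
        self.ham_tokens = Counter()

    def classify(self, text):
        tokens = text.lower().split()
        spam_score = sum(self.spam_tokens[t] for t in tokens)
        ham_score = sum(self.ham_tokens[t] for t in tokens)
        return "spam" if spam_score > ham_score else "ham"

    def train(self, text, gold_label):
        target = self.spam_tokens if gold_label == "spam" else self.ham_tokens
        target.update(text.lower().split())


def evaluate(filter_, messages):
    """Classify each message in order, then reveal its gold label to the filter.

    Returns the fraction of ham and of spam messages that were misclassified.
    """
    errors = Counter()
    totals = Counter()
    for text, gold in messages:
        predicted = filter_.classify(text)
        totals[gold] += 1
        if predicted != gold:
            errors[gold] += 1
        filter_.train(text, gold)  # feedback arrives only after the decision
    return {
        label: errors[label] / totals[label] if totals[label] else 0.0
        for label in ("ham", "spam")
    }


# Made-up chronological sequence of (message text, gold judgement) pairs.
messages = [
    ("buy cheap pills", "spam"),
    ("meeting at noon", "ham"),
    ("cheap pills now", "spam"),
    ("lunch meeting today", "ham"),
]
rates = evaluate(KeywordFilter(), messages)
print(rates)
```

Keeping classification strictly ahead of training mirrors the chronological setting: the filter never sees the judgement for a message before it has to decide on that message.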
VCoB : 2004 - Present
 
  • The aim of the Virtual Collection Builder (VCoB) project is to develop an adaptive, interactive agent for building and maintaining a virtual collection of Web documents.
  • The VCoB approach will employ a wide array of methods from content analysis, citation analysis, machine learning, and information retrieval to collect, organize, and maintain a searchable and browsable collection of documents that is custom-tailored to the needs and preferences of individual users.
DGov : 2004 - Present
 
  • The aim of the Digital Government (DGov) project is to develop a more efficient and effective approach to organizing U.S. Government websites, one that facilitates access to and enhances retrieval of government information.
  • The DGov approach will combine information retrieval (e.g. keyword search) and information organization methods (e.g. Semantic Web) to optimize the government information discovery process on the Web.
JiTTDL : 2004 - Present
 
  • JiTTDL is an NSF-funded project at the US Air Force Academy that will develop a digital library for Just-in-Time pedagogical resources. Dr. Elin Jacob and I are consultants on the JiTTDL project, where we are employing CSKD-based approaches to construct a prototype digital library system.
CSKD : 2003 - Present
 
  • The Classification-based Search and Knowledge Discovery (CSKD) project aims to leverage an existing body of manually classified documents to enhance information retrieval and knowledge discovery on the Web. CSKD research explores methods of leveraging both the ontological and link-structural knowledge embedded in classified corpora of Web documents for searching and organizing the Web. It is a multi-dimensional project that entails investigations in such areas as machine learning, classification, clustering, link analysis, and fusion.
WIDIT : 2002 - Present
 
  • IRISWeb technology to be revamped for an integrated approach to Web information discovery.
  • Leveraging of text, hyperlink, and classification information on the Web for interactive retrieval, automatic classification, and virtual collection development.
IRISWeb : Spring 1998 - 2002
 
  • Kick-started by Chancellor’s UNC Instructional Technology Grant (spring 1998)
  • Work-in-progress prototype of the next-generation web search engine
  • Dynamic indexing, meta-search, relevance-feedback, and collection development
  • Implemented in Java, C++, C, and Perl.
Public User Data Navigation System : 1999 - 2000
 
  • Design and implementation of a navigation system for NIH that helps users find data of interest and the associated documentation.
  • To be considered for use as a template for other data navigation systems.
  • Implemented in Java.
Sitemap Tree Project : Spring 1999
 
  • Dynamic tree manipulation application that allows creation and maintenance of a hierarchical tree structure, which can represent various entities, from a file system (e.g. Windows Explorer) to a hyperlink-based map of a given Web site.
  • Implemented in Java.
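
As a rough illustration of the kind of hierarchical tree structure described above, the sketch below builds, edits, and prints a small tree. It is written in Python for brevity rather than in the project's Java, and the `TreeNode` class, its method names, and the sample paths are all hypothetical; nothing here reflects the original application's design.

```python
class TreeNode:
    """One node in a hierarchical tree; children are keyed by name."""

    def __init__(self, name):
        self.name = name
        self.children = {}

    def add_path(self, parts):
        """Create any missing nodes along the path and return the last one."""
        node = self
        for part in parts:
            if part not in node.children:
                node.children[part] = TreeNode(part)
            node = node.children[part]
        return node

    def remove(self, name):
        """Delete a direct child (and its subtree); report whether it existed."""
        return self.children.pop(name, None) is not None

    def render(self, depth=0):
        """Return the tree as indented lines, children in alphabetical order."""
        lines = ["  " * depth + self.name]
        for child_name in sorted(self.children):
            lines.extend(self.children[child_name].render(depth + 1))
        return lines


# The same structure can stand for a file system or a map of a Web site.
root = TreeNode("site")
root.add_path(["docs", "intro.html"])
root.add_path(["docs", "faq.html"])
root.add_path(["index.html"])
print("\n".join(root.render()))
```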
Web Search Evaluation : Fall 1998
 
  • A Web-based Search Engine evaluation survey.
  • Precursor to the Meta-Search Engine Evaluation.
Link Summarizer : Spring 1998
 
  • Displays a summary of linked pages in a given URL.
  • Modified version of IRIS crawler.
  • Implemented in Java and Perl.
IRIS Sitemap-Crawler : Fall 1997
 
  • Web Indexing Interface for IRISWeb.
  • Web Crawler/Indexer module indexes a target URL and its embedded links.
  • Sitemap Display module displays the indexed pages in a directory tree structure.
  • Implemented in Java.
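
The first step of a crawler/indexer like the one described above is to collect the links embedded in a target page. The sketch below shows one way to do that with Python's standard library. It is not the IRIS crawler's actual code: the `LinkExtractor` class, the `extract_links` function, and the example.org URLs are hypothetical, and the page is passed in as a string so that the example needs no network access.

```python
from html.parser import HTMLParser
from urllib.parse import urldefrag, urljoin


class LinkExtractor(HTMLParser):
    """Collects the absolute URLs of all <a href="..."> links on a page."""

    def __init__(self, base_url):
        super().__init__()
        self.base_url = base_url
        self.links = []

    def handle_starttag(self, tag, attrs):
        if tag != "a":
            return
        for name, value in attrs:
            if name == "href" and value:
                # Resolve relative links and drop "#fragment" suffixes.
                absolute, _fragment = urldefrag(urljoin(self.base_url, value))
                if absolute not in self.links:
                    self.links.append(absolute)


def extract_links(base_url, html):
    """Return the de-duplicated links found in html, in document order."""
    parser = LinkExtractor(base_url)
    parser.feed(html)
    return parser.links


page = (
    '<a href="about.html">About</a> '
    '<a href="/docs/#top">Docs</a> '
    '<a href="about.html">About again</a>'
)
links = extract_links("http://example.org/index.html", page)
print(links)
```

The resulting URL paths could then be split on "/" and shown as a directory tree, which is the role the Sitemap Display module plays in the description above.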
IRIS Multimedia Project : Fall 1997
 
  • IRIS online tutorial and next-generation interface prototype.
  • Implemented in Macromedia Director and JavaScript.
NICE STEMMER : Fall 1996
 
  • Implements 4 stemmer algorithms.
  • Component of the IRIS indexing module.
  • Implemented in C++.
IRIS Topaz : 1997-1998
 
  • An upgraded version of IRIS Ruby designed to handle large document collections.
  • Used in TREC-6 and TREC-7 experiments.
  • Implemented in Perl, C, C++.
IRIS Ruby : Spring 1996
 
  • The first incarnation of IRIS (Interactive Information Retrieval System).
  • Designed to index and search documents stored on an intranet server.
  • Implemented in Perl.
SOCSA : Fall 1996
 
  • SILS Course Advising System (prototype).
SEED : Spring 1996
 
  • SILS Online Directory (prototype).
CANN : Fall 1995
 
  • WWW Asian American Resource Network.

Page Last Updated: November 19, 2004 1:10