Terrier is a software for the rapid development of Web, intranet and desktop search engines. More generally, it is a modular platform for the rapid development of large-scale Information Retrieval applications, providing indexing and retrieval functionalities.
Terrier has various cutting-edge features including parameter-free probabilistic retrieval approaches (such as Divergence from Randomness models), automatic query expansion/re-formulation methodologies, and efficient data compression techniques. Terrier comes with a powerful proof-of-concept Desktop search application [Screenshots], and full TREC capabilities including the ability to index, query and evaluate the standard TREC collections, such as AP, WSJ, WT10G, .GOV and .GOV2.
Terrier is written in Java [Requirements] and has been successfully used for adhoc retrieval, Web search and cross-language retrieval, in a centralised or distributed setting. Currently, it is also being used for running various applications.
A core version of Terrier is now available for download [What's new], as open source software (version 1.0.2) under the Mozilla Public License (MPL), with the aim to facilitate experimentation and research in the wider Information Retrieval community.
Terrier is a modular platform for the rapid development of large-scale Information Retrieval applications, providing indexing and retrieval functionalities. Terrier is based on the Divergence from Randomness (DFR) framework. It can index various document collections, including the standard TREC collections, such as AP, WSJ, WT10G, .GOV and .GOV2. It also provides a wide range of parameter-free weighting approaches and full-text search algorithms, aiming to offer a public testbed for performing Information Retrieval experiments.
Terrier is a modular platform for the rapid development of Information Retrieval applications with JAVA, providing indexing and retrieval functionalities, based on the Divergence from Randomness (DFR) framework. It can index standard TREC test collections, such as AP, WSJ, WT10G, .GOV and .GOV2. It also provides a wide range of parameter-free weighting approaches and full-text search algorithms, aiming to offer a public testbed for performing Information Retrieval experiments. Terrier has been successfully used for adhoc, Web search and cross-language retrieval, in a centralised or distributed setting. Currently it is used for running the departmental search engine.
A version of Terrier is now available for download as open source software . It is distributed under the Mozilla Public License (MPL).
–>
You can find more information about Terrier, the DFR Framework and how to install/run Terrier in the documentation section, and in the Terrier Wiki
Terrier was developed by members of the Information Retrieval Research Group, Department of Computing Science, at the University of Glasgow. The project is funded by a UK Engineering and Physical Sciences Research Council (EPSRC) grant, number GR/R90543/01.