General Architecture for Text Engineering

GATE
Developer(s)GATE research team, Dept. Computer Science, University of Sheffield
Initial release1995 (1995)
Stable release8.6.1 (January 17, 2020 (2020-01-17)) [±]
Preview release9.0-SNAPSHOT (June 17, 2025 (Nightly builds released every day)) [±]
Repository
Written inJava
Operating systemCross-platform
Available inEnglish
TypeText mining Information extraction
LicenseLGPL
Websitegate.ac.uk

General Architecture for Text Engineering (GATE) is a Java suite of natural language processing (NLP) tools for man tasks, including information extraction in many languages. It is now used worldwide by a wide community of scientists, companies, teachers and students. It was originally developed at the University of Sheffield beginning in 1995.

As of May 28, 2011, 881 people are on the gate-users mailing list at SourceForge.net, and 111,932 downloads from SourceForge are recorded since the project moved to SourceForge in 2005. The paper "GATE: A framework and graphical development environment for robust NLP tools and applications" has received over 2000 citations since publication (according to Google Scholar). Books covering the use of GATE, in addition to the GATE User Guide, include "Building Search Applications: Lucene, LingPipe, and Gate", by Manu Konchady, and "Introduction to Linguistic Annotation and Text Analytics", by Graham Wilcock.

GATE community and research has been involved in several European research projects including: Transitioning Applications to Ontologies, SEKT, NeOn, Media-Campaign, Musing, Service-Finder, LIRICS and KnowledgeWeb.