Jump to content

UIMA: Difference between revisions

From Wikipedia, the free encyclopedia
Content deleted Content added
removed Category:Apache Software Foundation projects using HotCat Only for subtopic
Citation bot (talk | contribs)
Added date. | Use this bot. Report bugs. | Suggested by Лисан аль-Гаиб | #UCB_webform 203/237
 
(12 intermediate revisions by 10 users not shown)
Line 1: Line 1:
'''UIMA''' ({{IPAc-en|j|u|ˈ|iː|m|ə}} {{respell|yoo|EE|mə}}),<ref>[http://uima.apache.org/d/uimaj-2.4.0/overview_and_setup.html#ugr.faqs UIMA Frequently Asked Questions (FAQ's)] The Apache Software Foundation</ref> short for '''Unstructured Information Management Architecture''', is an [[OASIS (organization)|OASIS standard]]<ref>[http://uima.apache.org/uima-specification.html UIMA Specification] The Apache Software Foundation.</ref> for [[content analytics]], originally developed at [[IBM]]. It provides a [[component software]] architecture for the development, discovery, composition, and deployment of [[multi-modal analytics]] for the analysis of [[unstructured information]] and integration with [[Search algorithm|search technologies]].
'''UIMA''' ({{IPAc-en|j|u|ˈ|iː|m|ə}} {{respell|yoo|EE|mə}}),<ref>[http://uima.apache.org/d/uimaj-2.4.0/overview_and_setup.html#ugr.faqs UIMA Frequently Asked Questions (FAQ's)] The Apache Software Foundation</ref> short for '''Unstructured Information Management Architecture''', is an [[OASIS (organization)|OASIS standard]]<ref>[http://uima.apache.org/uima-specification.html UIMA Specification] The Apache Software Foundation.</ref> for [[content analytics]], originally developed at [[IBM]]. It provides a [[component software]] architecture for the development, discovery, composition, and deployment of [[multi-modal analytics]] for the analysis of [[unstructured information]] and integration with [[search algorithm|search technologies]].


== Structure ==
== Structure ==

The UIMA architecture can be thought of in four dimensions:
The UIMA architecture can be thought of in four dimensions:
# It specifies component interfaces in an analytics [[pipeline (software)|pipeline]].
# It specifies component interfaces in an analytics [[pipeline (software)|pipeline]].
# It describes a set of [[Design pattern (computer science)|Design patterns]].
# It describes a set of [[design pattern (computer science)|design patterns]].
# It suggests two data representations: an in-memory representation of annotations for high-performance analytics and an [[XML]] representation of annotations for integration with remote web services.
# It suggests two data representations: an in-memory representation of annotations for high-performance analytics and an [[XML]] representation of annotations for integration with remote web services.
# It suggests development roles allowing tools to be used by users with diverse skills.
# It suggests development roles allowing tools to be used by users with diverse skills.


== Implementations and uses ==
== Implementations and uses ==
{{Infobox software

| name = Apache UIMA
{{ Infobox Software
| name = Apache UIMA
| logo = Apache UIMA logo.svg
| screenshot =
| logo =
| caption =
| screenshot =
| collapsible =
| caption =
| developer = [[IBM]], [[Apache Software Foundation]] (since October 2006)
| collapsible =
| latest release version = 3.1.1
| developer = [[IBM]], [[Apache Software Foundation]] (since October 2006)
| latest release date = {{Start date and age|mf=yes|2019|11|08}}<ref>{{cite web|url=http://uima.apache.org/news.html#08%20Nov%202019|title=Apache UIMA - News|website=uima.apache.org|access-date=11 December 2019}}</ref>
| latest release version = 2.9.0
| latest preview version =
| latest release date = {{release date|mf=yes|2016|08|30}}<ref>{{cite web|url=http://uima.apache.org/news.html#30+Aug+2016|title=Apache UIMA - News|author=|date=|website=uima.apache.org|accessdate=23 April 2018}}</ref>
| latest preview version =
| latest preview date =
| operating system = [[cross-platform]]
| latest preview date =
| programming language = [[Java (programming language)|Java]] with [[C++]] enablement
| operating system = [[cross-platform]]
| genre = [[text mining]], [[information extraction]]
| programming language = [[Java (programming language)|Java]] with [[C++]] enablement
| license = [[Apache License]] 2.0
| genre = [[text mining]], [[information extraction]]
| website = {{URL|https://uima.apache.org/}}
| license = [[Apache License]] 2.0
| website = https://uima.apache.org/
}}
}}


Line 32: Line 30:


UIMA is used in a number of software projects:
UIMA is used in a number of software projects:
* [[IBM Research]]'s [[Watson (computer)|Watson]] uses UIMA for analyzing unstructured data.<ref>{{cite web|url=https://blogs.apache.org/foundation/entry/apache_innovation_bolsters_ibm_s|title=Apache Innovation Bolsters IBM's "Smartest Machine on Earth" in First-ever Man vs. Machine Competition on Jeopardy! Quiz Show : The Apache Software Foundation Blog|author=|date=|website=blogs.apache.org|accessdate=23 April 2018}}</ref>
* [[IBM Research]]'s [[Watson (computer)|Watson]] uses UIMA for analyzing [[unstructured data]].<ref>{{cite web|url=https://blogs.apache.org/foundation/entry/apache_innovation_bolsters_ibm_s|title=Apache Innovation Bolsters IBM's "Smartest Machine on Earth" in First-ever Man vs. Machine Competition on Jeopardy! Quiz Show : The Apache Software Foundation Blog|website=blogs.apache.org|date=14 February 2011 |access-date=23 April 2018}}</ref>
* The Clinical Text Analysis and Knowledge Extraction System ([[CTAKES|Apache cTAKES]]) is a UIMA-based system for information extraction from medical records.
* The Clinical Text Analysis and Knowledge Extraction System ([[CTAKES|Apache cTAKES]]) is a UIMA-based system for information extraction from medical records.
* [[Ubiquitous Knowledge Processing Lab#DKPro|DKPro Core]] is a collection of reusable UIMA components for general-purpose natural language processing.
* [[Ubiquitous Knowledge Processing Lab#DKPro|DKPro Core]] is a collection of reusable UIMA components for general-purpose natural language processing.
Line 41: Line 39:
* [[General Architecture for Text Engineering]] (GATE)
* [[General Architecture for Text Engineering]] (GATE)
* [[IBM Omnifind]]
* [[IBM Omnifind]]
* [[Languageware]]
* [[LanguageWare]]
* [[List of natural language processing toolkits]]


== References ==
== References ==
Line 49: Line 46:
== External links ==
== External links ==
*[https://uima.apache.org/ Apache UIMA home page]
*[https://uima.apache.org/ Apache UIMA home page]
*[http://www.oasis-open.org/committees/tc_home.php?wg_abbrev=uima OASIS Unstructured Information Management Architecture (UIMA) TC]


{{Apache}}
{{Apache Software Foundation}}


[[Category:Apache Software Foundation projects]]
[[Category:Software architecture]]
[[Category:Software architecture]]
[[Category:Data mining and machine learning software]]
[[Category:Data mining and machine learning software]]

Latest revision as of 03:00, 12 April 2024

UIMA (/juˈmə/ yoo-EE-mə),[1] short for Unstructured Information Management Architecture, is an OASIS standard[2] for content analytics, originally developed at IBM. It provides a component software architecture for the development, discovery, composition, and deployment of multi-modal analytics for the analysis of unstructured information and integration with search technologies.

Structure

[edit]

The UIMA architecture can be thought of in four dimensions:

  1. It specifies component interfaces in an analytics pipeline.
  2. It describes a set of design patterns.
  3. It suggests two data representations: an in-memory representation of annotations for high-performance analytics and an XML representation of annotations for integration with remote web services.
  4. It suggests development roles allowing tools to be used by users with diverse skills.

Implementations and uses

[edit]
Apache UIMA
Developer(s)IBM, Apache Software Foundation (since October 2006)
Stable release
3.1.1 / November 8, 2019; 4 years ago (2019-11-08)[3]
Repository
Written inJava with C++ enablement
Operating systemcross-platform
Typetext mining, information extraction
LicenseApache License 2.0
Websiteuima.apache.org

Apache UIMA, a reference implementation of UIMA, is maintained by the Apache Software Foundation.

UIMA is used in a number of software projects:

  • IBM Research's Watson uses UIMA for analyzing unstructured data.[4]
  • The Clinical Text Analysis and Knowledge Extraction System (Apache cTAKES) is a UIMA-based system for information extraction from medical records.
  • DKPro Core is a collection of reusable UIMA components for general-purpose natural language processing.

See also

[edit]

References

[edit]
  1. ^ UIMA Frequently Asked Questions (FAQ's) The Apache Software Foundation
  2. ^ UIMA Specification The Apache Software Foundation.
  3. ^ "Apache UIMA - News". uima.apache.org. Retrieved 11 December 2019.
  4. ^ "Apache Innovation Bolsters IBM's "Smartest Machine on Earth" in First-ever Man vs. Machine Competition on Jeopardy! Quiz Show : The Apache Software Foundation Blog". blogs.apache.org. 14 February 2011. Retrieved 23 April 2018.
[edit]