UIMA: Difference between revisions

Apache UIMA
Developer(s)	IBM, Apache Software Foundation (since October 2006)
Stable release	3.1.1 / November 8, 2019
Repository	svn.apache.org/repos/asf/uima/
Written in	Java with C++ enablement
Operating system	cross-platform
Type	text mining, information extraction
License	Apache License 2.0
Website	uima.apache.org

Latest revision as of 03:00, 12 April 2024

UIMA (/juˈiːmə/ yoo-EE-mə),^[1] short for Unstructured Information Management Architecture, is an OASIS standard^[2] for content analytics, originally developed at IBM. It provides a component software architecture for the development, discovery, composition, and deployment of multi-modal analytics for the analysis of unstructured information and integration with search technologies.

Structure

The UIMA architecture can be thought of in four dimensions:

It specifies component interfaces in an analytics pipeline.
It describes a set of design patterns.
It suggests two data representations: an in-memory representation of annotations for high-performance analytics and an XML representation of annotations for integration with remote web services.
It suggests development roles allowing tools to be used by users with diverse skills.
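
The first and third dimensions can be illustrated with a minimal sketch. It does not use the Apache UIMA API: every name in it (AnalysisStructure, Annotation, token_annotator, capitalized_annotator, run_pipeline, to_xml) is hypothetical, and the XML layout is an illustrative assumption rather than the format defined by the UIMA specification. It only shows the general idea of components that share one interface, add stand-off annotations to a common in-memory structure, and hand the result to an XML serializer for exchange with other services.

```python
import re
import xml.etree.ElementTree as ET
from dataclasses import dataclass, field


@dataclass
class Annotation:
    """Stand-off annotation: a typed span over the document text."""
    type: str
    begin: int
    end: int


@dataclass
class AnalysisStructure:
    """Hypothetical in-memory container shared by all pipeline components."""
    text: str
    annotations: list = field(default_factory=list)

    def covered_text(self, ann):
        return self.text[ann.begin:ann.end]


def token_annotator(doc):
    """Component 1: mark every run of word characters as a Token."""
    for m in re.finditer(r"\w+", doc.text):
        doc.annotations.append(Annotation("Token", m.start(), m.end()))


def capitalized_annotator(doc):
    """Component 2: build on the Token annotations left by an earlier component."""
    # Iterate over a copy, because this loop appends to the same list.
    for ann in [a for a in doc.annotations if a.type == "Token"]:
        if doc.covered_text(ann)[0].isupper():
            doc.annotations.append(Annotation("Capitalized", ann.begin, ann.end))


def run_pipeline(text, components):
    """Pass one shared structure through each component in order."""
    doc = AnalysisStructure(text)
    for component in components:
        component(doc)
    return doc


def to_xml(doc):
    """Serialize the in-memory annotations to an illustrative XML form."""
    root = ET.Element("document")
    ET.SubElement(root, "text").text = doc.text
    for ann in doc.annotations:
        ET.SubElement(root, "annotation", type=ann.type,
                      begin=str(ann.begin), end=str(ann.end))
    return ET.tostring(root, encoding="unicode")


doc = run_pipeline("Watson uses UIMA", [token_annotator, capitalized_annotator])
print(to_xml(doc))
```

Because each component takes and mutates the same structure, components can be reordered or swapped without changing the pipeline runner, and later components can rely on annotations produced by earlier ones. The in-memory list is what the components work on; the XML form is produced only at the boundary.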

Implementations and uses

Apache UIMA, a reference implementation of UIMA, is maintained by the Apache Software Foundation.

UIMA is used in a number of software projects:

IBM Research's Watson uses UIMA for analyzing unstructured data.^[4]
The Clinical Text Analysis and Knowledge Extraction System (Apache cTAKES) is a UIMA-based system for information extraction from medical records.
DKPro Core is a collection of reusable UIMA components for general-purpose natural language processing.

References

[1] UIMA Frequently Asked Questions (FAQ's). The Apache Software Foundation.
[2] UIMA Specification. The Apache Software Foundation.
[3] "Apache UIMA - News". uima.apache.org. Retrieved 11 December 2019.
[4] "Apache Innovation Bolsters IBM's "Smartest Machine on Earth" in First-ever Man vs. Machine Competition on Jeopardy! Quiz Show : The Apache Software Foundation Blog". blogs.apache.org. 14 February 2011. Retrieved 23 April 2018.

External links

Apache UIMA home page


@@ Line 1: / Line 1: @@
-'''UIMA''' ({{IPAc-en|j|u|ˈ|iː|m|ə}} {{respell|yoo|EE|mə}}),<ref>[http://uima.apache.org/d/uimaj-2.4.0/overview_and_setup.html#ugr.faqs UIMA Frequently Asked Questions (FAQ's)] The Apache Software Foundation</ref> short for '''Unstructured Information Management Architecture''', is an [[OASIS (organization)|OASIS standard]]<ref>[http://uima.apache.org/uima-specification.html UIMA Specification] The Apache Software Foundation.</ref>  for [[content analytics]], originally developed at [[IBM]].  It provides a [[component software]] architecture for the development, discovery, composition, and deployment of [[multi-modal analytics]] for the analysis of [[unstructured information]] and integration with [[Search algorithm|search technologies]].
+'''UIMA''' ({{IPAc-en|j|u|ˈ|iː|m|ə}} {{respell|yoo|EE|mə}}),<ref>[http://uima.apache.org/d/uimaj-2.4.0/overview_and_setup.html#ugr.faqs UIMA Frequently Asked Questions (FAQ's)] The Apache Software Foundation</ref> short for '''Unstructured Information Management Architecture''', is an [[OASIS (organization)|OASIS standard]]<ref>[http://uima.apache.org/uima-specification.html UIMA Specification] The Apache Software Foundation.</ref>  for [[content analytics]], originally developed at [[IBM]].  It provides a [[component software]] architecture for the development, discovery, composition, and deployment of [[multi-modal analytics]] for the analysis of [[unstructured information]] and integration with [[search algorithm|search technologies]].
 == Structure ==
 The UIMA architecture can be thought of in four dimensions:
 # It specifies component interfaces in an analytics [[pipeline (software)|pipeline]].
-# It describes a set of [[Design pattern (computer science)|Design patterns]].
+# It describes a set of [[design pattern (computer science)|design patterns]].
 # It suggests two data representations: an in-memory representation of annotations for high-performance analytics and an [[XML]] representation of annotations for integration with remote web services.
 # It suggests development roles allowing tools to be used by users with diverse skills.
 == Implementations and uses ==
+{{Infobox software
+| name = Apache UIMA
-{{ Infobox Software
-| name                   = Apache UIMA
+| logo = Apache UIMA logo.svg
+| screenshot =
-| logo                   =
+| caption =
-| screenshot             =
+| collapsible =
-| caption                =
+| developer = [[IBM]], [[Apache Software Foundation]] (since October 2006)
-| collapsible            =
+| latest release version = 3.1.1
-| developer              = [[IBM]], [[Apache Software Foundation]] (since October 2006)
+| latest release date = {{Start date and age|mf=yes|2019|11|08}}<ref>{{cite web|url=http://uima.apache.org/news.html#08%20Nov%202019|title=Apache UIMA - News|website=uima.apache.org|access-date=11 December 2019}}</ref>
-| latest release version = 2.9.0
+| latest preview version =
-| latest release date    = {{release date|mf=yes|2016|08|30}}<ref>{{cite web|url=http://uima.apache.org/news.html#30+Aug+2016|title=Apache UIMA - News|author=|date=|website=uima.apache.org|accessdate=23 April 2018}}</ref>
-| latest preview version =
+| latest preview date =
+| operating system = [[cross-platform]]
-| latest preview date    =
+| programming language = [[Java (programming language)|Java]] with [[C++]] enablement
-| operating system       = [[cross-platform]]
+| genre = [[text mining]], [[information extraction]]
-| programming language   = [[Java (programming language)|Java]] with [[C++]] enablement
+| license = [[Apache License]] 2.0
-| genre                  = [[text mining]], [[information extraction]]
+| website = {{URL|https://uima.apache.org/}}
-| license                = [[Apache License]] 2.0
-| website                = https://uima.apache.org/
 }}
@@ Line 32: / Line 30: @@
 UIMA is used in a number of software projects:
-* [[IBM Research]]'s [[Watson (computer)|Watson]] uses UIMA for analyzing unstructured data.<ref>{{cite web|url=https://blogs.apache.org/foundation/entry/apache_innovation_bolsters_ibm_s|title=Apache Innovation Bolsters IBM's "Smartest Machine on Earth" in First-ever Man vs. Machine Competition on Jeopardy! Quiz Show : The Apache Software Foundation Blog|author=|date=|website=blogs.apache.org|accessdate=23 April 2018}}</ref>
+* [[IBM Research]]'s [[Watson (computer)|Watson]] uses UIMA for analyzing [[unstructured data]].<ref>{{cite web|url=https://blogs.apache.org/foundation/entry/apache_innovation_bolsters_ibm_s|title=Apache Innovation Bolsters IBM's "Smartest Machine on Earth" in First-ever Man vs. Machine Competition on Jeopardy! Quiz Show : The Apache Software Foundation Blog|website=blogs.apache.org|date=14 February 2011 |access-date=23 April 2018}}</ref>
 * The Clinical Text Analysis and Knowledge Extraction System ([[CTAKES|Apache cTAKES]]) is a UIMA-based system for information extraction from medical records.
 * [[Ubiquitous Knowledge Processing Lab#DKPro|DKPro Core]] is a collection of reusable UIMA components for general-purpose natural language processing.
@@ Line 41: / Line 39: @@
 * [[General Architecture for Text Engineering]] (GATE)
 * [[IBM Omnifind]]
-* [[Languageware]]
+* [[LanguageWare]]
-* [[List of natural language processing toolkits]]
 == References ==
@@ Line 49: / Line 46: @@
 == External links ==
 *[https://uima.apache.org/ Apache UIMA home page]
-*[http://www.oasis-open.org/committees/tc_home.php?wg_abbrev=uima OASIS Unstructured Information Management Architecture (UIMA) TC]
-{{Apache}}
+{{Apache Software Foundation}}
+[[Category:Apache Software Foundation projects]]
 [[Category:Software architecture]]
 [[Category:Data mining and machine learning software]]
