Skip to main content

Showing 1–13 of 13 results for author: Ganchev, K

Searching in archive cs. Search in all archives.
.
  1. arXiv:2405.05938  [pdf, other

    cs.CL

    DOLOMITES: Domain-Specific Long-Form Methodical Tasks

    Authors: Chaitanya Malaviya, Priyanka Agrawal, Kuzman Ganchev, Pranesh Srinivasan, Fantine Huot, Jonathan Berant, Mark Yatskar, Dipanjan Das, Mirella Lapata, Chris Alberti

    Abstract: Experts in various fields routinely perform methodical writing tasks to plan, organize, and report their work. From a clinician writing a differential diagnosis for a patient, to a teacher writing a lesson plan for students, these tasks are pervasive, requiring to methodically generate structured long-form output for a given input. We develop a typology of methodical tasks structured in the form o… ▽ More

    Submitted 28 May, 2024; v1 submitted 9 May, 2024; originally announced May 2024.

    Comments: Dataset now available at https://dolomites-benchmark.github.io

  2. arXiv:2305.00034  [pdf, other

    cs.CL

    Text-Blueprint: An Interactive Platform for Plan-based Conditional Generation

    Authors: Fantine Huot, Joshua Maynez, Shashi Narayan, Reinald Kim Amplayo, Kuzman Ganchev, Annie Louis, Anders Sandholm, Dipanjan Das, Mirella Lapata

    Abstract: While conditional generation models can now generate natural language well enough to create fluent text, it is still difficult to control the generation process, leading to irrelevant, repetitive, and hallucinated content. Recent work shows that planning can be a useful intermediate step to render conditional generation less opaque and more grounded. We present a web browser-based demonstration fo… ▽ More

    Submitted 28 April, 2023; originally announced May 2023.

    Comments: Accepted at EACL Call for System Demonstrations 2023

  3. arXiv:2212.08037  [pdf, other

    cs.CL

    Attributed Question Answering: Evaluation and Modeling for Attributed Large Language Models

    Authors: Bernd Bohnet, Vinh Q. Tran, Pat Verga, Roee Aharoni, Daniel Andor, Livio Baldini Soares, Massimiliano Ciaramita, Jacob Eisenstein, Kuzman Ganchev, Jonathan Herzig, Kai Hui, Tom Kwiatkowski, Ji Ma, Jianmo Ni, Lierni Sestorain Saralegui, Tal Schuster, William W. Cohen, Michael Collins, Dipanjan Das, Donald Metzler, Slav Petrov, Kellie Webster

    Abstract: Large language models (LLMs) have shown impressive results while requiring little or no direct supervision. Further, there is mounting evidence that LLMs may have potential in information-seeking scenarios. We believe the ability of an LLM to attribute the text that it generates is likely to be crucial in this setting. We formulate and study Attributed QA as a key first step in the development of… ▽ More

    Submitted 10 February, 2023; v1 submitted 15 December, 2022; originally announced December 2022.

  4. arXiv:2211.09070  [pdf, other

    cs.CL

    Towards Computationally Verifiable Semantic Grounding for Language Models

    Authors: Chris Alberti, Kuzman Ganchev, Michael Collins, Sebastian Gehrmann, Ciprian Chelba

    Abstract: The paper presents an approach to semantic grounding of language models (LMs) that conceptualizes the LM as a conditional model generating text given a desired semantic message formalized as a set of entity-relationship triples. It embeds the LM in an auto-encoder by feeding its output to a semantic parser whose output is in the same representation domain as the input message. Compared to a baseli… ▽ More

    Submitted 16 November, 2022; originally announced November 2022.

  5. arXiv:2211.08264  [pdf, other

    cs.CL

    QAmeleon: Multilingual QA with Only 5 Examples

    Authors: Priyanka Agrawal, Chris Alberti, Fantine Huot, Joshua Maynez, Ji Ma, Sebastian Ruder, Kuzman Ganchev, Dipanjan Das, Mirella Lapata

    Abstract: The availability of large, high-quality datasets has been one of the main drivers of recent progress in question answering (QA). Such annotated datasets however are difficult and costly to collect, and rarely exist in languages other than English, rendering QA technology inaccessible to underrepresented languages. An alternative to building large monolingual training datasets is to leverage pre-tr… ▽ More

    Submitted 7 August, 2023; v1 submitted 15 November, 2022; originally announced November 2022.

    Comments: To Appear at Transactions of Association for Computational Linguistics (TACL)

  6. arXiv:2207.00397  [pdf, ps, other

    cs.CL

    Conditional Generation with a Question-Answering Blueprint

    Authors: Shashi Narayan, Joshua Maynez, Reinald Kim Amplayo, Kuzman Ganchev, Annie Louis, Fantine Huot, Anders Sandholm, Dipanjan Das, Mirella Lapata

    Abstract: The ability to convey relevant and faithful information is critical for many tasks in conditional generation and yet remains elusive for neural seq-to-seq models whose outputs often reveal hallucinations and fail to correctly cover important details. In this work, we advocate planning as a useful intermediate representation for rendering conditional generation less opaque and more grounded. Our wo… ▽ More

    Submitted 1 May, 2023; v1 submitted 1 July, 2022; originally announced July 2022.

    Comments: 22 pages, Accepted at TACL. Pre-MIT Press publication version

  7. arXiv:2109.15121  [pdf, ps, other

    cs.CL cs.AI cs.LG

    Feature-Rich Named Entity Recognition for Bulgarian Using Conditional Random Fields

    Authors: Georgi Georgiev, Preslav Nakov, Kuzman Ganchev, Petya Osenova, Kiril Ivanov Simov

    Abstract: The paper presents a feature-rich approach to the automatic recognition and categorization of named entities (persons, organizations, locations, and miscellaneous) in news text for Bulgarian. We combine well-established features used for other languages with language-specific lexical, syntactic and morphological information. In particular, we make use of the rich tagset annotation of the BulTreeBa… ▽ More

    Submitted 26 September, 2021; originally announced September 2021.

    Comments: named entity recognition, NER, conditional random fields, CRF, Bulgarian, BulTreeBank

    MSC Class: 68T50 ACM Class: F.2.2; I.2.7

    Journal ref: RANLP-2009

  8. arXiv:1808.06511  [pdf, ps, other

    cs.CL cs.AI

    State-of-the-art Chinese Word Segmentation with Bi-LSTMs

    Authors: Ji Ma, Kuzman Ganchev, David Weiss

    Abstract: A wide variety of neural-network architectures have been proposed for the task of Chinese word segmentation. Surprisingly, we find that a bidirectional LSTM model, when combined with standard deep learning techniques and best practices, can achieve better accuracy on many of the popular datasets as compared to models based on more complex neural-network architectures. Furthermore, our error an… ▽ More

    Submitted 24 August, 2018; v1 submitted 20 August, 2018; originally announced August 2018.

  9. arXiv:1603.06042  [pdf, ps, other

    cs.CL cs.LG cs.NE

    Globally Normalized Transition-Based Neural Networks

    Authors: Daniel Andor, Chris Alberti, David Weiss, Aliaksei Severyn, Alessandro Presta, Kuzman Ganchev, Slav Petrov, Michael Collins

    Abstract: We introduce a globally normalized transition-based neural network model that achieves state-of-the-art part-of-speech tagging, dependency parsing and sentence compression results. Our model is a simple feed-forward neural network that operates on a task-specific transition system, yet achieves comparable or better accuracies than recurrent models. We discuss the importance of global as opposed to… ▽ More

    Submitted 8 June, 2016; v1 submitted 18 March, 2016; originally announced March 2016.

  10. arXiv:1412.1820  [pdf, other

    cs.CL

    Context-Dependent Fine-Grained Entity Type Tagging

    Authors: Dan Gillick, Nevena Lazic, Kuzman Ganchev, Jesse Kirchner, David Huynh

    Abstract: Entity type tagging is the task of assigning category labels to each mention of an entity in a document. While standard systems focus on a small set of types, recent work (Ling and Weld, 2012) suggests that using a large fine-grained label set can lead to dramatic improvements in downstream tasks. In the absence of labeled training data, existing fine-grained tagging systems obtain examples automa… ▽ More

    Submitted 1 August, 2016; v1 submitted 3 December, 2014; originally announced December 2014.

  11. arXiv:1401.6131  [pdf

    cs.CL cs.LG

    Controlling Complexity in Part-of-Speech Induction

    Authors: João V. Graça, Kuzman Ganchev, Luisa Coheur, Fernando Pereira, Ben Taskar

    Abstract: We consider the problem of fully unsupervised learning of grammatical (part-of-speech) categories from unlabeled text. The standard maximum-likelihood hidden Markov model for this task performs poorly, because of its weak inductive bias and large model capacity. We address this problem by refining the model and modifying the learning objective to control its capacity via para- metric and non-param… ▽ More

    Submitted 16 January, 2014; originally announced January 2014.

    Journal ref: Journal Of Artificial Intelligence Research, Volume 41, pages 527-551, 2011

  12. arXiv:1206.3256  [pdf

    cs.LG stat.ML

    Multi-View Learning over Structured and Non-Identical Outputs

    Authors: Kuzman Ganchev, Joao Graca, John Blitzer, Ben Taskar

    Abstract: In many machine learning problems, labeled training data is limited but unlabeled data is ample. Some of these problems have instances that can be factored into multiple views, each of which is nearly sufficent in determining the correct labels. In this paper we present a new algorithm for probabilistic multi-view learning which uses the idea of stochastic agreement between views as regularization… ▽ More

    Submitted 13 June, 2012; originally announced June 2012.

    Comments: Appears in Proceedings of the Twenty-Fourth Conference on Uncertainty in Artificial Intelligence (UAI2008)

    Report number: UAI-P-2008-PG-204-211

  13. arXiv:1205.2646  [pdf

    cs.LG cs.GT

    Censored Exploration and the Dark Pool Problem

    Authors: Kuzman Ganchev, Michael Kearns, Yuriy Nevmyvaka, Jennifer Wortman Vaughan

    Abstract: We introduce and analyze a natural algorithm for multi-venue exploration from censored data, which is motivated by the Dark Pool Problem of modern quantitative finance. We prove that our algorithm converges in polynomial time to a near-optimal allocation policy; prior results for similar problems in stochastic inventory control guaranteed only asymptotic convergence and examined variants in which… ▽ More

    Submitted 9 May, 2012; originally announced May 2012.

    Comments: Appears in Proceedings of the Twenty-Fifth Conference on Uncertainty in Artificial Intelligence (UAI2009)

    Report number: UAI-P-2009-PG-185-194