Showing 1–3 of 3 results for author: Kikani, P

  1. arXiv:2206.01268  [pdf, other]

    cs.CL

    MMTM: Multi-Tasking Multi-Decoder Transformer for Math Word Problems

    Authors: Keyur Faldu, Amit Sheth, Prashant Kikani, Darshan Patel

    Abstract: Recently, quite a few novel neural architectures were derived to solve math word problems by predicting expression trees. These architectures varied from seq2seq models, including encoders leveraging graph relationships combined with tree decoders. These models achieve good performance on various MWPs datasets but perform poorly when applied to an adversarial challenge dataset, SVAMP. We present a…

    Submitted 2 June, 2022; originally announced June 2022.

    Comments: 10 pages, 3 figures, 3 tables

  2. arXiv:2111.05364  [pdf, other]

    cs.CL cs.AI

    Towards Tractable Mathematical Reasoning: Challenges, Strategies, and Opportunities for Solving Math Word Problems

    Authors: Keyur Faldu, Amit Sheth, Prashant Kikani, Manas Gaur, Aditi Avasthi

    Abstract: Mathematical reasoning would be one of the next frontiers for artificial intelligence to make significant progress. The ongoing surge to solve math word problems (MWPs) and hence achieve better mathematical reasoning ability would continue to be a key line of research in the coming time. We inspect non-neural and neural methods to solve math word problems narrated in a natural language. We also hi…

    Submitted 29 October, 2021; originally announced November 2021.

    Comments: 15 pages, 2 tables, 4 figures

  3. arXiv:2104.08145  [pdf, other]

    cs.CL cs.AI cs.LG

    KI-BERT: Infusing Knowledge Context for Better Language and Domain Understanding

    Authors: Keyur Faldu, Amit Sheth, Prashant Kikani, Hemang Akbari

    Abstract: Contextualized entity representations learned by state-of-the-art transformer-based language models (TLMs) like BERT, GPT, T5, etc., leverage the attention mechanism to learn the data context from training data corpus. However, these models do not use the knowledge context. Knowledge context can be understood as semantics about entities and their relationship with neighboring entities in knowledge…

    Submitted 3 September, 2021; v1 submitted 9 April, 2021; originally announced April 2021.

    Comments: 10 pages, 4 figures, 4 tables