Showing 1–28 of 28 results for author: Purohit, M

Searching in archive cs.
  1. arXiv:2407.17712  [pdf, other]

    cs.DS cs.LG

    Improving Online Algorithms via ML Predictions

    Authors: Ravi Kumar, Manish Purohit, Zoya Svitkina

    Abstract: In this work we study the problem of using machine-learned predictions to improve the performance of online algorithms. We consider two classical problems, ski rental and non-clairvoyant job scheduling, and obtain new online algorithms that use predictions to make their decisions. These algorithms are oblivious to the performance of the predictor, improve with better predictions, but do not degrad…

    Submitted 24 July, 2024; originally announced July 2024.

    Comments: Conference version appeared in NeurIPS 2018

  2. Online Load and Graph Balancing for Random Order Inputs

    Authors: Sungjin Im, Ravi Kumar, Shi Li, Aditya Petety, Manish Purohit

    Abstract: Online load balancing for heterogeneous machines aims to minimize the makespan (maximum machine workload) by scheduling arriving jobs with varying sizes on different machines. In the adversarial setting, where an adversary chooses not only the collection of job sizes but also their arrival order, the problem is well-understood and the optimal competitive ratio is known to be $\Theta(\log m)$ where $m$…

    Submitted 20 May, 2024; v1 submitted 13 May, 2024; originally announced May 2024.

  3. arXiv:2312.13526  [pdf, other]

    cs.DS

    New Tools for Peak Memory Scheduling

    Authors: Ce Jin, Manish Purohit, Zoya Svitkina, Erik Vee, Joshua R. Wang

    Abstract: We study scheduling of computation graphs to minimize peak memory consumption, an increasingly critical task due to the surge in popularity of large deep-learning models. This problem corresponds to the weighted version of the classical one-shot black pebbling game. We propose the notion of a dominant schedule to capture the idea of finding the "best" schedule for a subgraph and introduce new to…

    Submitted 20 December, 2023; originally announced December 2023.

  4. arXiv:2311.08657  [pdf, other]

    cs.CV cs.LG

    ConeQuest: A Benchmark for Cone Segmentation on Mars

    Authors: Mirali Purohit, Jacob Adler, Hannah Kerner

    Abstract: Over the years, space scientists have collected terabytes of Mars data from satellites and rovers. One important set of features identified in Mars orbital images is pitted cones, which are interpreted to be mud volcanoes believed to form in regions that were once saturated in water (i.e., a lake or ocean). Identifying pitted cones globally on Mars would be of great importance, but expert geologis…

    Submitted 14 November, 2023; originally announced November 2023.

    Comments: Accepted at WACV 2024

  5. arXiv:2305.07440  [pdf, other]

    cs.PF cs.AI cs.LG

    Optimizing Memory Mapping Using Deep Reinforcement Learning

    Authors: Pengming Wang, Mikita Sazanovich, Berkin Ilbeyi, Phitchaya Mangpo Phothilimthana, Manish Purohit, Han Yang Tay, Ngân Vũ, Miaosen Wang, Cosmin Paduraru, Edouard Leurent, Anton Zhernov, Po-Sen Huang, Julian Schrittwieser, Thomas Hubert, Robert Tung, Paula Kurylowicz, Kieran Milan, Oriol Vinyals, Daniel J. Mankowitz

    Abstract: Resource scheduling and allocation is a critical component of many high impact systems ranging from congestion control to cloud computing. Finding more optimal solutions to these problems often has significant impact on resource and time savings, reducing device wear-and-tear, and even potentially improving carbon emissions. In this paper, we focus on a specific instance of a scheduling problem, n…

    Submitted 17 October, 2023; v1 submitted 11 May, 2023; originally announced May 2023.

  6. arXiv:2305.02508  [pdf, other]

    cs.DS

    Efficient Caching with Reserves via Marking

    Authors: Sharat Ibrahimpur, Manish Purohit, Zoya Svitkina, Erik Vee, Joshua R. Wang

    Abstract: Online caching is among the most fundamental and well-studied problems in the area of online algorithms. Innovative algorithmic ideas and analysis -- including potential functions and primal-dual techniques -- give insight into this still-growing area. Here, we introduce a new analysis technique that first uses a potential function to upper bound the cost of an online algorithm and then pairs that…

    Submitted 3 May, 2023; originally announced May 2023.

    Comments: 23 pages

    ACM Class: F.2.2

  7. arXiv:2304.14065  [pdf, other]

    cs.CV cs.AI

    Lightweight, Pre-trained Transformers for Remote Sensing Timeseries

    Authors: Gabriel Tseng, Ruben Cartuyvels, Ivan Zvonkov, Mirali Purohit, David Rolnick, Hannah Kerner

    Abstract: Machine learning methods for satellite data have a range of societally relevant applications, but labels used to train models can be difficult or impossible to acquire. Self-supervision is a natural solution in settings with limited labeled data, but current self-supervised models for satellite data fail to take advantage of the characteristics of that data, including the temporal dimension (which…

    Submitted 4 February, 2024; v1 submitted 27 April, 2023; originally announced April 2023.

  8. arXiv:2207.05975  [pdf, ps, other]

    cs.DS

    Caching with Reserves

    Authors: Sharat Ibrahimpur, Manish Purohit, Zoya Svitkina, Erik Vee, Joshua Wang

    Abstract: Caching is a crucial component of many computer systems, so naturally it is a well-studied topic in algorithm design. Much of traditional caching research studies cache management for a single-user or single-processor environment. In this paper, we propose two related generalizations of the classical caching problem that capture issues that arise in a multi-user or multi-processor environment. In…

    Submitted 13 July, 2022; originally announced July 2022.

  9. arXiv:2204.07705  [pdf, other]

    cs.CL cs.AI

    Super-NaturalInstructions: Generalization via Declarative Instructions on 1600+ NLP Tasks

    Authors: Yizhong Wang, Swaroop Mishra, Pegah Alipoormolabashi, Yeganeh Kordi, Amirreza Mirzaei, Anjana Arunkumar, Arjun Ashok, Arut Selvan Dhanasekaran, Atharva Naik, David Stap, Eshaan Pathak, Giannis Karamanolakis, Haizhi Gary Lai, Ishan Purohit, Ishani Mondal, Jacob Anderson, Kirby Kuznia, Krima Doshi, Maitreya Patel, Kuntal Kumar Pal, Mehrad Moradshahi, Mihir Parmar, Mirali Purohit, Neeraj Varshney, Phani Rohitha Kaza, et al. (15 additional authors not shown)

    Abstract: How well can NLP models generalize to a variety of unseen tasks when provided with task instructions? To address this question, we first introduce Super-NaturalInstructions, a benchmark of 1,616 diverse NLP tasks and their expert-written instructions. Our collection covers 76 distinct task types, including but not limited to classification, extraction, infilling, sequence tagging, text rewriting,…

    Submitted 24 October, 2022; v1 submitted 15 April, 2022; originally announced April 2022.

    Comments: Accepted to EMNLP 2022, 25 pages

  10. arXiv:2204.07600  [pdf, other]

    cs.CL

    In-BoXBART: Get Instructions into Biomedical Multi-Task Learning

    Authors: Mihir Parmar, Swaroop Mishra, Mirali Purohit, Man Luo, M. Hassan Murad, Chitta Baral

    Abstract: Single-task models have proven pivotal in solving specific tasks; however, they have limitations in real-world applications where multi-tasking is necessary and domain shifts are exhibited. Recently, instructional prompts have shown significant improvement towards multi-task generalization; however, the effect of instructional prompts and Multi-Task Learning (MTL) has not been systematically studi…

    Submitted 15 April, 2022; originally announced April 2022.

    Comments: NAACL 2022 Findings

  11. arXiv:2202.04262  [pdf, other]

    cs.DS cs.LG

    Parsimonious Learning-Augmented Caching

    Authors: Sungjin Im, Ravi Kumar, Aditya Petety, Manish Purohit

    Abstract: Learning-augmented algorithms -- in which traditional algorithms are augmented with machine-learned predictions -- have emerged as a framework to go beyond worst-case analysis. The overarching goal is to design algorithms that perform near-optimally when the predictions are accurate yet retain certain worst-case guarantees irrespective of the accuracy of the predictions. This framework has been s…

    Submitted 8 February, 2022; originally announced February 2022.

  12. arXiv:2112.03255  [pdf, other]

    cs.CC

    NP-completeness of the Active Time Scheduling Problem

    Authors: Sagnik Saha, Manish Purohit

    Abstract: In this paper, we study the active time scheduling problem. We are given n jobs with integral processing times, each of which has an integral release time and deadline. The goal is to schedule all the jobs on a machine that can work on b jobs simultaneously, and the objective is to minimize the number of time slots for which the machine is active. The active time scheduling model was introduced by…

    Submitted 6 December, 2021; originally announced December 2021.

  13. arXiv:2111.05257  [pdf, ps, other]

    cs.LG cs.DS stat.ML

    Logarithmic Regret from Sublinear Hints

    Authors: Aditya Bhaskara, Ashok Cutkosky, Ravi Kumar, Manish Purohit

    Abstract: We consider the online linear optimization problem, where at every step the algorithm plays a point $x_t$ in the unit ball, and suffers loss $\langle c_t, x_t\rangle$ for some cost vector $c_t$ that is then revealed to the algorithm. Recent work showed that if an algorithm receives a hint $h_t$ that has non-trivial correlation with $c_t$ before it plays $x_t$, then it can achieve a regret guarante…

    Submitted 9 November, 2021; originally announced November 2021.

  14. arXiv:2108.02770  [pdf, other]

    cs.DS

    Scheduling with Communication Delay in Near-Linear Time

    Authors: Quanquan C. Liu, Manish Purohit, Zoya Svitkina, Erik Vee, Joshua R. Wang

    Abstract: We consider the problem of efficiently scheduling jobs with precedence constraints on a set of identical machines in the presence of a uniform communication delay. Such precedence-constrained jobs can be modeled as a directed acyclic graph, $G = (V, E)$. In this setting, if two precedence-constrained jobs $u$ and $v$, with $v$ dependent on $u$ ($u \prec v$), are scheduled on different machines, th…

    Submitted 29 January, 2022; v1 submitted 5 August, 2021; originally announced August 2021.

    Comments: To appear in STACS 2022

  15. arXiv:2012.13115  [pdf, other]

    cs.LG stat.ML

    Upper Confidence Bounds for Combining Stochastic Bandits

    Authors: Ashok Cutkosky, Abhimanyu Das, Manish Purohit

    Abstract: We provide a simple method to combine stochastic bandit algorithms. Our approach is based on a "meta-UCB" procedure that treats each of $N$ individual bandit algorithms as arms in a higher-level $N$-armed bandit problem that we solve with a variant of the classic UCB algorithm. Our final regret depends only on the regret of the base algorithm with the best regret in hindsight. This approach provid…

    Submitted 24 December, 2020; originally announced December 2020.

  16. arXiv:2011.09076  [pdf, other]

    cs.DS

    Learning-Augmented Weighted Paging

    Authors: Nikhil Bansal, Christian Coester, Ravi Kumar, Manish Purohit, Erik Vee

    Abstract: We consider a natural semi-online model for weighted paging, where at any time the algorithm is given predictions, possibly with errors, about the next arrival of each page. The model is inspired by Belady's classic optimal offline algorithm for unweighted paging, and extends the recently studied model for learning-augmented paging (Lykouris and Vassilvitskii, 2018) to the weighted setting. For…

    Submitted 9 November, 2021; v1 submitted 17 November, 2020; originally announced November 2020.

  17. arXiv:2010.03082  [pdf, ps, other]

    cs.LG

    Online Linear Optimization with Many Hints

    Authors: Aditya Bhaskara, Ashok Cutkosky, Ravi Kumar, Manish Purohit

    Abstract: We study an online linear optimization (OLO) problem in which the learner is provided access to $K$ "hint" vectors in each round prior to making a decision. In this setting, we devise an algorithm that obtains logarithmic regret whenever there exists a convex combination of the $K$ hints that has positive correlation with the cost vectors. This significantly extends prior work that considered only…

    Submitted 6 October, 2020; originally announced October 2020.

    Comments: Accepted at NeurIPS 2020

  18. arXiv:2008.07788  [pdf, other]

    eess.AS cs.LG

    CinC-GAN for Effective F0 prediction for Whisper-to-Normal Speech Conversion

    Authors: Maitreya Patel, Mirali Purohit, Jui Shah, Hemant A. Patil

    Abstract: Recently, Generative Adversarial Networks (GAN)-based methods have shown remarkable performance for Voice Conversion and WHiSPer-to-normal SPeeCH (WHSP2SPCH) conversion. One of the key challenges in WHSP2SPCH conversion is the prediction of fundamental frequency (F0). Recently, authors have proposed a state-of-the-art method, Cycle-Consistent Generative Adversarial Networks (CycleGAN), for WHSP2SP…

    Submitted 18 August, 2020; originally announced August 2020.

    Comments: Accepted in 28th European Signal Processing Conference (EUSIPCO), 2020

  19. arXiv:2002.04726  [pdf, ps, other]

    cs.LG math.OC stat.ML

    Online Learning with Imperfect Hints

    Authors: Aditya Bhaskara, Ashok Cutkosky, Ravi Kumar, Manish Purohit

    Abstract: We consider a variant of the classical online linear optimization problem in which at every step, the online player receives a "hint" vector before choosing the action for that round. Rather surprisingly, it was shown that if the hint vector is guaranteed to have a positive correlation with the cost vector, then the online player can achieve a regret of $O(\log T)$, thus significantly improving ov…

    Submitted 2 October, 2020; v1 submitted 11 February, 2020; originally announced February 2020.

    Comments: Appeared in ICML 2020

  20. Near Optimal Coflow Scheduling in Networks

    Authors: Mosharaf Chowdhury, Samir Khuller, Manish Purohit, Sheng Yang, Jie You

    Abstract: The coflow scheduling problem has emerged as a popular abstraction in the last few years to study data communication problems within a data center. In this basic framework, each coflow has a set of communication demands and the goal is to schedule many coflows in a manner that minimizes the total weighted completion time. A coflow is said to complete when all its communication needs are met. This…

    Submitted 17 June, 2019; originally announced June 2019.

  21. arXiv:1905.02709  [pdf, other]

    cs.DS

    Hiring Under Uncertainty

    Authors: Manish Raghavan, Manish Purohit, Sreenivas Gollapudi

    Abstract: In this paper we introduce the hiring under uncertainty problem to model the questions faced by hiring committees in large enterprises and universities alike. Given a set of $n$ eligible candidates, the decision maker needs to choose the sequence of candidates to make offers so as to hire the $k$ best candidates. However, candidates may choose to reject an offer (for instance, due to a competing o…

    Submitted 7 May, 2019; originally announced May 2019.

  22. arXiv:1812.00134  [pdf, other]

    cs.DS

    Semi-Online Bipartite Matching

    Authors: Ravi Kumar, Manish Purohit, Aaron Schild, Zoya Svitkina, Erik Vee

    Abstract: In this paper we introduce the semi-online model that generalizes the classical online computational model. The semi-online model postulates that the unknown future has a predictable part and an adversarial part; these parts can be arbitrarily interleaved. An algorithm in this model operates as in the standard online model, i.e., makes an irrevocable decision at each step. We consider bip…

    Submitted 4 September, 2019; v1 submitted 30 November, 2018; originally announced December 2018.

  23. arXiv:1707.04331  [pdf, other]

    cs.DS

    A Tight Approximation for Co-flow Scheduling for Minimizing Total Weighted Completion Time

    Authors: Sungjin Im, Manish Purohit

    Abstract: Co-flows model a modern scheduling setting that is commonly found in a variety of applications in distributed and cloud computing. In co-flow scheduling, there are $m$ input ports and $m$ output ports. Each co-flow $j \in J$ can be represented by a bipartite graph between the input and output ports, where each edge $(i,o)$ with demand $d_{i,o}^j$ means that $d_{i,o}^j$ units of packets must be del…

    Submitted 1 December, 2018; v1 submitted 13 July, 2017; originally announced July 2017.

  24. arXiv:1510.03130  [pdf, other]

    cs.LG

    On Correcting Inputs: Inverse Optimization for Online Structured Prediction

    Authors: Hal Daumé III, Samir Khuller, Manish Purohit, Gregory Sanders

    Abstract: Algorithm designers typically assume that the input data is correct, and then proceed to find "optimal" or "sub-optimal" solutions using this input data. However, this assumption of correct data does not always hold in practice, especially in the context of online learning systems where the objective is to learn appropriate feature weights given some training samples. Such scenarios necessitate the…

    Submitted 11 October, 2015; originally announced October 2015.

    Comments: Conference version to appear in FSTTCS, 2015

  25. arXiv:1507.00662  [pdf, ps, other]

    cs.DS

    On the Approximability of Digraph Ordering

    Authors: Sreyash Kenkre, Vinayaka Pandit, Manish Purohit, Rishi Saket

    Abstract: Given an n-vertex digraph D = (V, A) the Max-k-Ordering problem is to compute a labeling $\ell : V \to [k]$ maximizing the number of forward edges, i.e. edges (u,v) such that $\ell(u) < \ell(v)$. For different values of k, this reduces to Maximum Acyclic Subgraph (k=n), and Max-Dicut (k=2). This work studies the approximability of Max-k-Ordering and its generalizations, motivated by their applic…

    Submitted 2 July, 2015; originally announced July 2015.

    Comments: 21 pages, Conference version to appear in ESA 2015

  26. arXiv:1507.00648  [pdf, other]

    cs.DS

    Approximation Algorithms for Connected Maximum Cut and Related Problems

    Authors: MohammadTaghi Hajiaghayi, Guy Kortsarz, Robert MacDavid, Manish Purohit, Kanthi Sarpatwar

    Abstract: An instance of the Connected Maximum Cut problem consists of an undirected graph G = (V, E) and the goal is to find a subset of vertices $S \subseteq V$ that maximizes the number of edges in the cut δ(S) such that the induced graph G[S] is connected. We present the first non-trivial Ω(1/log n) approximation algorithm for the connected maximum cut problem in general graphs using novel techniques. W…

    Submitted 2 July, 2015; originally announced July 2015.

    Comments: 17 pages, Conference version to appear in ESA 2015

  27. arXiv:1311.2309  [pdf, other]

    cs.DS

    Analyzing the Optimal Neighborhood: Algorithms for Budgeted and Partial Connected Dominating Set Problems

    Authors: Samir Khuller, Manish Purohit, Kanthi Sarpatwar

    Abstract: We study partial and budgeted versions of the well studied connected dominating set problem. In the partial connected dominating set problem, we are given an undirected graph G = (V,E) and an integer n', and the goal is to find a minimum subset of vertices that induces a connected subgraph of G and dominates at least n' vertices. We obtain the first polynomial time algorithm with an $O(\ln \Delta)$ appro…

    Submitted 10 November, 2013; originally announced November 2013.

    Comments: 15 pages, Conference version to appear in ACM-SIAM SODA 2014

  28. arXiv:1301.4958  [pdf, other]

    cs.GT

    Improved algorithms and analysis for the laminar matroid secretary problem

    Authors: David Harris, Manish Purohit

    Abstract: In a matroid secretary problem, one is presented with a sequence of objects of various weights in a random order, and must choose irrevocably to accept or reject each item. There is a further constraint that the set of items selected must form an independent set of an associated matroid. Constant-competitive algorithms (algorithms whose expected solution weight is within a constant factor of the o…

    Submitted 21 January, 2013; originally announced January 2013.