Skip to main content

Showing 1–25 of 33 results

.
  1. arXiv:2403.11013  [pdf, other

    cs.LG math.ST

    Improved Algorithm and Bounds for Successive Projection

    Authors: Jiashun Jin, Zheng Tracy Ke, Gabriel Moryoussef, Jiajun Tang, Jingming Wang

    Abstract: Given a $K$-vertex simplex in a $d$-dimensional space, suppose we measure $n$ points on the simplex with noise (hence, some of the observed points fall outside the simplex). Vertex hunting is the problem of estimating the $K$ vertices of the simplex. A popular vertex hunting algorithm is successive projection algorithm (SPA). However, SPA is observed to perform unsatisfactorily under strong noise… ▽ More

    Submitted 16 March, 2024; originally announced March 2024.

    Comments: 32 pages, 5 figures

  2. arXiv:2403.10946  [pdf, other

    stat.ML cs.LG

    The Fallacy of Minimizing Local Regret in the Sequential Task Setting

    Authors: Ziping Xu, Kelly W. Zhang, Susan A. Murphy

    Abstract: In the realm of Reinforcement Learning (RL), online RL is often conceptualized as an optimization problem, where an algorithm interacts with an unknown environment to minimize cumulative regret. In a stationary setting, strong theoretical guarantees, like a sublinear ($\sqrt{T}$) regret bound, can be obtained, which typically implies the convergence to an optimal policy and the cessation of explor… ▽ More

    Submitted 16 March, 2024; originally announced March 2024.

  3. arXiv:2403.05911  [pdf, other

    cs.HC cs.AI

    Towards Optimizing Human-Centric Objectives in AI-Assisted Decision-Making With Offline Reinforcement Learning

    Authors: Zana Buçinca, Siddharth Swaroop, Amanda E. Paluch, Susan A. Murphy, Krzysztof Z. Gajos

    Abstract: Imagine if AI decision-support tools not only complemented our ability to make accurate decisions, but also improved our skills, boosted collaboration, and elevated the joy we derive from our tasks. Despite the potential to optimize a broad spectrum of such human-centric objectives, the design of current AI tools remains focused on decision accuracy alone. We propose offline reinforcement learning… ▽ More

    Submitted 14 April, 2024; v1 submitted 9 March, 2024; originally announced March 2024.

  4. arXiv:2402.17739  [pdf, other

    cs.AI cs.LG

    reBandit: Random Effects based Online RL algorithm for Reducing Cannabis Use

    Authors: Susobhan Ghosh, Yongyi Guo, Pei-Yao Hung, Lara Coughlin, Erin Bonar, Inbal Nahum-Shani, Maureen Walton, Susan Murphy

    Abstract: The escalating prevalence of cannabis use, and associated cannabis-use disorder (CUD), poses a significant public health challenge globally. With a notably wide treatment gap, especially among emerging adults (EAs; ages 18-25), addressing cannabis use and CUD remains a pivotal objective within the 2030 United Nations Agenda for Sustainable Development Goals (SDG). In this work, we develop an onlin… ▽ More

    Submitted 27 February, 2024; originally announced February 2024.

  5. arXiv:2402.17003  [pdf, other

    cs.LG cs.AI cs.CY

    Monitoring Fidelity of Online Reinforcement Learning Algorithms in Clinical Trials

    Authors: Anna L. Trella, Kelly W. Zhang, Inbal Nahum-Shani, Vivek Shetty, Iris Yan, Finale Doshi-Velez, Susan A. Murphy

    Abstract: Online reinforcement learning (RL) algorithms offer great potential for personalizing treatment for participants in clinical trials. However, deploying an online, autonomous algorithm in the high-stakes healthcare setting makes quality control and data quality especially difficult to achieve. This paper proposes algorithm fidelity as a critical requirement for deploying online RL algorithms in cli… ▽ More

    Submitted 26 February, 2024; originally announced February 2024.

  6. arXiv:2402.11771  [pdf, other

    cs.LG cs.AI stat.ME stat.ML

    Evaluating the Effectiveness of Index-Based Treatment Allocation

    Authors: Niclas Boehmer, Yash Nair, Sanket Shah, Lucas Janson, Aparna Taneja, Milind Tambe

    Abstract: When resources are scarce, an allocation policy is needed to decide who receives a resource. This problem occurs, for instance, when allocating scarce medical resources and is often solved using modern ML methods. This paper introduces methods to evaluate index-based allocation policies -- that allocate a fixed number of resources to those who need them the most -- by using data from a randomized… ▽ More

    Submitted 18 February, 2024; originally announced February 2024.

  7. arXiv:2402.04933  [pdf, other

    cs.LG stat.AP

    A Bayesian Approach to Online Learning for Contextual Restless Bandits with Applications to Public Health

    Authors: Biyonka Liang, Lily Xu, Aparna Taneja, Milind Tambe, Lucas Janson

    Abstract: Restless multi-armed bandits (RMABs) are used to model sequential resource allocation in public health intervention programs. In these settings, the underlying transition dynamics are often unknown a priori, requiring online reinforcement learning (RL). However, existing methods in online RL for RMABs cannot incorporate properties often present in real-world public health applications, such as con… ▽ More

    Submitted 7 February, 2024; originally announced February 2024.

    Comments: 26 pages, 18 figures

  8. arXiv:2402.03110  [pdf, other

    cs.LG cs.AI

    Non-Stationary Latent Auto-Regressive Bandits

    Authors: Anna L. Trella, Walter Dempsey, Finale Doshi-Velez, Susan A. Murphy

    Abstract: We consider the stochastic multi-armed bandit problem with non-stationary rewards. We present a novel formulation of non-stationarity in the environment where changes in the mean reward of the arms over time are due to some unknown, latent, auto-regressive (AR) state of order $k$. We call this new environment the latent AR bandit. Different forms of the latent AR bandit appear in many real-world s… ▽ More

    Submitted 5 February, 2024; originally announced February 2024.

  9. arXiv:2402.01995  [pdf, other

    cs.LG math.OC

    Online Uniform Risk Times Sampling: First Approximation Algorithms, Learning Augmentation with Full Confidence Interval Integration

    Authors: Xueqing Liu, Kyra Gan, Esmaeil Keyvanshokooh, Susan Murphy

    Abstract: In digital health, the strategy of allocating a limited treatment budget across risk times is crucial to reduce user fatigue. This strategy, however, encounters a significant obstacle due to the unknown actual number of risk times, a factor not adequately addressed by existing methods lacking theoretical guarantees. This paper introduces, for the first time, the online uniform risk times sampling… ▽ More

    Submitted 1 April, 2024; v1 submitted 2 February, 2024; originally announced February 2024.

  10. arXiv:2401.14923  [pdf, other

    cs.AI cs.LG

    Reinforcement Learning Interventions on Boundedly Rational Human Agents in Frictionful Tasks

    Authors: Eura Nofshin, Siddharth Swaroop, Weiwei Pan, Susan Murphy, Finale Doshi-Velez

    Abstract: Many important behavior changes are frictionful; they require individuals to expend effort over a long period with little immediate gratification. Here, an artificial intelligence (AI) agent can provide personalized interventions to help individuals stick to their goals. In these settings, the AI agent must personalize rapidly (before the individual disengages) and interpretably, to help us unders… ▽ More

    Submitted 26 January, 2024; originally announced January 2024.

    Comments: In AAMAS 2024

  11. arXiv:2401.08167  [pdf, other

    math.ST cs.IT cs.SI math.PR stat.ML

    Fundamental limits of community detection from multi-view data: multi-layer, dynamic and partially labeled block models

    Authors: Xiaodong Yang, Buyu Lin, Subhabrata Sen

    Abstract: Multi-view data arises frequently in modern network analysis e.g. relations of multiple types among individuals in social network analysis, longitudinal measurements of interactions among observational units, annotated networks with noisy partial labeling of vertices etc. We study community detection in these disparate settings via a unified theoretical framework, and investigate the fundamental t… ▽ More

    Submitted 16 January, 2024; originally announced January 2024.

    Comments: 75 pages, 9 figures

  12. arXiv:2401.06795  [pdf, other

    cs.CL cs.AI cs.LG

    AI and Generative AI for Research Discovery and Summarization

    Authors: Mark Glickman, Yi Zhang

    Abstract: AI and generative AI tools, including chatbots like ChatGPT that rely on large language models (LLMs), have burst onto the scene this year, creating incredible opportunities to increase work productivity and improve our lives. Statisticians and data scientists have begun experiencing the benefits from the availability of these tools in numerous ways, such as the generation of programming code from… ▽ More

    Submitted 26 March, 2024; v1 submitted 8 January, 2024; originally announced January 2024.

    Comments: 29 pages, 9 figures

  13. arXiv:2401.06383  [pdf, other

    stat.ME

    Decomposition with Monotone B-splines: Fitting and Testing

    Authors: Lijun Wang, Xiaodan Fan, Hongyu Zhao, Jun S. Liu

    Abstract: A univariate continuous function can always be decomposed as the sum of a non-increasing function and a non-decreasing one. Based on this property, we propose a non-parametric regression method that combines two spline-fitted monotone curves. We demonstrate by extensive simulations that, compared to standard spline-fitting methods, the proposed approach is particularly advantageous in high-noise s… ▽ More

    Submitted 9 April, 2024; v1 submitted 12 January, 2024; originally announced January 2024.

  14. Recent Advances in Text Analysis

    Authors: Zheng Tracy Ke, Pengsheng Ji, Jiashun Jin, Wanshan Li

    Abstract: Text analysis is an interesting research area in data science and has various applications, such as in artificial intelligence, biomedical research, and engineering. We review popular methods for text analysis, ranging from topic modeling to the recent neural language models. In particular, we review Topic-SCORE, a statistical approach to topic modeling, and discuss how to use it to analyze MADSta… ▽ More

    Submitted 7 February, 2024; v1 submitted 1 January, 2024; originally announced January 2024.

    Journal ref: Annual Review of Statistics and Its Application 2024 11:1

  15. Ensemble methods for testing a global null

    Authors: Yaowu Liu, Zhonghua Liu, Xihong Lin

    Abstract: Testing a global null is a canonical problem in statistics and has a wide range of applications. In view of the fact that no uniformly most powerful test exists, prior and/or domain knowledge are commonly used to focus on a certain class of alternatives to improve the testing power. However, it is generally challenging to develop tests that are particularly powerful against a certain class of alte… ▽ More

    Submitted 16 October, 2023; originally announced October 2023.

    Journal ref: Journal of the Royal Statistical Society: Series B (Statistical Methodology), 2024

  16. arXiv:2309.16855  [pdf, other

    stat.ME math.ST

    A Variational Spike-and-Slab Approach for Group Variable Selection

    Authors: Buyu Lin, Changhao Ge, Jun S. Liu

    Abstract: We introduce a class of generic spike-and-slab priors for high-dimensional linear regression with grouped variables and present a Coordinate-ascent Variational Inference (CAVI) algorithm for obtaining an optimal variational Bayes approximation. Using parameter expansion for a specific, yet comprehensive, family of slab distributions, we obtain a further gain in computational efficiency. The method… ▽ More

    Submitted 28 September, 2023; originally announced September 2023.

    Comments: 64 pages, 6 figures

  17. arXiv:2309.16843  [pdf, other

    math.ST stat.ME stat.ML

    A Mean Field Approach to Empirical Bayes Estimation in High-dimensional Linear Regression

    Authors: Sumit Mukherjee, Bodhisattva Sen, Subhabrata Sen

    Abstract: We study empirical Bayes estimation in high-dimensional linear regression. To facilitate computationally efficient estimation of the underlying prior, we adopt a variational empirical Bayes approach, introduced originally in Carbonetto and Stephens (2012) and Kim et al. (2022). We establish asymptotic consistency of the nonparametric maximum likelihood estimator (NPMLE) and its (computable) naive… ▽ More

    Submitted 25 October, 2023; v1 submitted 28 September, 2023; originally announced September 2023.

    Comments: 38 pages, 1 figure; Clarified some details in this draft

    MSC Class: 62C12; 62G20; 62J05

  18. arXiv:2309.12584  [pdf, other

    stat.ME stat.AP

    Testing a Large Number of Composite Null Hypotheses Using Conditionally Symmetric Multidimensional Gaussian Mixtures in Genome-Wide Studies

    Authors: Ryan Sun, Zachary McCaw, Xihong Lin

    Abstract: Causal mediation analysis, pleiotropy analysis, and replication analysis are three highly popular genetic study designs. Although these analyses address different scientific questions, the underlying inference problems all involve large-scale testing of composite null hypotheses. The goal is to determine whether all null hypotheses - as opposed to at least one - in a set of individual tests should… ▽ More

    Submitted 21 September, 2023; originally announced September 2023.

  19. arXiv:2309.04002  [pdf, other

    stat.ME

    Total Variation Floodgate for Variable Importance Inference in Classification

    Authors: Wenshuo Wang, Lucas Janson, Lihua Lei, Aaditya Ramdas

    Abstract: Inferring variable importance is the key problem of many scientific studies, where researchers seek to learn the effect of a feature $X$ on the outcome $Y$ in the presence of confounding variables $Z$. Focusing on classification problems, we define the expected total variation (ETV), which is an intuitive and deterministic measure of variable importance that does not rely on any model context. We… ▽ More

    Submitted 7 September, 2023; originally announced September 2023.

  20. arXiv:2308.15370  [pdf, other

    stat.ML cs.LG

    Multi-Response Heteroscedastic Gaussian Process Models and Their Inference

    Authors: Taehee Lee, Jun S. Liu

    Abstract: Despite the widespread utilization of Gaussian process models for versatile nonparametric modeling, they exhibit limitations in effectively capturing abrupt changes in function smoothness and accommodating relationships with heteroscedastic errors. Addressing these shortcomings, the heteroscedastic Gaussian process (HeGP) regression seeks to introduce flexibility by acknowledging the variability o… ▽ More

    Submitted 30 August, 2023; v1 submitted 29 August, 2023; originally announced August 2023.

    Comments: submitted to the Journal of the American Statistical Association (JASA)

  21. arXiv:2308.07843  [pdf, other

    cs.LG stat.AP stat.ML

    Dyadic Reinforcement Learning

    Authors: Shuangning Li, Lluis Salvat Niell, Sung Won Choi, Inbal Nahum-Shani, Guy Shani, Susan Murphy

    Abstract: Mobile health aims to enhance health outcomes by delivering interventions to individuals as they go about their daily life. The involvement of care partners and social support networks often proves crucial in helping individuals managing burdensome medical conditions. This presents opportunities in mobile health to design interventions that target the dyadic relationship -- the relationship betwee… ▽ More

    Submitted 1 November, 2023; v1 submitted 15 August, 2023; originally announced August 2023.

  22. arXiv:2307.13916  [pdf, other

    stat.ML cs.LG

    Online learning in bandits with predicted context

    Authors: Yongyi Guo, Ziping Xu, Susan Murphy

    Abstract: We consider the contextual bandit problem where at each time, the agent only has access to a noisy version of the context and the error variance (or an estimator of this variance). This setting is motivated by a wide range of applications where the true context for decision-making is unobserved, and only a prediction of the context by a potentially complex machine learning algorithm is available.… ▽ More

    Submitted 17 March, 2024; v1 submitted 25 July, 2023; originally announced July 2023.

  23. arXiv:2307.01748  [pdf, other

    stat.ME astro-ph.IM stat.CO

    Monotone Cubic B-Splines with a Neural-Network Generator

    Authors: Lijun Wang, Xiaodan Fan, Huabai Li, Jun S. Liu

    Abstract: We present a method for fitting monotone curves using cubic B-splines, which is equivalent to putting a monotonicity constraint on the coefficients. We explore different ways of enforcing this constraint and analyze their theoretical and empirical properties. We propose two algorithms for solving the spline fitting problem: one that uses standard optimization techniques and one that trains a Multi… ▽ More

    Submitted 17 November, 2023; v1 submitted 4 July, 2023; originally announced July 2023.

  24. arXiv:2306.11208  [pdf, other

    cs.LG cs.AI stat.ML

    The Unintended Consequences of Discount Regularization: Improving Regularization in Certainty Equivalence Reinforcement Learning

    Authors: Sarah Rathnam, Sonali Parbhoo, Weiwei Pan, Susan A. Murphy, Finale Doshi-Velez

    Abstract: Discount regularization, using a shorter planning horizon when calculating the optimal policy, is a popular choice to restrict planning to a less complex set of policies when estimating an MDP from sparse or noisy data (Jiang et al., 2015). It is commonly understood that discount regularization functions by de-emphasizing or ignoring delayed effects. In this paper, we reveal an alternate view of d… ▽ More

    Submitted 19 June, 2023; originally announced June 2023.

  25. arXiv:2306.10983  [pdf, other

    stat.ML cs.LG

    Effect-Invariant Mechanisms for Policy Generalization

    Authors: Sorawit Saengkyongam, Niklas Pfister, Predrag Klasnja, Susan Murphy, Jonas Peters

    Abstract: Policy learning is an important component of many real-world learning systems. A major challenge in policy learning is how to adapt efficiently to unseen environments or tasks. Recently, it has been suggested to exploit invariant conditional distributions to learn models that generalize better to unseen environments. However, assuming invariance of entire conditional distributions (which we call f… ▽ More

    Submitted 27 June, 2023; v1 submitted 19 June, 2023; originally announced June 2023.