Approximation bounds for hierarchical clustering: average linkage, bisecting k-means, and local search
Hierarchical clustering is a data analysis method that has been used for decades. Despite its widespread use, the method has an underdeveloped analytical foundation. Having a well understood foundation would both support the currently used methods and ...
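For orientation, bisecting k-means (one of the methods named in the title) builds a hierarchy top-down by repeatedly splitting a cluster in two. The sketch below is a generic version using scikit-learn's KMeans; the splitting rule (largest cluster first) and all names are illustrative choices, not the paper's algorithms or analysis.

```python
# Generic bisecting k-means: repeatedly split the largest cluster in two.
import numpy as np
from sklearn.cluster import KMeans

def bisecting_kmeans(X, k, seed=0):
    clusters = [np.arange(len(X))]        # start with one cluster of all points
    while len(clusters) < k:
        # split the cluster with the most points (one common heuristic)
        idx = max(range(len(clusters)), key=lambda i: len(clusters[i]))
        members = clusters.pop(idx)
        labels = KMeans(n_clusters=2, n_init=10, random_state=seed).fit_predict(X[members])
        clusters.append(members[labels == 0])
        clusters.append(members[labels == 1])
    return clusters

X = np.random.default_rng(0).normal(size=(200, 5))
print([len(c) for c in bisecting_kmeans(X, 4)])
```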
The Brier score under administrative censoring: problems and a solution
The Brier score is commonly used for evaluating probability predictions. In survival analysis, with right-censored observations of the event times, this score can be weighted by the inverse probability of censoring (IPCW) to retain its original ...
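For orientation, the IPCW-weighted Brier score at a horizon t is conventionally written as below (Graf et al., 1999), where S-hat(t|x_i) is the predicted survival probability, G-hat estimates the censoring survival function, and (T~_i, delta_i) are the observed time and event indicator. The notation is ours; the paper's analysis of when this weighting breaks down under administrative censoring is not reproduced here.

```latex
\mathrm{BS}(t) \;=\; \frac{1}{n}\sum_{i=1}^{n}\left[
  \frac{\hat S(t \mid x_i)^2 \,\mathbf{1}\{\tilde T_i \le t,\ \delta_i = 1\}}{\hat G(\tilde T_i^{-})}
  \;+\;
  \frac{\bigl(1 - \hat S(t \mid x_i)\bigr)^2 \,\mathbf{1}\{\tilde T_i > t\}}{\hat G(t)}
\right]
```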
Bayesian spiked Laplacian graphs
In network analysis, it is common to work with a collection of graphs that exhibit heterogeneity. For example, neuroimaging data from patient cohorts are increasingly available. A critical analytical task is to identify communities, and graph Laplacian-...
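Graph Laplacian-based methods typically recover communities from the leading eigenvectors of the Laplacian. Below is a generic (non-Bayesian) spectral clustering sketch for a single graph, for orientation only; the paper's Bayesian spiked-Laplacian model for collections of graphs is not reproduced here.

```python
# Generic spectral clustering from the symmetric normalized graph Laplacian.
import numpy as np
from sklearn.cluster import KMeans

def spectral_communities(A, k):
    d = A.sum(axis=1)
    # L = I - D^{-1/2} A D^{-1/2}
    Dinv = np.diag(1.0 / np.sqrt(np.maximum(d, 1e-12)))
    L = np.eye(len(A)) - Dinv @ A @ Dinv
    vals, vecs = np.linalg.eigh(L)
    U = vecs[:, :k]                       # k eigenvectors of smallest eigenvalue
    U = U / np.maximum(np.linalg.norm(U, axis=1, keepdims=True), 1e-12)
    return KMeans(n_clusters=k, n_init=10, random_state=0).fit_predict(U)
```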
Efficient structure-preserving support tensor train machine
An increasing share of collected data takes the form of high-dimensional multi-way arrays (tensors), and it is crucial for efficient learning algorithms to exploit this tensorial structure as much as possible. The ever-present curse of dimensionality for high ...
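The tensor-train format is the standard way such structure is exploited. Below is a textbook TT-SVD sketch (Oseledets, 2011) that factors a d-way array into a train of 3-way cores by successive truncated SVDs; it illustrates the format only and is not the paper's method.

```python
# Generic TT-SVD: factor a d-way array into tensor-train cores.
import numpy as np

def tt_svd(T, max_rank):
    dims = T.shape
    cores, r = [], 1
    M = T.reshape(r * dims[0], -1)
    for k in range(len(dims) - 1):
        U, s, Vt = np.linalg.svd(M, full_matrices=False)
        r_new = min(max_rank, len(s))               # rank truncation
        cores.append(U[:, :r_new].reshape(r, dims[k], r_new))
        M = (s[:r_new, None] * Vt[:r_new]).reshape(r_new * dims[k + 1], -1)
        r = r_new
    cores.append(M.reshape(r, dims[-1], 1))
    return cores

T = np.random.default_rng(0).normal(size=(4, 5, 6))
print([c.shape for c in tt_svd(T, max_rank=10)])   # (1,4,4), (4,5,6), (6,6,1)
```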
Cluster-specific predictions with multi-task Gaussian processes
A model involving Gaussian processes (GPs) is introduced to simultaneously handle multitask learning, clustering, and prediction for multiple functional data. This procedure acts as a model-based clustering method for functional data as well as a ...
AutoKeras: an AutoML library for deep learning
To use deep learning, one needs to be familiar with various software tools like TensorFlow or Keras, as well as various model architectures and optimization best practices. Despite recent progress in software usability, deep learning remains a highly ...
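A minimal usage sketch, assuming the autokeras 1.x API (details such as trial counts and epochs are arbitrary here):

```python
# AutoKeras searches over architectures and hyperparameters automatically.
import autokeras as ak
from tensorflow.keras.datasets import mnist

(x_train, y_train), (x_test, y_test) = mnist.load_data()

clf = ak.ImageClassifier(max_trials=3, overwrite=True)  # try 3 candidate models
clf.fit(x_train, y_train, epochs=5)
print(clf.evaluate(x_test, y_test))
model = clf.export_model()   # best model, exported as a regular Keras model
```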
On distance and kernel measures of conditional dependence
Measuring conditional dependence is one of the important tasks in statistical inference and is fundamental in causal discovery, feature selection, dimensionality reduction, Bayesian network learning, and others. In this work, we explore the connection ...
A relaxed inertial forward-backward-forward algorithm for solving monotone inclusions with application to GANs
We introduce a relaxed inertial forward-backward-forward (RIFBF) splitting algorithm for approaching the set of zeros of the sum of a maximally monotone operator and a single-valued monotone and Lipschitz continuous operator. This work aims to extend ...
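For orientation, one plausible form of the iteration, combining Tseng's forward-backward-forward step with inertia and relaxation (our notation; the exact parameter conditions are what the paper establishes), is:

```latex
y_k = x_k + \alpha_k\,(x_k - x_{k-1}), \qquad
p_k = J_{\gamma A}\bigl(y_k - \gamma B y_k\bigr), \qquad
x_{k+1} = (1-\rho_k)\,y_k + \rho_k\bigl[p_k + \gamma\,(B y_k - B p_k)\bigr]
```

Here J_{\gamma A} is the resolvent of A, the step size satisfies \gamma \in (0, 1/L) for L the Lipschitz constant of B, and the admissible ranges of the inertial parameters \alpha_k and relaxation parameters \rho_k are given in the paper.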
Sampling random graph homomorphisms and applications to network data analysis
A graph homomorphism is a map between two graphs that preserves adjacency relations. We consider the problem of sampling a random graph homomorphism from a graph into a large network. We propose two complementary MCMC algorithms for sampling random graph ...
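As an illustration of the sampling problem (not the paper's two algorithms), a Glauber-style chain picks a vertex of the small graph F and resamples its image among nodes of the network G that are consistent with the images of its F-neighbors:

```python
# Illustrative Glauber-style step on homomorphisms x: F -> G.
# x is a dict from F-nodes to G-nodes, assumed to start at a valid homomorphism.
import random
import networkx as nx

def glauber_step(F, G, x):
    u = random.choice(list(F.nodes))
    # candidates must be adjacent in G to the images of all F-neighbors of u
    candidates = set(G.nodes)
    for v in F.neighbors(u):
        candidates &= set(G.neighbors(x[v]))
    if candidates:                     # otherwise keep the current image
        x[u] = random.choice(sorted(candidates))
    return x
```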
A line-search descent algorithm for strict saddle functions with complexity guarantees
We describe a line-search algorithm that achieves the best-known worst-case complexity results for problems with a certain "strict saddle" property that has been observed to hold in low-rank matrix optimization problems. Our algorithm is adaptive, in ...
Optimal strategies for reject option classifiers
In classification with a reject option, the classifier is allowed to abstain from prediction in uncertain cases. The classical cost-based model of a reject option classifier requires the rejection cost to be defined explicitly. The alternative bounded-...
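The classical cost-based baseline is Chow's rule: with rejection cost c in (0, 1), predict the top class when its posterior exceeds 1 - c, otherwise abstain. A minimal sketch of that baseline (the paper studies optimal strategies beyond it):

```python
import numpy as np

def chow_rule(posteriors, c):
    """posteriors: (n, K) rows of class probabilities; returns -1 for reject."""
    top = posteriors.max(axis=1)
    pred = posteriors.argmax(axis=1)
    return np.where(top >= 1.0 - c, pred, -1)

p = np.array([[0.9, 0.1], [0.55, 0.45]])
print(chow_rule(p, 0.2))   # -> [0, -1]: the uncertain case is rejected
```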
Learning-augmented count-min sketches via Bayesian nonparametrics
The count-min sketch (CMS) is a time- and memory-efficient randomized data structure that provides estimates of token frequencies in a data stream, i.e., point queries, based on randomly hashed data. A learning-augmented version of the CMS, ...
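For orientation, the classical (non-learned) CMS keeps d hash rows of width w; an update increments one cell per row, and a point query takes the minimum over the d cells, which can only overestimate the true count. A minimal sketch:

```python
import random

class CountMinSketch:
    def __init__(self, width, depth, seed=0):
        rng = random.Random(seed)
        self.width, self.depth = width, depth
        self.table = [[0] * width for _ in range(depth)]
        self.salts = [rng.getrandbits(64) for _ in range(depth)]

    def _cell(self, row, token):
        return hash((self.salts[row], token)) % self.width

    def update(self, token, count=1):
        for r in range(self.depth):
            self.table[r][self._cell(r, token)] += count

    def query(self, token):
        # never underestimates: collisions only inflate the counted cells
        return min(self.table[r][self._cell(r, token)] for r in range(self.depth))
```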
Adaptation to the range in K-armed bandits
We consider stochastic bandit problems with K arms, each associated with a distribution supported on a given finite range [m,M]. We do not assume that the range [m,M] is known and show that there is a cost for learning this range. Indeed, a new trade-off ...
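For orientation, when the range [m, M] is known, a standard UCB index simply scales its exploration bonus by the range width (M - m); this is exactly the knowledge whose absence the paper prices. A generic sketch of that known-range baseline, not the paper's adaptive algorithm:

```python
import math

def ucb_known_range(arms, horizon, m, M):
    """arms: list of no-arg callables returning rewards in [m, M]."""
    K = len(arms)
    counts, sums, total = [0] * K, [0.0] * K, 0.0
    for t in range(1, horizon + 1):
        if t <= K:
            a = t - 1                    # play each arm once to initialize
        else:
            a = max(range(K), key=lambda i: sums[i] / counts[i]
                    + (M - m) * math.sqrt(2 * math.log(t) / counts[i]))
        r = arms[a]()
        counts[a] += 1; sums[a] += r; total += r
    return total
```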
Python package for causal discovery based on LiNGAM
Causal discovery is a methodology for learning causal graphs from data, and LiNGAM is a well-known model for causal discovery. This paper describes an open-source Python package for causal discovery based on LiNGAM. The package implements various LiNGAM ...
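A minimal usage sketch, assuming the lingam package's DirectLiNGAM estimator (API as of lingam 1.x; the random data here is a placeholder):

```python
import numpy as np
import lingam

X = np.random.default_rng(0).uniform(size=(1000, 3))  # replace with real data
model = lingam.DirectLiNGAM()
model.fit(X)
print(model.causal_order_)       # estimated causal ordering of the variables
print(model.adjacency_matrix_)   # estimated weighted causal adjacency matrix
```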
Extending adversarial attacks to produce adversarial class probability distributions
Despite the remarkable performance and generalization levels of deep learning models in a wide range of artificial intelligence tasks, it has been demonstrated that these models can be easily fooled by the addition of imperceptible yet malicious ...
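The canonical example of such a perturbation is FGSM, the fast gradient sign method (Goodfellow et al., 2015), sketched below on a fixed logistic-regression model so it stays self-contained; the paper's extension to targeting whole class-probability distributions is not reproduced here.

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def fgsm(x, y, w, b, eps):
    # gradient of the logistic loss w.r.t. the input x is (p - y) * w
    p = sigmoid(x @ w + b)
    grad_x = (p - y) * w
    return x + eps * np.sign(grad_x)   # small eps keeps the change imperceptible

w, b = np.array([2.0, -1.0]), 0.0
x, y = np.array([0.5, 0.2]), 1.0
x_adv = fgsm(x, y, w, b, eps=0.05)
print(sigmoid(x @ w + b), sigmoid(x_adv @ w + b))  # model confidence drops
```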
Globally-consistent rule-based summary-explanations for machine learning models: application to credit-risk evaluation
We develop a method for understanding specific predictions made by (global) predictive models by constructing (local) models tailored to each specific observation (these are also called "explanations" in the literature). Unlike existing work that "...
Learning mean-field games with discounted and average costs
We consider learning approximate Nash equilibria for discrete-time mean-field games with stochastic nonlinear state dynamics subject to both average and discounted costs. To this end, we introduce a mean-field equilibrium (MFE) operator, whose fixed ...
An inertial block majorization minimization framework for nonsmooth nonconvex optimization
In this paper, we introduce TITAN, a novel inerTIal block majorizaTion minimizAtioN framework for nonsmooth nonconvex optimization problems. To the best of our knowledge, TITAN is the first block-coordinate update framework that relies on the ...
Regularized joint mixture models
Regularized regression models are well studied and, under appropriate conditions, offer fast and statistically interpretable results. However, large datasets in many applications are heterogeneous in the sense of harboring distributional differences between ...
Interpolating classifiers make few mistakes
This paper provides elementary analyses of the regret and generalization of minimum-norm interpolating classifiers (MNIC). The MNIC is the function of smallest Reproducing Kernel Hilbert Space norm that perfectly interpolates a label pattern on a finite ...
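Concretely, the minimum-norm interpolant in an RKHS with kernel k is f(x) = k(x, X) alpha with K alpha = y, i.e. "ridgeless" kernel regression. A plain numpy sketch of this object (RBF kernel and all parameters are our illustrative choices):

```python
import numpy as np

def rbf(A, B, gamma=1.0):
    d2 = ((A[:, None, :] - B[None, :, :]) ** 2).sum(-1)
    return np.exp(-gamma * d2)

rng = np.random.default_rng(0)
X, y = rng.normal(size=(30, 2)), rng.choice([-1.0, 1.0], size=30)
alpha = np.linalg.solve(rbf(X, X) + 1e-10 * np.eye(30), y)  # tiny jitter
f = lambda Xnew: rbf(Xnew, X) @ alpha
print(np.allclose(f(X), y, atol=1e-4))   # numerically interpolates the labels
```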
Graph-aided online multi-kernel learning
Multi-kernel learning (MKL) has been widely used in learning problems involving function learning tasks. Compared with the single-kernel learning approach, which relies on a preselected kernel, the advantage of MKL is its flexibility, which results from combining a ...
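To illustrate that flexibility, a generic online MKL scheme hedges over a dictionary of kernel-based predictors with multiplicative weights; the sketch below is a bare-bones version of that idea (the predictors are fixed here for brevity), not the paper's graph-aided algorithm or its guarantees.

```python
import numpy as np

def online_mkl(stream, kernel_predictors, eta=0.5):
    """stream: iterable of (x, y); kernel_predictors: list of callables x -> prediction."""
    w = np.ones(len(kernel_predictors)) / len(kernel_predictors)
    out = []
    for x, y in stream:
        preds = np.array([f(x) for f in kernel_predictors])
        out.append(w @ preds)                    # weighted combination
        w *= np.exp(-eta * (preds - y) ** 2)     # multiplicative-weights update
        w /= w.sum()
    return out
```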
Lower bounds and accelerated algorithms for bilevel optimization
Bilevel optimization has recently attracted growing interest due to its wide applications in modern machine learning problems. Although recent studies have characterized the convergence rate for several such popular algorithms, it is still unclear how ...
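For orientation, when the inner problem is strongly convex, algorithms of this kind typically minimize Phi(x) = f(x, y*(x)) with y*(x) = argmin_y g(x, y), whose gradient follows from the implicit function theorem. This standard hypergradient identity is background, not the paper's contribution:

```latex
\nabla \Phi(x) \;=\; \nabla_x f\bigl(x, y^*(x)\bigr)
\;-\; \nabla^2_{xy}\, g\bigl(x, y^*(x)\bigr)\,
\bigl[\nabla^2_{yy}\, g\bigl(x, y^*(x)\bigr)\bigr]^{-1}
\nabla_y f\bigl(x, y^*(x)\bigr)
```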
Bayesian data selection
Insights into complex, high-dimensional data can be obtained by discovering features of the data that match or do not match a model of interest. To formalize this task, we introduce the "data selection" problem: finding a lower-dimensional statistic--...
Calibrated multiple-output quantile regression with representation learning
We develop a method to generate predictive regions that cover a multivariate response variable with a user-specified probability. Our work is composed of two components. First, we use a deep generative model to learn a representation of the response that ...
Discrete variational calculus for accelerated optimization
Many of the new developments in machine learning are connected with gradient-based optimization methods. Recently, these methods have been studied using a variational perspective (Betancourt et al., 2018). This has opened up the possibility of ...
Generalization bounds for noisy iterative algorithms using properties of additive noise channels
Machine learning models trained by different optimization algorithms under different data distributions can exhibit distinct generalization behaviors. In this paper, we analyze the generalization of models trained by noisy iterative algorithms. We derive ...
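A canonical noisy iterative algorithm of the type analyzed here is gradient descent with additive Gaussian noise (SGLD-style); the sketch below shows the iteration itself, not the paper's bounds.

```python
import numpy as np

def noisy_gd(grad, w0, steps, lr=0.1, sigma=0.01, seed=0):
    rng = np.random.default_rng(seed)
    w = np.array(w0, dtype=float)
    for _ in range(steps):
        # gradient step plus an additive Gaussian "noise channel"
        w = w - lr * grad(w) + sigma * rng.normal(size=w.shape)
    return w

# example: noisy descent on the quadratic loss ||w - 1||^2 / 2
print(noisy_gd(lambda w: w - 1.0, np.zeros(3), steps=200))
```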
The SKIM-FA kernel: high-dimensional variable selection and nonlinear interaction discovery in linear time
Many scientific problems require identifying a small set of covariates that are associated with a target response and estimating their effects. Often, these effects are nonlinear and include interactions, so linear and additive methods can lead to poor ...
Impact of classification difficulty on the weight matrices spectra in deep learning and application to early-stopping
Much recent research effort has been devoted to explaining the success of deep learning. Random Matrix Theory (RMT) provides an emerging approach to this end by analyzing the spectra of large random matrices involved in a trained deep neural network (DNN), such ...
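The raw object such analyses start from is the singular value spectrum of a layer's weight matrix; a minimal sketch (the random matrix here stands in for trained weights):

```python
import numpy as np

W = np.random.default_rng(0).normal(size=(512, 256)) / np.sqrt(256)  # stand-in weights
s = np.linalg.svd(W, compute_uv=False)
eigs = s ** 2                   # eigenvalues of W^T W: the empirical spectrum
print(eigs.min(), eigs.max())   # compare the bulk edge to Marchenko-Pastur predictions
```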
HiClass: a Python library for local hierarchical classification compatible with Scikit-learn
HiClass is an open-source Python library for local hierarchical classification entirely compatible with scikit-learn. It contains implementations of the most common design patterns for hierarchical machine learning models found in the literature, that is,...
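A minimal usage sketch, assuming the hiclass package's LocalClassifierPerNode design pattern (labels are full root-to-leaf paths; data and hierarchy here are toy placeholders):

```python
from sklearn.ensemble import RandomForestClassifier
from hiclass import LocalClassifierPerNode

X = [[1.0, 2.0], [3.0, 4.0], [5.0, 6.0]]
# each label is the full path from the root of the hierarchy to the leaf
y = [["Animal", "Mammal"], ["Animal", "Bird"], ["Plant", "Tree"]]

clf = LocalClassifierPerNode(local_classifier=RandomForestClassifier())
clf.fit(X, y)
print(clf.predict([[2.0, 3.0]]))   # predicted hierarchy path per sample
```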
Attacks against federated learning defense systems and their mitigation
The susceptibility of federated learning (FL) to attacks from untrustworthy endpoints has led to the design of several defense systems. FL defense systems enhance the federated optimization algorithm using anomaly detection, scaling the updates from ...
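One common defense primitive in this space replaces the mean of client updates with a coordinate-wise median, bounding the influence of any single endpoint; the generic sketch below illustrates that primitive only, not the specific systems the paper attacks and hardens.

```python
import numpy as np

def coordinatewise_median(updates):
    """updates: (n_clients, n_params) array of model deltas."""
    return np.median(updates, axis=0)

honest = np.random.default_rng(0).normal(0.0, 0.1, size=(9, 4))
malicious = np.full((1, 4), 100.0)        # a crude poisoning attempt
updates = np.vstack([honest, malicious])
print(coordinatewise_median(updates))     # stays near the honest updates
```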