<?xml version="1.0" encoding="utf-8" standalone="yes"?><rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Submodular Optimization on Nam Le</title><link>https://blog.namln.org/en/categories/submodular-optimization/</link><description>Recent content in Submodular Optimization on Nam Le</description><generator>Hugo</generator><language>en-US</language><lastBuildDate>Thu, 29 Sep 2022 00:00:00 +0000</lastBuildDate><atom:link href="https://blog.namln.org/en/categories/submodular-optimization/index.xml" rel="self" type="application/rss+xml"/><item><title>Optimization Research Papers in JMLR Volume 23</title><link>https://blog.namln.org/en/mathematics/analysis/optimization/jmlr-v23/</link><pubDate>Thu, 29 Sep 2022 00:00:00 +0000</pubDate><guid>https://blog.namln.org/en/mathematics/analysis/optimization/jmlr-v23/</guid><description>&lt;h1 class="heading" id="optimization-research-papers-in-jmlr-volume-23-2022"&gt;
 Optimization Research Papers in JMLR Volume 23 (2022)&lt;span class="heading__anchor"&gt; &lt;a href="#optimization-research-papers-in-jmlr-volume-23-2022"&gt;#&lt;/a&gt;&lt;/span&gt;
&lt;/h1&gt;&lt;p&gt;This document lists papers from JMLR Volume 23 (2022) that focus on optimization research, categorized by their primary themes. Each paper is numbered starting from 1 within its subsection, with a brief description of its key contributions to optimization theory, algorithms, or applications.&lt;/p&gt;
&lt;h2 class="heading" id="convex-optimization"&gt;
 Convex Optimization&lt;span class="heading__anchor"&gt; &lt;a href="#convex-optimization"&gt;#&lt;/a&gt;&lt;/span&gt;
&lt;/h2&gt;&lt;p&gt;Papers addressing convex optimization problems, including sparse PCA, L1-regularized SVMs, and metric-constrained problems.&lt;/p&gt;
&lt;ol&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Solving Large-Scale Sparse PCA to Certifiable (Near) Optimality&lt;/strong&gt;&lt;br&gt;
&lt;em&gt;Authors&lt;/em&gt;: Dimitris Bertsimas, Ryan Cory-Wright, Jean Pauphilet&lt;br&gt;
&lt;em&gt;Description&lt;/em&gt;: Develops convex optimization techniques for large-scale sparse principal component analysis with certifiable near-optimal solutions.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Novel Min-Max Reformulations of Linear Inverse Problems&lt;/strong&gt;&lt;br&gt;
&lt;em&gt;Authors&lt;/em&gt;: Mohammed Rayyan Sheriff, Debasish Chatterjee&lt;br&gt;
&lt;em&gt;Description&lt;/em&gt;: Proposes min-max reformulations for linear inverse problems using convex optimization frameworks.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;New Insights for the Multivariate Square-Root Lasso&lt;/strong&gt;&lt;br&gt;
&lt;em&gt;Authors&lt;/em&gt;: Aaron J. Molstad&lt;br&gt;
&lt;em&gt;Description&lt;/em&gt;: Analyzes the square-root Lasso in multivariate settings, focusing on its convex optimization properties.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Towards An Efficient Approach for the Nonconvex lp Ball Projection: Algorithm and Analysis&lt;/strong&gt;&lt;br&gt;
&lt;em&gt;Authors&lt;/em&gt;: Xiangyu Yang, Jiashan Wang, Hao Wang&lt;br&gt;
&lt;em&gt;Description&lt;/em&gt;: Develops efficient algorithms for lp ball projection, addressing both convex and nonconvex aspects.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Solving L1-Regularized SVMs and Related Linear Programs: Revisiting the Effectiveness of Column and Constraint Generation&lt;/strong&gt;&lt;br&gt;
&lt;em&gt;Authors&lt;/em&gt;: Antoine Dedieu, Rahul Mazumder, Haoyue Wang&lt;br&gt;
&lt;em&gt;Description&lt;/em&gt;: Investigates L1-regularized SVMs using convex optimization with column and constraint generation.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Extensions to the Proximal Distance Method of Constrained Optimization&lt;/strong&gt;&lt;br&gt;
&lt;em&gt;Authors&lt;/em&gt;: Alfonso Landeros, Oscar Hernan Madrid Padilla, Hua Zhou, Kenneth Lange&lt;br&gt;
&lt;em&gt;Description&lt;/em&gt;: Extends the proximal distance method for constrained convex optimization problems.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Stochastic Subgradient for Composite Convex Optimization with Functional Constraints&lt;/strong&gt;&lt;br&gt;
&lt;em&gt;Authors&lt;/em&gt;: Ion Necoara, Nitesh Kumar Singh&lt;br&gt;
&lt;em&gt;Description&lt;/em&gt;: Analyzes stochastic subgradient methods for composite convex optimization with functional constraints.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;On Regularized Square-Root Regression Problems: Distributionally Robust Interpretation and Fast Computations&lt;/strong&gt;&lt;br&gt;
&lt;em&gt;Authors&lt;/em&gt;: Hong T.M. Chu, Kim-Chuan Toh, Yangjing Zhang&lt;br&gt;
&lt;em&gt;Description&lt;/em&gt;: Studies regularized square-root regression with a distributionally robust perspective and efficient computational methods.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Project and Forget: Solving Large-Scale Metric Constrained Problems&lt;/strong&gt;&lt;br&gt;
&lt;em&gt;Authors&lt;/em&gt;: Rishi Sonthalia, Anna C. Gilbert&lt;br&gt;
&lt;em&gt;Description&lt;/em&gt;: Proposes a convex optimization approach for large-scale metric-constrained problems.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Faster Randomized Interior Point Methods for Tall/Wide Linear Programs&lt;/strong&gt;&lt;br&gt;
&lt;em&gt;Authors&lt;/em&gt;: Agniva Chowdhury, Gregory Dexter, Palma London, Haim Avron, Petros Drineas&lt;br&gt;
&lt;em&gt;Description&lt;/em&gt;: Develops randomized interior point methods for efficient optimization of tall/wide linear programs.&lt;/p&gt;
&lt;/li&gt;
&lt;/ol&gt;
&lt;h2 class="heading" id="nonconvex-optimization"&gt;
 Nonconvex Optimization&lt;span class="heading__anchor"&gt; &lt;a href="#nonconvex-optimization"&gt;#&lt;/a&gt;&lt;/span&gt;
&lt;/h2&gt;&lt;p&gt;Papers tackling nonconvex optimization, focusing on optimality, stability, and convergence in nonsmooth and game settings.&lt;/p&gt;
&lt;ol&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Optimality and Stability in Non-Convex Smooth Games&lt;/strong&gt;&lt;br&gt;
&lt;em&gt;Authors&lt;/em&gt;: Guojun Zhang, Pascal Poupart, Yaoliang Yu&lt;br&gt;
&lt;em&gt;Description&lt;/em&gt;: Analyzes optimality and stability in nonconvex smooth games with convergence guarantees.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Simple and Optimal Stochastic Gradient Methods for Nonsmooth Nonconvex Optimization&lt;/strong&gt;&lt;br&gt;
&lt;em&gt;Authors&lt;/em&gt;: Zhize Li, Jian Li&lt;br&gt;
&lt;em&gt;Description&lt;/em&gt;: Proposes simple and optimal stochastic gradient methods for nonsmooth, nonconvex optimization.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Oracle Complexity in Nonsmooth Nonconvex Optimization&lt;/strong&gt;&lt;br&gt;
&lt;em&gt;Authors&lt;/em&gt;: Guy Kornowski, Ohad Shamir&lt;br&gt;
&lt;em&gt;Description&lt;/em&gt;: Studies the oracle complexity of nonsmooth nonconvex optimization problems.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Distributed Stochastic Gradient Descent: Nonconvexity, Nonsmoothness, and Convergence to Local Minima&lt;/strong&gt;&lt;br&gt;
&lt;em&gt;Authors&lt;/em&gt;: Brian Swenson, Ryan Murray, H. Vincent Poor, Soummya Kar&lt;br&gt;
&lt;em&gt;Description&lt;/em&gt;: Investigates distributed SGD for nonconvex, nonsmooth optimization with convergence to local minima.&lt;/p&gt;
&lt;/li&gt;
&lt;/ol&gt;
&lt;h2 class="heading" id="stochastic-optimization"&gt;
 Stochastic Optimization&lt;span class="heading__anchor"&gt; &lt;a href="#stochastic-optimization"&gt;#&lt;/a&gt;&lt;/span&gt;
&lt;/h2&gt;&lt;p&gt;Papers focusing on stochastic optimization methods, including bundle methods, zeroth-order algorithms, and adaptive techniques.&lt;/p&gt;
&lt;ol&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;A Stochastic Bundle Method for Interpolation&lt;/strong&gt;&lt;br&gt;
&lt;em&gt;Authors&lt;/em&gt;: Alasdair Paren, Leonard Berrada, Rudra P. K. Poudel, M. Pawan Kumar&lt;br&gt;
&lt;em&gt;Description&lt;/em&gt;: Introduces a stochastic bundle method for efficient interpolation in optimization.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;On Biased Stochastic Gradient Estimation&lt;/strong&gt;&lt;br&gt;
&lt;em&gt;Authors&lt;/em&gt;: Derek Driggs, Jingwei Liang, Carola-Bibiane Schönlieb&lt;br&gt;
&lt;em&gt;Description&lt;/em&gt;: Analyzes biases in stochastic gradient estimation and their impact on optimization performance.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Accelerated Zeroth-Order and First-Order Momentum Methods from Mini to Minimax Optimization&lt;/strong&gt;&lt;br&gt;
&lt;em&gt;Authors&lt;/em&gt;: Feihu Huang, Shangqian Gao, Jian Pei, Heng Huang&lt;br&gt;
&lt;em&gt;Description&lt;/em&gt;: Proposes accelerated zeroth-order and first-order momentum methods for a range of optimization problems.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Stochastic Zeroth-Order Optimization under Nonstationarity and Nonconvexity&lt;/strong&gt;&lt;br&gt;
&lt;em&gt;Authors&lt;/em&gt;: Abhishek Roy, Krishnakumar Balasubramanian, Saeed Ghadimi, Prasant Mohapatra&lt;br&gt;
&lt;em&gt;Description&lt;/em&gt;: Studies zeroth-order optimization in nonstationary and nonconvex settings.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Accelerating Adaptive Cubic Regularization of Newton’s Method via Random Sampling&lt;/strong&gt;&lt;br&gt;
&lt;em&gt;Authors&lt;/em&gt;: Xi Chen, Bo Jiang, Tianyi Lin, Shuzhong Zhang&lt;br&gt;
&lt;em&gt;Description&lt;/em&gt;: Enhances Newton’s method with adaptive cubic regularization using random sampling.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;A Momentumized, Adaptive, Dual Averaged Gradient Method&lt;/strong&gt;&lt;br&gt;
&lt;em&gt;Authors&lt;/em&gt;: Aaron Defazio, Samy Jelassi&lt;br&gt;
&lt;em&gt;Description&lt;/em&gt;: Develops a momentum-based adaptive gradient method for stochastic optimization.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Stochastic DCA with Variance Reduction and Applications in Machine Learning&lt;/strong&gt;&lt;br&gt;
&lt;em&gt;Authors&lt;/em&gt;: Hoai An Le Thi, Hoang Phuc Hau Luu, Hoai Minh Le, Tao Pham Dinh&lt;br&gt;
&lt;em&gt;Description&lt;/em&gt;: Introduces a stochastic difference-of-convex-functions algorithm with variance reduction for machine learning.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Robust Distributed Accelerated Stochastic Gradient Methods for Multi-Agent Networks&lt;/strong&gt;&lt;br&gt;
&lt;em&gt;Authors&lt;/em&gt;: Alireza Fallah, Mert Gürbüzbalaban, Asuman Ozdaglar, Umut Şimşekli, Lingjiong Zhu&lt;br&gt;
&lt;em&gt;Description&lt;/em&gt;: Proposes robust stochastic gradient methods for distributed optimization in multi-agent networks.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;On Acceleration for Convex Composite Minimization with Noise-Corrupted Gradients and Approximate Proximal Mapping&lt;/strong&gt;&lt;br&gt;
&lt;em&gt;Authors&lt;/em&gt;: Qiang Zhou, Sinno Jialin Pan&lt;br&gt;
&lt;em&gt;Description&lt;/em&gt;: Addresses acceleration in convex composite minimization with noisy gradients.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Asymptotic Study of Stochastic Adaptive Algorithms in Non-Convex Landscape&lt;/strong&gt;&lt;br&gt;
&lt;em&gt;Authors&lt;/em&gt;: Sébastien Gadat, Ioana Gavra&lt;br&gt;
&lt;em&gt;Description&lt;/em&gt;: Analyzes the asymptotic behavior of stochastic adaptive algorithms in nonconvex settings.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Towards Practical Adam: Non-Convexity, Convergence Theory, and Mini-Batch Acceleration&lt;/strong&gt;&lt;br&gt;
&lt;em&gt;Authors&lt;/em&gt;: Congliang Chen, Li Shen, Fangyu Zou, Wei Liu&lt;br&gt;
&lt;em&gt;Description&lt;/em&gt;: Studies the Adam optimizer, focusing on nonconvexity, convergence, and mini-batch acceleration.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;An Efficient Sampling Algorithm for Non-Smooth Composite Potentials&lt;/strong&gt;&lt;br&gt;
&lt;em&gt;Authors&lt;/em&gt;: Wenlong Mou, Nicolas Flammarion, Martin J. Wainwright, Peter L. Bartlett&lt;br&gt;
&lt;em&gt;Description&lt;/em&gt;: Develops an efficient sampling algorithm for nonsmooth composite potentials in stochastic optimization.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;SGD with Coordinate Sampling: Theory and Practice&lt;/strong&gt;&lt;br&gt;
&lt;em&gt;Authors&lt;/em&gt;: Rémi Leluc, François Portier&lt;br&gt;
&lt;em&gt;Description&lt;/em&gt;: Explores coordinate sampling in stochastic gradient descent with theoretical and practical insights.&lt;/p&gt;
&lt;/li&gt;
&lt;/ol&gt;
&lt;h2 class="heading" id="distributeddecentralized-optimization"&gt;
 Distributed/Decentralized Optimization&lt;span class="heading__anchor"&gt; &lt;a href="#distributeddecentralized-optimization"&gt;#&lt;/a&gt;&lt;/span&gt;
&lt;/h2&gt;&lt;p&gt;Papers addressing distributed or decentralized optimization algorithms, focusing on communication efficiency and convergence.&lt;/p&gt;
&lt;ol&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Asymptotic Network Independence and Step-Size for a Distributed Subgradient Method&lt;/strong&gt;&lt;br&gt;
&lt;em&gt;Authors&lt;/em&gt;: Alex Olshevsky&lt;br&gt;
&lt;em&gt;Description&lt;/em&gt;: Analyzes step-size and convergence for a distributed subgradient optimization method.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Projection-Free Distributed Online Learning with Sublinear Communication Complexity&lt;/strong&gt;&lt;br&gt;
&lt;em&gt;Authors&lt;/em&gt;: Yuanyu Wan, Guanghui Wang, Wei-Wei Tu, Lijun Zhang&lt;br&gt;
&lt;em&gt;Description&lt;/em&gt;: Develops projection-free algorithms for distributed online learning with reduced communication complexity.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Variance Reduced EXTRA and DIGing and Their Optimal Acceleration for Strongly Convex Decentralized Optimization&lt;/strong&gt;&lt;br&gt;
&lt;em&gt;Authors&lt;/em&gt;: Huan Li, Zhouchen Lin, Yongchun Fang&lt;br&gt;
&lt;em&gt;Description&lt;/em&gt;: Proposes variance-reduced methods for decentralized optimization with optimal acceleration.&lt;/p&gt;
&lt;/li&gt;
&lt;/ol&gt;
&lt;h2 class="heading" id="submodular-optimization"&gt;
 Submodular Optimization&lt;span class="heading__anchor"&gt; &lt;a href="#submodular-optimization"&gt;#&lt;/a&gt;&lt;/span&gt;
&lt;/h2&gt;&lt;p&gt;Papers focusing on submodular optimization, particularly in model selection.&lt;/p&gt;
&lt;ol&gt;
&lt;li&gt;&lt;strong&gt;Joint Continuous and Discrete Model Selection via Submodularity&lt;/strong&gt;&lt;br&gt;
&lt;em&gt;Authors&lt;/em&gt;: Jonathan Bunton, Paulo Tabuada&lt;br&gt;
&lt;em&gt;Description&lt;/em&gt;: Uses submodularity for joint continuous and discrete model selection in optimization.&lt;/li&gt;
&lt;/ol&gt;
&lt;h2 class="heading" id="bandits-and-online-learning"&gt;
 Bandits and Online Learning&lt;span class="heading__anchor"&gt; &lt;a href="#bandits-and-online-learning"&gt;#&lt;/a&gt;&lt;/span&gt;
&lt;/h2&gt;&lt;p&gt;Papers addressing multi-armed bandits, online optimization, and regret minimization.&lt;/p&gt;
&lt;ol&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Multi-Agent Online Optimization with Delays: Asynchronicity, Adaptivity, and Optimism&lt;/strong&gt;&lt;br&gt;
&lt;em&gt;Authors&lt;/em&gt;: Yu-Guan Hsieh, Franck Iutzeler, Jérôme Malick, Panayotis Mertikopoulos&lt;br&gt;
&lt;em&gt;Description&lt;/em&gt;: Studies multi-agent online optimization with delays, focusing on asynchronicity and optimism.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Online Mirror Descent and Dual Averaging: Keeping Pace in the Dynamic Case&lt;/strong&gt;&lt;br&gt;
&lt;em&gt;Authors&lt;/em&gt;: Huang Fang, Nicholas J. A. Harvey, Victor S. Portella, Michael P. Friedlander&lt;br&gt;
&lt;em&gt;Description&lt;/em&gt;: Analyzes online mirror descent and dual averaging for dynamic online optimization.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;No Weighted-Regret Learning in Adversarial Bandits with Delays&lt;/strong&gt;&lt;br&gt;
&lt;em&gt;Authors&lt;/em&gt;: Ilai Bistritz, Zhengyuan Zhou, Xi Chen, Nicholas Bambos, Jose Blanchet&lt;br&gt;
&lt;em&gt;Description&lt;/em&gt;: Investigates regret minimization in adversarial bandits with delays.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;KL-UCB-Switch: Optimal Regret Bounds for Stochastic Bandits from Both a Distribution-Dependent and a Distribution-Free Viewpoints&lt;/strong&gt;&lt;br&gt;
&lt;em&gt;Authors&lt;/em&gt;: Aurélien Garivier, Hédi Hadiji, Pierre Ménard, Gilles Stoltz&lt;br&gt;
&lt;em&gt;Description&lt;/em&gt;: Provides optimal regret bounds for stochastic bandits using KL-UCB-Switch.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Multi-Agent Multi-Armed Bandits with Limited Communication&lt;/strong&gt;&lt;br&gt;
&lt;em&gt;Authors&lt;/em&gt;: Mridul Agarwal, Vaneet Aggarwal, Kamyar Azizzadenesheli&lt;br&gt;
&lt;em&gt;Description&lt;/em&gt;: Explores multi-agent bandits with limited communication, focusing on regret minimization.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Nonstochastic Bandits with Composite Anonymous Feedback&lt;/strong&gt;&lt;br&gt;
&lt;em&gt;Authors&lt;/em&gt;: Nicolò Cesa-Bianchi, Tommaso Cesari, Roberto Colomboni, Claudio Gentile, Yishay Mansour&lt;br&gt;
&lt;em&gt;Description&lt;/em&gt;: Studies nonstochastic bandits with composite feedback, analyzing regret and optimization.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Expected Regret and Pseudo-Regret are Equivalent When the Optimal Arm is Unique&lt;/strong&gt;&lt;br&gt;
&lt;em&gt;Authors&lt;/em&gt;: Daron Anderson, Douglas J. Leith&lt;br&gt;
&lt;em&gt;Description&lt;/em&gt;: Proves equivalence of expected regret and pseudo-regret in specific bandit settings.&lt;/p&gt;
&lt;/li&gt;
&lt;/ol&gt;
&lt;h2 class="heading" id="bayesian-and-hyperparameter-optimization"&gt;
 Bayesian and Hyperparameter Optimization&lt;span class="heading__anchor"&gt; &lt;a href="#bayesian-and-hyperparameter-optimization"&gt;#&lt;/a&gt;&lt;/span&gt;
&lt;/h2&gt;&lt;p&gt;Papers addressing Bayesian optimization and hyperparameter tuning for efficient optimization.&lt;/p&gt;
&lt;ol&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;SMAC3: A Versatile Bayesian Optimization Package for Hyperparameter Optimization&lt;/strong&gt;&lt;br&gt;
&lt;em&gt;Authors&lt;/em&gt;: Marius Lindauer, Katharina Eggensperger, Matthias Feurer, André Biedenkapp, Difan Deng, Carolin Benjamins, Tim Ruhkopf, René Sass, Frank Hutter&lt;br&gt;
&lt;em&gt;Description&lt;/em&gt;: Presents SMAC3, a versatile Bayesian optimization package for hyperparameter tuning.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Implicit Differentiation for Fast Hyperparameter Selection in Non-Smooth Convex Learning&lt;/strong&gt;&lt;br&gt;
&lt;em&gt;Authors&lt;/em&gt;: Quentin Bertrand, Quentin Klopfenstein, Mathurin Massias, Mathieu Blondel, Samuel Vaiter, Alexandre Gramfort, Joseph Salmon&lt;br&gt;
&lt;em&gt;Description&lt;/em&gt;: Uses implicit differentiation for efficient hyperparameter selection in nonsmooth convex optimization.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Auto-Sklearn 2.0: Hands-Free AutoML via Meta-Learning&lt;/strong&gt;&lt;br&gt;
&lt;em&gt;Authors&lt;/em&gt;: Matthias Feurer, Katharina Eggensperger, Stefan Falkner, Marius Lindauer, Frank Hutter&lt;br&gt;
&lt;em&gt;Description&lt;/em&gt;: Introduces Auto-Sklearn 2.0, leveraging meta-learning for automated hyperparameter optimization.&lt;/p&gt;
&lt;/li&gt;
&lt;/ol&gt;
&lt;h2 class="heading" id="optimization-in-reinforcement-learning"&gt;
 Optimization in Reinforcement Learning&lt;span class="heading__anchor"&gt; &lt;a href="#optimization-in-reinforcement-learning"&gt;#&lt;/a&gt;&lt;/span&gt;
&lt;/h2&gt;&lt;p&gt;Papers focusing on optimization techniques for reinforcement learning, including policy gradient and value estimation.&lt;/p&gt;
&lt;ol&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;A Generalized Projected Bellman Error for Off-Policy Value Estimation in Reinforcement Learning&lt;/strong&gt;&lt;br&gt;
&lt;em&gt;Authors&lt;/em&gt;: Andrew Patterson, Adam White, Martha White&lt;br&gt;
&lt;em&gt;Description&lt;/em&gt;: Develops optimization methods for off-policy value estimation using a generalized projected Bellman error.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Greedification Operators for Policy Optimization: Investigating Forward and Reverse KL Divergences&lt;/strong&gt;&lt;br&gt;
&lt;em&gt;Authors&lt;/em&gt;: Alan Chan, Hugo Silva, Sungsu Lim, Tadashi Kozuno, A. Rupam Mahmood, Martha White&lt;br&gt;
&lt;em&gt;Description&lt;/em&gt;: Investigates greedification operators for policy optimization, focusing on KL divergences.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Policy Gradient and Actor-Critic Learning in Continuous Time and Space: Theory and Algorithms&lt;/strong&gt;&lt;br&gt;
&lt;em&gt;Authors&lt;/em&gt;: Yanwei Jia, Xun Yu Zhou&lt;br&gt;
&lt;em&gt;Description&lt;/em&gt;: Analyzes policy gradient and actor-critic methods for continuous-time RL optimization.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;On the Convergence Rates of Policy Gradient Methods&lt;/strong&gt;&lt;br&gt;
&lt;em&gt;Authors&lt;/em&gt;: Lin Xiao&lt;br&gt;
&lt;em&gt;Description&lt;/em&gt;: Studies convergence rates of policy gradient methods in reinforcement learning.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Global Optimality and Finite Sample Analysis of Softmax Off-Policy Actor-Critic under State Distribution Mismatch&lt;/strong&gt;&lt;br&gt;
&lt;em&gt;Authors&lt;/em&gt;: Shangtong Zhang, Remi Tachet des Combes, Romain Laroche&lt;br&gt;
&lt;em&gt;Description&lt;/em&gt;: Examines global optimality in softmax off-policy actor-critic methods under distribution mismatch.&lt;/p&gt;
&lt;/li&gt;
&lt;/ol&gt;
&lt;h2 class="heading" id="other-optimization-topics"&gt;
 Other Optimization Topics&lt;span class="heading__anchor"&gt; &lt;a href="#other-optimization-topics"&gt;#&lt;/a&gt;&lt;/span&gt;
&lt;/h2&gt;&lt;p&gt;Papers covering miscellaneous optimization topics, including proximal algorithms, tensor completion, and learning-to-optimize frameworks.&lt;/p&gt;
&lt;ol&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;TFPnP: Tuning-Free Plug-and-Play Proximal Algorithms with Applications to Inverse Imaging Problems&lt;/strong&gt;&lt;br&gt;
&lt;em&gt;Authors&lt;/em&gt;: Kaixuan Wei, Angelica Aviles-Rivero, Jingwei Liang, Ying Fu, Hua Huang, Carola-Bibiane Schönlieb&lt;br&gt;
&lt;em&gt;Description&lt;/em&gt;: Introduces tuning-free proximal algorithms for inverse imaging problems.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;On the Complexity of Approximating Multimarginal Optimal Transport&lt;/strong&gt;&lt;br&gt;
&lt;em&gt;Authors&lt;/em&gt;: Tianyi Lin, Nhat Ho, Marco Cuturi, Michael I. Jordan&lt;br&gt;
&lt;em&gt;Description&lt;/em&gt;: Analyzes the complexity of approximating multimarginal optimal transport problems.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Riemannian Stochastic Proximal Gradient Methods for Nonsmooth Optimization over the Stiefel Manifold&lt;/strong&gt;&lt;br&gt;
&lt;em&gt;Authors&lt;/em&gt;: Bokun Wang, Shiqian Ma, Lingzhou Xue&lt;br&gt;
&lt;em&gt;Description&lt;/em&gt;: Proposes stochastic proximal gradient methods for nonsmooth optimization on the Stiefel manifold.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Provable Tensor-Train Format Tensor Completion by Riemannian Optimization&lt;/strong&gt;&lt;br&gt;
&lt;em&gt;Authors&lt;/em&gt;: Jian-Feng Cai, Jingyang Li, Dong Xia&lt;br&gt;
&lt;em&gt;Description&lt;/em&gt;: Develops Riemannian optimization for tensor-train format tensor completion.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Let’s Make Block Coordinate Descent Converge Faster: Faster Greedy Rules, Message-Passing, Active-Set Complexity, and Superlinear Convergence&lt;/strong&gt;&lt;br&gt;
&lt;em&gt;Authors&lt;/em&gt;: Julie Nutini, Issam Laradji, Mark Schmidt&lt;br&gt;
&lt;em&gt;Description&lt;/em&gt;: Enhances block coordinate descent with faster convergence techniques.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;On the Efficiency of Entropic Regularized Algorithms for Optimal Transport&lt;/strong&gt;&lt;br&gt;
&lt;em&gt;Authors&lt;/em&gt;: Tianyi Lin, Nhat Ho, Michael I. Jordan&lt;br&gt;
&lt;em&gt;Description&lt;/em&gt;: Studies entropic regularization for efficient optimal transport algorithms.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Explicit Convergence Rates of Greedy and Random Quasi-Newton Methods&lt;/strong&gt;&lt;br&gt;
&lt;em&gt;Authors&lt;/em&gt;: Dachao Lin, Haishan Ye, Zhihua Zhang&lt;br&gt;
&lt;em&gt;Description&lt;/em&gt;: Provides explicit convergence rates for greedy and random quasi-Newton methods.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Scaling and Scalability: Provable Nonconvex Low-Rank Tensor Estimation from Incomplete Measurements&lt;/strong&gt;&lt;br&gt;
&lt;em&gt;Authors&lt;/em&gt;: Tian Tong, Cong Ma, Ashley Prater-Bennette, Erin Tripp, Yuejie Chi&lt;br&gt;
&lt;em&gt;Description&lt;/em&gt;: Addresses nonconvex low-rank tensor estimation with provable guarantees.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Learning to Optimize: A Primer and A Benchmark&lt;/strong&gt;&lt;br&gt;
&lt;em&gt;Authors&lt;/em&gt;: Tianlong Chen, Xiaohan Chen, Wuyang Chen, Howard Heaton, Jialin Liu, Zhangyang Wang, Wotao Yin&lt;br&gt;
&lt;em&gt;Description&lt;/em&gt;: Provides a primer and benchmark for learning-to-optimize techniques.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Clustering with Semidefinite Programming and Fixed Point Iteration&lt;/strong&gt;&lt;br&gt;
&lt;em&gt;Authors&lt;/em&gt;: Pedro Felzenszwalb, Caroline Klivans, Alice Paul&lt;br&gt;
&lt;em&gt;Description&lt;/em&gt;: Uses semidefinite programming and fixed-point iteration for clustering optimization.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;A Bregman Learning Framework for Sparse Neural Networks&lt;/strong&gt;&lt;br&gt;
&lt;em&gt;Authors&lt;/em&gt;: Leon Bungert, Tim Roith, Daniel Tenbrinck, Martin Burger&lt;br&gt;
&lt;em&gt;Description&lt;/em&gt;: Introduces a Bregman learning framework for optimizing sparse neural networks.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;When is the Convergence Time of Langevin Algorithms Dimension Independent? A Composite Optimization Viewpoint&lt;/strong&gt;&lt;br&gt;
&lt;em&gt;Authors&lt;/em&gt;: Yoav Freund, Yi-An Ma, Tong Zhang&lt;br&gt;
&lt;em&gt;Description&lt;/em&gt;: Analyzes dimension-independent convergence of Langevin algorithms from a composite optimization perspective.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Sparse Continuous Distributions and Fenchel-Young Losses&lt;/strong&gt;&lt;br&gt;
&lt;em&gt;Authors&lt;/em&gt;: André F. T. Martins, Marcos Treviso, António Farinhas, Pedro M. Q. Aguiar, Mário A. T. Figueiredo, Mathieu Blondel, Vlad Niculae&lt;br&gt;
&lt;em&gt;Description&lt;/em&gt;: Explores sparse continuous distributions using Fenchel-Young losses for optimization.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Handling Hard Affine SDP Shape Constraints in RKHSs&lt;/strong&gt;&lt;br&gt;
&lt;em&gt;Authors&lt;/em&gt;: Pierre-Cyril Aubin-Frankowski, Zoltan Szabo&lt;br&gt;
&lt;em&gt;Description&lt;/em&gt;: Addresses affine SDP constraints in reproducing kernel Hilbert spaces for optimization.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;OMLT: Optimization &amp;amp; Machine Learning Toolkit&lt;/strong&gt;&lt;br&gt;
&lt;em&gt;Authors&lt;/em&gt;: Francesco Ceccon, Jordan Jalving, Joshua Haddad, Alexander Thebelt, Calvin Tsay, Carl D Laird, Ruth Misener&lt;br&gt;
&lt;em&gt;Description&lt;/em&gt;: Presents OMLT, a toolkit integrating optimization and machine learning techniques.&lt;/p&gt;
&lt;/li&gt;
&lt;/ol&gt;</description></item><item><title>Optimization Research Papers in JMLR Volume 22</title><link>https://blog.namln.org/en/mathematics/analysis/optimization/jmlr-v22/</link><pubDate>Wed, 29 Sep 2021 00:00:00 +0000</pubDate><guid>https://blog.namln.org/en/mathematics/analysis/optimization/jmlr-v22/</guid><description>&lt;h1 class="heading" id="optimization-research-papers-in-jmlr-volume-22-2021"&gt;
 Optimization Research Papers in JMLR Volume 22 (2021)&lt;span class="heading__anchor"&gt; &lt;a href="#optimization-research-papers-in-jmlr-volume-22-2021"&gt;#&lt;/a&gt;&lt;/span&gt;
&lt;/h1&gt;&lt;p&gt;This document lists papers from JMLR Volume 22 (2021) that focus on optimization research, categorized by their primary themes. Each paper is numbered starting from 1 within its subsection, with a brief description of its key contributions to optimization theory, algorithms, or applications.&lt;/p&gt;
&lt;h2 class="heading" id="convex-optimization"&gt;
 Convex Optimization&lt;span class="heading__anchor"&gt; &lt;a href="#convex-optimization"&gt;#&lt;/a&gt;&lt;/span&gt;
&lt;/h2&gt;&lt;p&gt;Papers addressing convex optimization problems, including clustering, Wasserstein barycenters, sparse optimization, and bandits.&lt;/p&gt;
&lt;ol&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Convex Clustering: Model, Theoretical Guarantee and Efficient Algorithm&lt;/strong&gt;&lt;br&gt;
&lt;em&gt;Authors&lt;/em&gt;: Defeng Sun, Kim-Chuan Toh, Yancheng Yuan&lt;br&gt;
&lt;em&gt;Description&lt;/em&gt;: Proposes a convex clustering model with theoretical guarantees and an efficient algorithm.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;A Fast Globally Linearly Convergent Algorithm for the Computation of Wasserstein Barycenters&lt;/strong&gt;&lt;br&gt;
&lt;em&gt;Authors&lt;/em&gt;: Lei Yang, Jia Li, Defeng Sun, Kim-Chuan Toh&lt;br&gt;
&lt;em&gt;Description&lt;/em&gt;: Develops a fast, globally linearly convergent algorithm for computing Wasserstein barycenters.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Wasserstein Barycenters Can Be Computed in Polynomial Time in Fixed Dimension&lt;/strong&gt;&lt;br&gt;
&lt;em&gt;Authors&lt;/em&gt;: Jason M. Altschuler, Enric Boix-Adsera&lt;br&gt;
&lt;em&gt;Description&lt;/em&gt;: Demonstrates that Wasserstein barycenters can be computed in polynomial time for fixed dimensions.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;From Low Probability to High Confidence in Stochastic Convex Optimization&lt;/strong&gt;&lt;br&gt;
&lt;em&gt;Authors&lt;/em&gt;: Damek Davis, Dmitriy Drusvyatskiy, Lin Xiao, Junyu Zhang&lt;br&gt;
&lt;em&gt;Description&lt;/em&gt;: Analyzes methods to achieve high-confidence solutions in stochastic convex optimization.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Sparse and Smooth Signal Estimation: Convexification of L0-Formulations&lt;/strong&gt;&lt;br&gt;
&lt;em&gt;Authors&lt;/em&gt;: Alper Atamturk, Andres Gomez, Shaoning Han&lt;br&gt;
&lt;em&gt;Description&lt;/em&gt;: Proposes convexification techniques for L0-formulations in sparse and smooth signal estimation.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Stochastic Proximal AUC Maximization&lt;/strong&gt;&lt;br&gt;
&lt;em&gt;Authors&lt;/em&gt;: Yunwen Lei, Yiming Ying&lt;br&gt;
&lt;em&gt;Description&lt;/em&gt;: Develops stochastic proximal methods for maximizing the area under the ROC curve (AUC) in convex settings.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Sparse Convex Optimization via Adaptively Regularized Hard Thresholding&lt;/strong&gt;&lt;br&gt;
&lt;em&gt;Authors&lt;/em&gt;: Kyriakos Axiotis, Maxim Sviridenko&lt;br&gt;
&lt;em&gt;Description&lt;/em&gt;: Introduces adaptively regularized hard thresholding for sparse convex optimization.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Learning Sparse Classifiers: Continuous and Mixed Integer Optimization Perspectives&lt;/strong&gt;&lt;br&gt;
&lt;em&gt;Authors&lt;/em&gt;: Antoine Dedieu, Hussein Hazimeh, Rahul Mazumder&lt;br&gt;
&lt;em&gt;Description&lt;/em&gt;: Explores continuous and mixed-integer optimization approaches for learning sparse classifiers.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;First-Order Convergence Theory for Weakly-Convex-Weakly-Concave Min-max Problems&lt;/strong&gt;&lt;br&gt;
&lt;em&gt;Authors&lt;/em&gt;: Mingrui Liu, Hassan Rafique, Qihang Lin, Tianbao Yang&lt;br&gt;
&lt;em&gt;Description&lt;/em&gt;: Provides first-order convergence theory for weakly convex-weakly concave min-max problems.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Convex Geometry and Duality of Over-parameterized Neural Networks&lt;/strong&gt;&lt;br&gt;
&lt;em&gt;Authors&lt;/em&gt;: Tolga Ergen, Mert Pilanci&lt;br&gt;
&lt;em&gt;Description&lt;/em&gt;: Analyzes convex geometry and duality in over-parameterized neural networks.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Linear Bandits on Uniformly Convex Sets&lt;/strong&gt;&lt;br&gt;
&lt;em&gt;Authors&lt;/em&gt;: Thomas Kerdreux, Christophe Roux, Alexandre d&amp;rsquo;Aspremont, Sebastian Pokutta&lt;br&gt;
&lt;em&gt;Description&lt;/em&gt;: Studies linear bandits on uniformly convex sets, focusing on convex optimization techniques.&lt;/p&gt;
&lt;/li&gt;
&lt;/ol&gt;
&lt;h2 class="heading" id="nonconvex-optimization"&gt;
 Nonconvex Optimization&lt;span class="heading__anchor"&gt; &lt;a href="#nonconvex-optimization"&gt;#&lt;/a&gt;&lt;/span&gt;
&lt;/h2&gt;&lt;p&gt;Papers tackling nonconvex optimization, including stochastic gradient descent, neural network training, and stability properties.&lt;/p&gt;
&lt;ol&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Online Stochastic Gradient Descent on Non-Convex Losses from High-Dimensional Inference&lt;/strong&gt;&lt;br&gt;
&lt;em&gt;Authors&lt;/em&gt;: Gerard Ben Arous, Reza Gheissari, Aukosh Jagannath&lt;br&gt;
&lt;em&gt;Description&lt;/em&gt;: Analyzes online stochastic gradient descent for nonconvex losses in high-dimensional inference.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Non-attracting Regions of Local Minima in Deep and Wide Neural Networks&lt;/strong&gt;&lt;br&gt;
&lt;em&gt;Authors&lt;/em&gt;: Henning Petzka, Cristian Sminchisescu&lt;br&gt;
&lt;em&gt;Description&lt;/em&gt;: Investigates non-attracting regions of local minima in deep and wide neural networks.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;When Does Gradient Descent with Logistic Loss Find Interpolating Two-Layer Networks?&lt;/strong&gt;&lt;br&gt;
&lt;em&gt;Authors&lt;/em&gt;: Niladri S. Chatterji, Philip M. Long, Peter L. Bartlett&lt;br&gt;
&lt;em&gt;Description&lt;/em&gt;: Examines conditions under which gradient descent with logistic loss finds interpolating two-layer networks.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Replica Exchange for Non-Convex Optimization&lt;/strong&gt;&lt;br&gt;
&lt;em&gt;Authors&lt;/em&gt;: Jing Dong, Xin T. Tong&lt;br&gt;
&lt;em&gt;Description&lt;/em&gt;: Proposes replica exchange methods for nonconvex optimization problems.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Failures of Model-Dependent Generalization Bounds for Least-Norm Interpolation&lt;/strong&gt;&lt;br&gt;
&lt;em&gt;Authors&lt;/em&gt;: Peter L. Bartlett, Philip M. Long&lt;br&gt;
&lt;em&gt;Description&lt;/em&gt;: Analyzes limitations of model-dependent generalization bounds in least-norm interpolation.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;On the Stability Properties and the Optimization Landscape of Training Problems with Squared Loss for Neural Networks and General Nonlinear Conic Approximation Schemes&lt;/strong&gt;&lt;br&gt;
&lt;em&gt;Authors&lt;/em&gt;: Constantin Christof&lt;br&gt;
&lt;em&gt;Description&lt;/em&gt;: Studies stability and optimization landscapes for neural network training with squared loss.&lt;/p&gt;
&lt;/li&gt;
&lt;/ol&gt;
&lt;h2 class="heading" id="stochastic-optimization"&gt;
 Stochastic Optimization&lt;span class="heading__anchor"&gt; &lt;a href="#stochastic-optimization"&gt;#&lt;/a&gt;&lt;/span&gt;
&lt;/h2&gt;&lt;p&gt;Papers focusing on stochastic optimization methods, including momentum, Langevin dynamics, and communication-efficient algorithms.&lt;/p&gt;
&lt;ol&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Continuous Time Analysis of Momentum Methods&lt;/strong&gt;&lt;br&gt;
&lt;em&gt;Authors&lt;/em&gt;: Nikola B. Kovachki, Andrew M. Stuart&lt;br&gt;
&lt;em&gt;Description&lt;/em&gt;: Provides a continuous-time analysis of momentum methods in stochastic optimization.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Generalization Performance of Multi-pass Stochastic Gradient Descent with Convex Loss Functions&lt;/strong&gt;&lt;br&gt;
&lt;em&gt;Authors&lt;/em&gt;: Yunwen Lei, Ting Hu, Ke Tang&lt;br&gt;
&lt;em&gt;Description&lt;/em&gt;: Analyzes generalization performance of multi-pass stochastic gradient descent for convex losses.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;High-Order Langevin Diffusion Yields an Accelerated MCMC Algorithm&lt;/strong&gt;&lt;br&gt;
&lt;em&gt;Authors&lt;/em&gt;: Wenlong Mou, Yi-An Ma, Martin J. Wainwright, Peter L. Bartlett, Michael I. Jordan&lt;br&gt;
&lt;em&gt;Description&lt;/em&gt;: Develops an accelerated MCMC algorithm using high-order Langevin diffusion.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Path Length Bounds for Gradient Descent and Flow&lt;/strong&gt;&lt;br&gt;
&lt;em&gt;Authors&lt;/em&gt;: Chirag Gupta, Sivaraman Balakrishnan, Aaditya Ramdas&lt;br&gt;
&lt;em&gt;Description&lt;/em&gt;: Establishes path length bounds for gradient descent and flow in stochastic optimization.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Optimization with Momentum: Dynamical, Control-Theoretic, and Symplectic Perspectives&lt;/strong&gt;&lt;br&gt;
&lt;em&gt;Authors&lt;/em&gt;: Michael Muehlebach, Michael I. Jordan&lt;br&gt;
&lt;em&gt;Description&lt;/em&gt;: Analyzes momentum-based optimization from dynamical, control-theoretic, and symplectic perspectives.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;L-SVRG and L-Katyusha with Arbitrary Sampling&lt;/strong&gt;&lt;br&gt;
&lt;em&gt;Authors&lt;/em&gt;: Xun Qian, Zheng Qu, Peter Richtárik&lt;br&gt;
&lt;em&gt;Description&lt;/em&gt;: Introduces L-SVRG and L-Katyusha algorithms with arbitrary sampling for stochastic optimization.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;A Lyapunov Analysis of Accelerated Methods in Optimization&lt;/strong&gt;&lt;br&gt;
&lt;em&gt;Authors&lt;/em&gt;: Ashia C. Wilson, Ben Recht, Michael I. Jordan&lt;br&gt;
&lt;em&gt;Description&lt;/em&gt;: Provides a Lyapunov analysis for accelerated optimization methods.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;NUQSGD: Provably Communication-Efficient Data-Parallel SGD via Nonuniform Quantization&lt;/strong&gt;&lt;br&gt;
&lt;em&gt;Authors&lt;/em&gt;: Ali Ramezani-Kebrya, Fartash Faghri, Ilya Markov, Vitalii Aksenov, Dan Alistarh, Daniel M. Roy&lt;br&gt;
&lt;em&gt;Description&lt;/em&gt;: Proposes NUQSGD, a communication-efficient stochastic gradient descent method using nonuniform quantization.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;An Inertial Newton Algorithm for Deep Learning&lt;/strong&gt;&lt;br&gt;
&lt;em&gt;Authors&lt;/em&gt;: Camille Castera, Jérôme Bolte, Cédric Févotte, Edouard Pauwels&lt;br&gt;
&lt;em&gt;Description&lt;/em&gt;: Develops an inertial Newton algorithm for deep learning optimization.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Accelerating Ill-Conditioned Low-Rank Matrix Estimation via Scaled Gradient Descent&lt;/strong&gt;&lt;br&gt;
&lt;em&gt;Authors&lt;/em&gt;: Tian Tong, Cong Ma, Yuejie Chi&lt;br&gt;
&lt;em&gt;Description&lt;/em&gt;: Proposes scaled gradient descent for accelerating ill-conditioned low-rank matrix estimation.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;On ADMM in Deep Learning: Convergence and Saturation-Avoidance&lt;/strong&gt;&lt;br&gt;
&lt;em&gt;Authors&lt;/em&gt;: Jinshan Zeng, Shao-Bo Lin, Yuan Yao, Ding-Xuan Zhou&lt;br&gt;
&lt;em&gt;Description&lt;/em&gt;: Analyzes convergence and saturation-avoidance properties of ADMM in deep learning.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;A Unified Convergence Analysis for Shuffling-Type Gradient Methods&lt;/strong&gt;&lt;br&gt;
&lt;em&gt;Authors&lt;/em&gt;: Lam M. Nguyen, Quoc Tran-Dinh, Dzung T. Phan, Phuong Ha Nguyen, Marten van Dijk&lt;br&gt;
&lt;em&gt;Description&lt;/em&gt;: Provides a unified convergence analysis for shuffling-type gradient methods.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Stochastic Online Optimization Using Kalman Recursion&lt;/strong&gt;&lt;br&gt;
&lt;em&gt;Authors&lt;/em&gt;: Joseph de Vilmarest, Olivier Wintenberger&lt;br&gt;
&lt;em&gt;Description&lt;/em&gt;: Applies Kalman recursion to stochastic online optimization.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Expanding Boundaries of Gap Safe Screening&lt;/strong&gt;&lt;br&gt;
&lt;em&gt;Authors&lt;/em&gt;: Cassio F. Dantas, Emmanuel Soubies, Cédric Févotte&lt;br&gt;
&lt;em&gt;Description&lt;/em&gt;: Expands gap safe screening techniques for stochastic optimization.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Consensus-Based Optimization on the Sphere: Convergence to Global Minimizers and Machine Learning&lt;/strong&gt;&lt;br&gt;
&lt;em&gt;Authors&lt;/em&gt;: Massimo Fornasier, Lorenzo Pareschi, Hui Huang, Philippe Sünnen&lt;br&gt;
&lt;em&gt;Description&lt;/em&gt;: Develops consensus-based optimization on the sphere with applications to machine learning.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Decentralized Stochastic Gradient Langevin Dynamics and Hamiltonian Monte Carlo&lt;/strong&gt;&lt;br&gt;
&lt;em&gt;Authors&lt;/em&gt;: Mert Gürbüzbalaban, Xuefeng Gao, Yuanhan Hu, Lingjiong Zhu&lt;br&gt;
&lt;em&gt;Description&lt;/em&gt;: Proposes decentralized stochastic gradient Langevin dynamics and Hamiltonian Monte Carlo methods.&lt;/p&gt;
&lt;/li&gt;
&lt;/ol&gt;
&lt;h2 class="heading" id="distributeddecentralized-optimization"&gt;
 Distributed/Decentralized Optimization&lt;span class="heading__anchor"&gt; &lt;a href="#distributeddecentralized-optimization"&gt;#&lt;/a&gt;&lt;/span&gt;
&lt;/h2&gt;&lt;p&gt;Papers addressing distributed or decentralized optimization algorithms, focusing on communication efficiency and scalability.&lt;/p&gt;
&lt;ol&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Projection-Free Decentralized Online Learning for Submodular Maximization over Time-Varying Networks&lt;/strong&gt;&lt;br&gt;
&lt;em&gt;Authors&lt;/em&gt;: Junlong Zhu, Qingtao Wu, Mingchuan Zhang, Ruijuan Zheng, Keqin Li&lt;br&gt;
&lt;em&gt;Description&lt;/em&gt;: Develops projection-free decentralized online learning for submodular maximization over time-varying networks.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Communication-Efficient Distributed Covariance Sketch, with Application to Distributed PCA&lt;/strong&gt;&lt;br&gt;
&lt;em&gt;Authors&lt;/em&gt;: Zengfeng Huang, Xuemin Lin, Wenjie Zhang, Ying Zhang&lt;br&gt;
&lt;em&gt;Description&lt;/em&gt;: Proposes a communication-efficient distributed covariance sketch for distributed PCA.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Optimal Rates of Distributed Regression with Imperfect Kernels&lt;/strong&gt;&lt;br&gt;
&lt;em&gt;Authors&lt;/em&gt;: Hongwei Sun, Qiang Wu&lt;br&gt;
&lt;em&gt;Description&lt;/em&gt;: Establishes optimal rates for distributed regression with imperfect kernels.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;One-Shot Federated Learning: Theoretical Limits and Algorithms to Achieve Them&lt;/strong&gt;&lt;br&gt;
&lt;em&gt;Authors&lt;/em&gt;: Saber Salehkaleybar, Arsalan Sharifnassab, S. Jamaloddin Golestani&lt;br&gt;
&lt;em&gt;Description&lt;/em&gt;: Analyzes theoretical limits and algorithms for one-shot federated learning.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Cooperative SGD: A Unified Framework for the Design and Analysis of Local-Update SGD Algorithms&lt;/strong&gt;&lt;br&gt;
&lt;em&gt;Authors&lt;/em&gt;: Jianyu Wang, Gauri Joshi&lt;br&gt;
&lt;em&gt;Description&lt;/em&gt;: Introduces a unified framework for designing and analyzing local-update SGD algorithms.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;DeEPCA: Decentralized Exact PCA with Linear Convergence Rate&lt;/strong&gt;&lt;br&gt;
&lt;em&gt;Authors&lt;/em&gt;: Haishan Ye, Tong Zhang&lt;br&gt;
&lt;em&gt;Description&lt;/em&gt;: Develops DeEPCA, a decentralized exact PCA method with linear convergence.&lt;/p&gt;
&lt;/li&gt;
&lt;/ol&gt;
&lt;h2 class="heading" id="submodular-optimization"&gt;
 Submodular Optimization&lt;span class="heading__anchor"&gt; &lt;a href="#submodular-optimization"&gt;#&lt;/a&gt;&lt;/span&gt;
&lt;/h2&gt;&lt;p&gt;Papers focusing on submodular optimization, particularly in experimental design.&lt;/p&gt;
&lt;ol&gt;
&lt;li&gt;&lt;strong&gt;Batch Greedy Maximization of Non-Submodular Functions: Guarantees and Applications to Experimental Design&lt;/strong&gt;&lt;br&gt;
&lt;em&gt;Authors&lt;/em&gt;: Jayanth Jagalur-Mohan, Youssef Marzouk&lt;br&gt;
&lt;em&gt;Description&lt;/em&gt;: Provides guarantees for batch greedy maximization of non-submodular functions with applications to experimental design.&lt;/li&gt;
&lt;/ol&gt;
&lt;h2 class="heading" id="bandits-and-online-learning"&gt;
 Bandits and Online Learning&lt;span class="heading__anchor"&gt; &lt;a href="#bandits-and-online-learning"&gt;#&lt;/a&gt;&lt;/span&gt;
&lt;/h2&gt;&lt;p&gt;Papers addressing multi-armed bandits, online optimization, and regret minimization.&lt;/p&gt;
&lt;ol&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Regulating Greed Over Time in Multi-Armed Bandits&lt;/strong&gt;&lt;br&gt;
&lt;em&gt;Authors&lt;/em&gt;: Stefano Tracà, Cynthia Rudin, Weiyu Yan&lt;br&gt;
&lt;em&gt;Description&lt;/em&gt;: Studies methods to regulate greed over time in multi-armed bandits.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Preference-Based Online Learning with Dueling Bandits: A Survey&lt;/strong&gt;&lt;br&gt;
&lt;em&gt;Authors&lt;/em&gt;: Viktor Bengs, Róbert Busa-Fekete, Adil El Mesaoudi-Paul, Eyke Hüllermeier&lt;br&gt;
&lt;em&gt;Description&lt;/em&gt;: Surveys preference-based online learning with dueling bandits.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;On Multi-Armed Bandit Designs for Dose-Finding Trials&lt;/strong&gt;&lt;br&gt;
&lt;em&gt;Authors&lt;/em&gt;: Maryam Aziz, Emilie Kaufmann, Marie-Karelle Riviere&lt;br&gt;
&lt;em&gt;Description&lt;/em&gt;: Explores multi-armed bandit designs for dose-finding trials.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Tsallis-INF: An Optimal Algorithm for Stochastic and Adversarial Bandits&lt;/strong&gt;&lt;br&gt;
&lt;em&gt;Authors&lt;/em&gt;: Julian Zimmert, Yevgeny Seldin&lt;br&gt;
&lt;em&gt;Description&lt;/em&gt;: Proposes Tsallis-INF, an optimal algorithm for stochastic and adversarial bandits.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Bandit Convex Optimization in Non-Stationary Environments&lt;/strong&gt;&lt;br&gt;
&lt;em&gt;Authors&lt;/em&gt;: Peng Zhao, Guanghui Wang, Lijun Zhang, Zhi-Hua Zhou&lt;br&gt;
&lt;em&gt;Description&lt;/em&gt;: Addresses bandit convex optimization in non-stationary environments.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;A Contextual Bandit Bake-off&lt;/strong&gt;&lt;br&gt;
&lt;em&gt;Authors&lt;/em&gt;: Alberto Bietti, Alekh Agarwal, John Langford&lt;br&gt;
&lt;em&gt;Description&lt;/em&gt;: Compares contextual bandit algorithms in a comprehensive evaluation.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;MetaGrad: Adaptation Using Multiple Learning Rates in Online Learning&lt;/strong&gt;&lt;br&gt;
&lt;em&gt;Authors&lt;/em&gt;: Tim van Erven, Wouter M. Koolen, Dirk van der Hoeven&lt;br&gt;
&lt;em&gt;Description&lt;/em&gt;: Introduces MetaGrad, an adaptive online learning algorithm with multiple learning rates.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Achieving Fairness in the Stochastic Multi-Armed Bandit Problem&lt;/strong&gt;&lt;br&gt;
&lt;em&gt;Authors&lt;/em&gt;: Vishakha Patil, Ganesh Ghalme, Vineet Nair, Y. Narahari&lt;br&gt;
&lt;em&gt;Description&lt;/em&gt;: Develops methods for achieving fairness in stochastic multi-armed bandits.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Refined Approachability Algorithms and Application to Regret Minimization with Global Costs&lt;/strong&gt;&lt;br&gt;
&lt;em&gt;Authors&lt;/em&gt;: Joon Kwon&lt;br&gt;
&lt;em&gt;Description&lt;/em&gt;: Proposes refined approachability algorithms for regret minimization with global costs.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Bandit Learning in Decentralized Matching Markets&lt;/strong&gt;&lt;br&gt;
&lt;em&gt;Authors&lt;/em&gt;: Lydia T. Liu, Feng Ruan, Horia Mania, Michael I. Jordan&lt;br&gt;
&lt;em&gt;Description&lt;/em&gt;: Applies bandit learning to decentralized matching markets.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Thompson Sampling Algorithms for Cascading Bandits&lt;/strong&gt;&lt;br&gt;
&lt;em&gt;Authors&lt;/em&gt;: Zixin Zhong, Wang Chi Chueng, Vincent Y. F. Tan&lt;br&gt;
&lt;em&gt;Description&lt;/em&gt;: Develops Thompson sampling algorithms for cascading bandits.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Fast Learning for Renewal Optimization in Online Task Scheduling&lt;/strong&gt;&lt;br&gt;
&lt;em&gt;Authors&lt;/em&gt;: Michael J. Neely&lt;br&gt;
&lt;em&gt;Description&lt;/em&gt;: Proposes fast learning methods for renewal optimization in online task scheduling.&lt;/p&gt;
&lt;/li&gt;
&lt;/ol&gt;
&lt;h2 class="heading" id="bayesian-and-hyperparameter-optimization"&gt;
 Bayesian and Hyperparameter Optimization&lt;span class="heading__anchor"&gt; &lt;a href="#bayesian-and-hyperparameter-optimization"&gt;#&lt;/a&gt;&lt;/span&gt;
&lt;/h2&gt;&lt;p&gt;Papers addressing Bayesian optimization and hyperparameter tuning for scalable and robust optimization.&lt;/p&gt;
&lt;ol&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;An Empirical Study of Bayesian Optimization: Acquisition Versus Partition&lt;/strong&gt;&lt;br&gt;
&lt;em&gt;Authors&lt;/em&gt;: Erich Merrill, Alan Fern, Xiaoli Fern, Nima Dolatnia&lt;br&gt;
&lt;em&gt;Description&lt;/em&gt;: Conducts an empirical study comparing acquisition and partition strategies in Bayesian optimization.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Hyperparameter Optimization via Sequential Uniform Designs&lt;/strong&gt;&lt;br&gt;
&lt;em&gt;Authors&lt;/em&gt;: Zebin Yang, Aijun Zhang&lt;br&gt;
&lt;em&gt;Description&lt;/em&gt;: Proposes sequential uniform designs for hyperparameter optimization.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Are We Forgetting about Compositional Optimisers in Bayesian Optimisation?&lt;/strong&gt;&lt;br&gt;
&lt;em&gt;Authors&lt;/em&gt;: Antoine Grosnit, Alexander I. Cowen-Rivers, Rasul Tutunov, Ryan-Rhys Griffiths, Jun Wang, Haitham Bou-Ammar&lt;br&gt;
&lt;em&gt;Description&lt;/em&gt;: Explores the role of compositional optimizers in Bayesian optimization.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;GIBBON: General-Purpose Information-Based Bayesian Optimisation&lt;/strong&gt;&lt;br&gt;
&lt;em&gt;Authors&lt;/em&gt;: Henry B. Moss, David S. Leslie, Javier Gonzalez, Paul Rayson&lt;br&gt;
&lt;em&gt;Description&lt;/em&gt;: Introduces GIBBON, a general-purpose information-based Bayesian optimization framework.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;On lp-Hyperparameter Learning via Bilevel Nonsmooth Optimization&lt;/strong&gt;&lt;br&gt;
&lt;em&gt;Authors&lt;/em&gt;: Takayuki Okuno, Akiko Takeda, Akihiro Kawana, Motokazu Watanabe&lt;br&gt;
&lt;em&gt;Description&lt;/em&gt;: Studies lp-hyperparameter learning using bilevel nonsmooth optimization.&lt;/p&gt;
&lt;/li&gt;
&lt;/ol&gt;
&lt;h2 class="heading" id="optimization-in-reinforcement-learning"&gt;
 Optimization in Reinforcement Learning&lt;span class="heading__anchor"&gt; &lt;a href="#optimization-in-reinforcement-learning"&gt;#&lt;/a&gt;&lt;/span&gt;
&lt;/h2&gt;&lt;p&gt;Papers focusing on optimization techniques for reinforcement learning, including policy iteration and Q-learning.&lt;/p&gt;
&lt;ol&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Safe Policy Iteration: A Monotonically Improving Approximate Policy Iteration Approach&lt;/strong&gt;&lt;br&gt;
&lt;em&gt;Authors&lt;/em&gt;: Alberto Maria Metelli, Matteo Pirotta, Daniele Calandriello, Marcello Restelli&lt;br&gt;
&lt;em&gt;Description&lt;/em&gt;: Proposes a safe policy iteration method with monotonic improvement for reinforcement learning.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;On the Theory of Policy Gradient Methods: Optimality, Approximation, and Distribution Shift&lt;/strong&gt;&lt;br&gt;
&lt;em&gt;Authors&lt;/em&gt;: Alekh Agarwal, Sham M. Kakade, Jason D. Lee, Gaurav Mahajan&lt;br&gt;
&lt;em&gt;Description&lt;/em&gt;: Analyzes the optimality, approximation, and distribution shift in policy gradient methods.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Langevin Dynamics for Adaptive Inverse Reinforcement Learning of Stochastic Gradient Algorithms&lt;/strong&gt;&lt;br&gt;
&lt;em&gt;Authors&lt;/em&gt;: Vikram Krishnamurthy, George Yin&lt;br&gt;
&lt;em&gt;Description&lt;/em&gt;: Applies Langevin dynamics to adaptive inverse reinforcement learning for stochastic gradient algorithms.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Hamilton-Jacobi Deep Q-Learning for Deterministic Continuous-Time Systems with Lipschitz Continuous Controls&lt;/strong&gt;&lt;br&gt;
&lt;em&gt;Authors&lt;/em&gt;: Jeongho Kim, Jaeuk Shin, Insoon Yang&lt;br&gt;
&lt;em&gt;Description&lt;/em&gt;: Develops Hamilton-Jacobi deep Q-learning for deterministic continuous-time systems.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Partial Policy Iteration for L1-Robust Markov Decision Processes&lt;/strong&gt;&lt;br&gt;
&lt;em&gt;Authors&lt;/em&gt;: Chin Pang Ho, Marek Petrik, Wolfram Wiesemann&lt;br&gt;
&lt;em&gt;Description&lt;/em&gt;: Introduces partial policy iteration for L1-robust Markov decision processes.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Gaussian Approximation for Bias Reduction in Q-Learning&lt;/strong&gt;&lt;br&gt;
&lt;em&gt;Authors&lt;/em&gt;: Carlo D&amp;rsquo;Eramo, Andrea Cini, Alessandro Nuara, Matteo Pirotta, Cesare Alippi, Jan Peters, Marcello Restelli&lt;br&gt;
&lt;em&gt;Description&lt;/em&gt;: Proposes Gaussian approximation techniques for bias reduction in Q-learning.&lt;/p&gt;
&lt;/li&gt;
&lt;/ol&gt;
&lt;h2 class="heading" id="other-optimization-topics"&gt;
 Other Optimization Topics&lt;span class="heading__anchor"&gt; &lt;a href="#other-optimization-topics"&gt;#&lt;/a&gt;&lt;/span&gt;
&lt;/h2&gt;&lt;p&gt;Papers covering miscellaneous optimization topics, including Newton methods, SVM training, and eigenvector computation.&lt;/p&gt;
&lt;ol&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Global and Quadratic Convergence of Newton Hard-Thresholding Pursuit&lt;/strong&gt;&lt;br&gt;
&lt;em&gt;Authors&lt;/em&gt;: Shenglong Zhou, Naihua Xiu, Hou-Duo Qi&lt;br&gt;
&lt;em&gt;Description&lt;/em&gt;: Analyzes global and quadratic convergence of Newton hard-thresholding pursuit.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;A Two-Level Decomposition Framework Exploiting First and Second Order Information for SVM Training Problems&lt;/strong&gt;&lt;br&gt;
&lt;em&gt;Authors&lt;/em&gt;: Giulio Galvan, Matteo Lapucci, Chih-Jen Lin, Marco Sciandrone&lt;br&gt;
&lt;em&gt;Description&lt;/em&gt;: Proposes a two-level decomposition framework for SVM training using first and second-order information.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Approximate Newton Methods&lt;/strong&gt;&lt;br&gt;
&lt;em&gt;Authors&lt;/em&gt;: Haishan Ye, Luo Luo, Zhihua Zhang&lt;br&gt;
&lt;em&gt;Description&lt;/em&gt;: Develops approximate Newton methods for optimization.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;A Unified Analysis of First-Order Methods for Smooth Games via Integral Quadratic Constraints&lt;/strong&gt;&lt;br&gt;
&lt;em&gt;Authors&lt;/em&gt;: Guodong Zhang, Xuchan Bao, Laurent Lessard, Roger Grosse&lt;br&gt;
&lt;em&gt;Description&lt;/em&gt;: Provides a unified analysis of first-order methods for smooth games using integral quadratic constraints.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;LassoNet: A Neural Network with Feature Sparsity&lt;/strong&gt;&lt;br&gt;
&lt;em&gt;Authors&lt;/em&gt;: Ismael Lemhadri, Feng Ruan, Louis Abraham, Robert Tibshirani&lt;br&gt;
&lt;em&gt;Description&lt;/em&gt;: Introduces LassoNet, a neural network architecture promoting feature sparsity.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;An Algorithmic View of L2 Regularization and Some Path-Following Algorithms&lt;/strong&gt;&lt;br&gt;
&lt;em&gt;Authors&lt;/em&gt;: Yunzhang Zhu, Renxiong Liu&lt;br&gt;
&lt;em&gt;Description&lt;/em&gt;: Explores L2 regularization from an algorithmic perspective with path-following algorithms.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;The Ensmallen Library for Flexible Numerical Optimization&lt;/strong&gt;&lt;br&gt;
&lt;em&gt;Authors&lt;/em&gt;: Ryan R. Curtin, Marcus Edel, Rahul Ganesh Prabhu, Suryoday Basak, Zhihao Lou, Conrad Sanderson&lt;br&gt;
&lt;em&gt;Description&lt;/em&gt;: Introduces the ensmallen library for flexible numerical optimization.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Black-Box Reductions for Zeroth-Order Gradient Algorithms to Achieve Lower Query Complexity&lt;/strong&gt;&lt;br&gt;
&lt;em&gt;Authors&lt;/em&gt;: Bin Gu, Xiyuan Wei, Shangqian Gao, Ziran Xiong, Cheng Deng, Heng Huang&lt;br&gt;
&lt;em&gt;Description&lt;/em&gt;: Proposes black-box reductions for zeroth-order gradient algorithms to reduce query complexity.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;On the Riemannian Search for Eigenvector Computation&lt;/strong&gt;&lt;br&gt;
&lt;em&gt;Authors&lt;/em&gt;: Zhiqiang Xu, Ping Li&lt;br&gt;
&lt;em&gt;Description&lt;/em&gt;: Develops Riemannian search methods for eigenvector computation.&lt;/p&gt;
&lt;/li&gt;
&lt;/ol&gt;</description></item><item><title>Optimization Research Papers in JMLR Volume 21</title><link>https://blog.namln.org/en/mathematics/analysis/optimization/jmlr-v21/</link><pubDate>Tue, 29 Sep 2020 00:00:00 +0000</pubDate><guid>https://blog.namln.org/en/mathematics/analysis/optimization/jmlr-v21/</guid><description>&lt;h1 class="heading" id="optimization-research-papers-in-jmlr-volume-21-2020"&gt;
 Optimization Research Papers in JMLR Volume 21 (2020)&lt;span class="heading__anchor"&gt; &lt;a href="#optimization-research-papers-in-jmlr-volume-21-2020"&gt;#&lt;/a&gt;&lt;/span&gt;
&lt;/h1&gt;&lt;p&gt;This document lists papers from JMLR Volume 21 (2020) that focus on optimization research, categorized by their primary themes. Each paper is numbered starting from 1 within its subsection, with a brief description of its key contributions to optimization theory, algorithms, or applications.&lt;/p&gt;
&lt;h2 class="heading" id="convex-optimization"&gt;
 Convex Optimization&lt;span class="heading__anchor"&gt; &lt;a href="#convex-optimization"&gt;#&lt;/a&gt;&lt;/span&gt;
&lt;/h2&gt;&lt;p&gt;Papers addressing convex optimization problems, including complexity bounds, convergence analysis, and applications in regression and assortment optimization.&lt;/p&gt;
&lt;ol&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;A Low Complexity Algorithm with O(√T) Regret and O(1) Constraint Violations for Online Convex Optimization with Long Term Constraints&lt;/strong&gt;&lt;br&gt;
&lt;em&gt;Authors&lt;/em&gt;: Hao Yu, Michael J. Neely&lt;br&gt;
&lt;em&gt;Description&lt;/em&gt;: Proposes a low-complexity algorithm for online convex optimization with long-term constraints, achieving O(√T) regret and O(1) constraint violations.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Lower Bounds for Parallel and Randomized Convex Optimization&lt;/strong&gt;&lt;br&gt;
&lt;em&gt;Authors&lt;/em&gt;: Jelena Diakonikolas, Cristóbal Guzmán&lt;br&gt;
&lt;em&gt;Description&lt;/em&gt;: Establishes lower complexity bounds for parallel and randomized algorithms in convex optimization.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Discerning the Linear Convergence of ADMM for Structured Convex Optimization through the Lens of Variational Analysis&lt;/strong&gt;&lt;br&gt;
&lt;em&gt;Authors&lt;/em&gt;: Xiaoming Yuan, Shangzhi Zeng, Jin Zhang&lt;br&gt;
&lt;em&gt;Description&lt;/em&gt;: Analyzes the linear convergence of ADMM for structured convex optimization using variational analysis.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;A Data Efficient and Feasible Level Set Method for Stochastic Convex Optimization with Expectation Constraints&lt;/strong&gt;&lt;br&gt;
&lt;em&gt;Authors&lt;/em&gt;: Qihang Lin, Selvaprabu Nadarajah, Negar Soheili, Tianbao Yang&lt;br&gt;
&lt;em&gt;Description&lt;/em&gt;: Develops a data-efficient level set method for stochastic convex optimization with expectation constraints.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Conic Optimization for Quadratic Regression Under Sparse Noise&lt;/strong&gt;&lt;br&gt;
&lt;em&gt;Authors&lt;/em&gt;: Igor Molybog, Ramtin Madani, Javad Lavaei&lt;br&gt;
&lt;em&gt;Description&lt;/em&gt;: Applies conic optimization to quadratic regression under sparse noise conditions.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Dynamic Assortment Optimization with Changing Contextual Information&lt;/strong&gt;&lt;br&gt;
&lt;em&gt;Authors&lt;/em&gt;: Xi Chen, Yining Wang, Yuan Zhou&lt;br&gt;
&lt;em&gt;Description&lt;/em&gt;: Addresses dynamic assortment optimization with changing contextual information using convex optimization techniques.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Convex Programming for Estimation in Nonlinear Recurrent Models&lt;/strong&gt;&lt;br&gt;
&lt;em&gt;Authors&lt;/em&gt;: Sohail Bahmani, Justin Romberg&lt;br&gt;
&lt;em&gt;Description&lt;/em&gt;: Uses convex programming for parameter estimation in nonlinear recurrent models.&lt;/p&gt;
&lt;/li&gt;
&lt;/ol&gt;
&lt;h2 class="heading" id="nonconvex-optimization"&gt;
 Nonconvex Optimization&lt;span class="heading__anchor"&gt; &lt;a href="#nonconvex-optimization"&gt;#&lt;/a&gt;&lt;/span&gt;
&lt;/h2&gt;&lt;p&gt;Papers tackling nonconvex optimization, focusing on guarantees for local minima, variance reduction, and algorithmic advancements.&lt;/p&gt;
&lt;ol&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Exact Guarantees on the Absence of Spurious Local Minima for Non-negative Rank-1 Robust Principal Component Analysis&lt;/strong&gt;&lt;br&gt;
&lt;em&gt;Authors&lt;/em&gt;: Salar Fattahi, Somayeh Sojoudi&lt;br&gt;
&lt;em&gt;Description&lt;/em&gt;: Provides exact guarantees for the absence of spurious local minima in non-negative rank-1 robust PCA.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Stochastic Nested Variance Reduction for Nonconvex Optimization&lt;/strong&gt;&lt;br&gt;
&lt;em&gt;Authors&lt;/em&gt;: Dongruo Zhou, Pan Xu, Quanquan Gu&lt;br&gt;
&lt;em&gt;Description&lt;/em&gt;: Introduces a stochastic nested variance reduction method for nonconvex optimization.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;ProxSARAH: An Efficient Algorithmic Framework for Stochastic Composite Nonconvex Optimization&lt;/strong&gt;&lt;br&gt;
&lt;em&gt;Authors&lt;/em&gt;: Nhan H. Pham, Lam M. Nguyen, Dzung T. Phan, Quoc Tran-Dinh&lt;br&gt;
&lt;em&gt;Description&lt;/em&gt;: Proposes ProxSARAH, an efficient framework for stochastic composite nonconvex optimization.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Convergence Rates for the Stochastic Gradient Descent Method for Non-Convex Objective Functions&lt;/strong&gt;&lt;br&gt;
&lt;em&gt;Authors&lt;/em&gt;: Benjamin Fehrman, Benjamin Gess, Arnulf Jentzen&lt;br&gt;
&lt;em&gt;Description&lt;/em&gt;: Analyzes convergence rates of stochastic gradient descent for nonconvex objective functions.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;AdaGrad Stepsizes: Sharp Convergence Over Nonconvex Landscapes&lt;/strong&gt;&lt;br&gt;
&lt;em&gt;Authors&lt;/em&gt;: Rachel Ward, Xiaoxia Wu, Leon Bottou&lt;br&gt;
&lt;em&gt;Description&lt;/em&gt;: Studies sharp convergence of AdaGrad stepsize schedules in nonconvex optimization.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;A Sparse Semismooth Newton Based Proximal Majorization-Minimization Algorithm for Nonconvex Square-Root-Loss Regression Problems&lt;/strong&gt;&lt;br&gt;
&lt;em&gt;Authors&lt;/em&gt;: Peipei Tang, Chengjing Wang, Defeng Sun, Kim-Chuan Toh&lt;br&gt;
&lt;em&gt;Description&lt;/em&gt;: Develops a sparse semismooth Newton-based proximal majorization-minimization algorithm for nonconvex square-root-loss regression.&lt;/p&gt;
&lt;/li&gt;
&lt;/ol&gt;
&lt;h2 class="heading" id="stochastic-optimization"&gt;
 Stochastic Optimization&lt;span class="heading__anchor"&gt; &lt;a href="#stochastic-optimization"&gt;#&lt;/a&gt;&lt;/span&gt;
&lt;/h2&gt;&lt;p&gt;Papers focusing on stochastic optimization methods, including gradient descent, variance reduction, and robustness to noise.&lt;/p&gt;
&lt;ol&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Convergences of Regularized Algorithms and Stochastic Gradient Methods with Random Projections&lt;/strong&gt;&lt;br&gt;
&lt;em&gt;Authors&lt;/em&gt;: Junhong Lin, Volkan Cevher&lt;br&gt;
&lt;em&gt;Description&lt;/em&gt;: Analyzes convergence of regularized algorithms and stochastic gradient methods with random projections.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Graph-Dependent Implicit Regularisation for Distributed Stochastic Subgradient Descent&lt;/strong&gt;&lt;br&gt;
&lt;em&gt;Authors&lt;/em&gt;: Dominic Richards, Patrick Rebeschini&lt;br&gt;
&lt;em&gt;Description&lt;/em&gt;: Studies graph-dependent implicit regularization in distributed stochastic subgradient descent.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Robust Asynchronous Stochastic Gradient-Push: Asymptotically Optimal and Network-Independent Performance for Strongly Convex Functions&lt;/strong&gt;&lt;br&gt;
&lt;em&gt;Authors&lt;/em&gt;: Artin Spiridonoff, Alex Olshevsky, Ioannis Ch. Paschalidis&lt;br&gt;
&lt;em&gt;Description&lt;/em&gt;: Proposes a robust asynchronous stochastic gradient-push method with asymptotically optimal performance for strongly convex functions.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;On Stationary-Point Hitting Time and Ergodicity of Stochastic Gradient Langevin Dynamics&lt;/strong&gt;&lt;br&gt;
&lt;em&gt;Authors&lt;/em&gt;: Xi Chen, Simon S. Du, Xin T. Tong&lt;br&gt;
&lt;em&gt;Description&lt;/em&gt;: Investigates stationary-point hitting time and ergodicity in stochastic gradient Langevin dynamics.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Stochastic Conditional Gradient Methods: From Convex Minimization to Submodular Maximization&lt;/strong&gt;&lt;br&gt;
&lt;em&gt;Authors&lt;/em&gt;: Aryan Mokhtari, Hamed Hassani, Amin Karbasi&lt;br&gt;
&lt;em&gt;Description&lt;/em&gt;: Extends stochastic conditional gradient methods from convex minimization to submodular maximization.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;A Class of Parallel Doubly Stochastic Algorithms for Large-Scale Learning&lt;/strong&gt;&lt;br&gt;
&lt;em&gt;Authors&lt;/em&gt;: Aryan Mokhtari, Alec Koppel, Martin Takac, Alejandro Ribeiro&lt;br&gt;
&lt;em&gt;Description&lt;/em&gt;: Introduces parallel doubly stochastic algorithms for large-scale learning.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Gradient Descent for Sparse Rank-One Matrix Completion for Crowd-Sourced Aggregation of Sparsely Interacting Workers&lt;/strong&gt;&lt;br&gt;
&lt;em&gt;Authors&lt;/em&gt;: Yao Ma, Alex Olshevsky, Csaba Szepesvari, Venkatesh Saligrama&lt;br&gt;
&lt;em&gt;Description&lt;/em&gt;: Applies gradient descent to sparse rank-one matrix completion for crowd-sourced worker aggregation.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Optimal Convergence for Distributed Learning with Stochastic Gradient Methods and Spectral Algorithms&lt;/strong&gt;&lt;br&gt;
&lt;em&gt;Authors&lt;/em&gt;: Junhong Lin, Volkan Cevher&lt;br&gt;
&lt;em&gt;Description&lt;/em&gt;: Establishes optimal convergence rates for distributed learning using stochastic gradient methods and spectral algorithms.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Estimate Sequences for Stochastic Composite Optimization: Variance Reduction, Acceleration, and Robustness to Noise&lt;/strong&gt;&lt;br&gt;
&lt;em&gt;Authors&lt;/em&gt;: Andrei Kulunchakov, Julien Mairal&lt;br&gt;
&lt;em&gt;Description&lt;/em&gt;: Develops estimate sequences for stochastic composite optimization with variance reduction and noise robustness.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;A Unified q-Memorization Framework for Asynchronous Stochastic Optimization&lt;/strong&gt;&lt;br&gt;
&lt;em&gt;Authors&lt;/em&gt;: Bin Gu, Wenhan Xian, Zhouyuan Huo, Cheng Deng, Heng Huang&lt;br&gt;
&lt;em&gt;Description&lt;/em&gt;: Proposes a unified q-memorization framework for asynchronous stochastic optimization.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Asymptotic Analysis via Stochastic Differential Equations of Gradient Descent Algorithms in Statistical and Computational Paradigms&lt;/strong&gt;&lt;br&gt;
&lt;em&gt;Authors&lt;/em&gt;: Yazhen Wang, Shang Wu&lt;br&gt;
&lt;em&gt;Description&lt;/em&gt;: Analyzes gradient descent algorithms using stochastic differential equations in statistical and computational settings.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;The Error-Feedback Framework: SGD with Delayed Gradients&lt;/strong&gt;&lt;br&gt;
&lt;em&gt;Authors&lt;/em&gt;: Sebastian U. Stich, Sai Praneeth Karimireddy&lt;br&gt;
&lt;em&gt;Description&lt;/em&gt;: Introduces an error-feedback framework for stochastic gradient descent with delayed gradients.&lt;/p&gt;
&lt;/li&gt;
&lt;/ol&gt;
&lt;h2 class="heading" id="distributedparallel-optimization"&gt;
 Distributed/Parallel Optimization&lt;span class="heading__anchor"&gt; &lt;a href="#distributedparallel-optimization"&gt;#&lt;/a&gt;&lt;/span&gt;
&lt;/h2&gt;&lt;p&gt;Papers addressing distributed or parallel optimization algorithms, focusing on communication efficiency and scalability.&lt;/p&gt;
&lt;ol&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;On the Complexity Analysis of the Primal Solutions for the Accelerated Randomized Dual Coordinate Ascent&lt;/strong&gt;&lt;br&gt;
&lt;em&gt;Authors&lt;/em&gt;: Huan Li, Zhouchen Lin&lt;br&gt;
&lt;em&gt;Description&lt;/em&gt;: Analyzes the complexity of primal solutions for accelerated randomized dual coordinate ascent in distributed settings.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;WONDER: Weighted One-shot Distributed Ridge Regression in High Dimensions&lt;/strong&gt;&lt;br&gt;
&lt;em&gt;Authors&lt;/em&gt;: Edgar Dobriban, Yue Sheng&lt;br&gt;
&lt;em&gt;Description&lt;/em&gt;: Proposes WONDER, a weighted one-shot distributed ridge regression method for high-dimensional data.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;GADMM: Fast and Communication Efficient Framework for Distributed Machine Learning&lt;/strong&gt;&lt;br&gt;
&lt;em&gt;Authors&lt;/em&gt;: Anis Elgabli, Jihong Park, Amrit S. Bedi, Mehdi Bennis, Vaneet Aggarwal&lt;br&gt;
&lt;em&gt;Description&lt;/em&gt;: Introduces GADMM, a fast and communication-efficient framework for distributed machine learning.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Communication-Efficient Distributed Optimization in Networks with Gradient Tracking and Variance Reduction&lt;/strong&gt;&lt;br&gt;
&lt;em&gt;Authors&lt;/em&gt;: Boyue Li, Shicong Cen, Yuxin Chen, Yuejie Chi&lt;br&gt;
&lt;em&gt;Description&lt;/em&gt;: Develops communication-efficient distributed optimization with gradient tracking and variance reduction.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;On Convergence of Distributed Approximate Newton Methods: Globalization, Sharper Bounds and Beyond&lt;/strong&gt;&lt;br&gt;
&lt;em&gt;Authors&lt;/em&gt;: Xiao-Tong Yuan, Ping Li&lt;br&gt;
&lt;em&gt;Description&lt;/em&gt;: Analyzes convergence of distributed approximate Newton methods with sharper bounds and globalization techniques.&lt;/p&gt;
&lt;/li&gt;
&lt;/ol&gt;
&lt;h2 class="heading" id="submodular-optimization"&gt;
 Submodular Optimization&lt;span class="heading__anchor"&gt; &lt;a href="#submodular-optimization"&gt;#&lt;/a&gt;&lt;/span&gt;
&lt;/h2&gt;&lt;p&gt;Papers focusing on submodular optimization, including minimization and maximization problems.&lt;/p&gt;
&lt;ol&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Quadratic Decomposable Submodular Function Minimization: Theory and Practice&lt;/strong&gt;&lt;br&gt;
&lt;em&gt;Authors&lt;/em&gt;: Pan Li, Niao He, Olgica Milenkovic&lt;br&gt;
&lt;em&gt;Description&lt;/em&gt;: Studies quadratic decomposable submodular function minimization with theoretical and practical insights.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Optimal Algorithms for Continuous Non-monotone Submodular and DR-Submodular Maximization&lt;/strong&gt;&lt;br&gt;
&lt;em&gt;Authors&lt;/em&gt;: Rad Niazadeh, Tim Roughgarden, Joshua R. Wang&lt;br&gt;
&lt;em&gt;Description&lt;/em&gt;: Develops optimal algorithms for continuous non-monotone submodular and DR-submodular maximization.&lt;/p&gt;
&lt;/li&gt;
&lt;/ol&gt;
&lt;h2 class="heading" id="bayesian-and-hyperparameter-optimization"&gt;
 Bayesian and Hyperparameter Optimization&lt;span class="heading__anchor"&gt; &lt;a href="#bayesian-and-hyperparameter-optimization"&gt;#&lt;/a&gt;&lt;/span&gt;
&lt;/h2&gt;&lt;p&gt;Papers addressing Bayesian optimization and hyperparameter tuning for scalable and robust optimization.&lt;/p&gt;
&lt;ol&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Tuning Hyperparameters without Grad Students: Scalable and Robust Bayesian Optimisation with Dragonfly&lt;/strong&gt;&lt;br&gt;
&lt;em&gt;Authors&lt;/em&gt;: Kirthevasan Kandasamy, Karun Raju Vysyaraju, Willie Neiswanger, Biswajit Paria, Christopher R. Collins, Jeff Schneider, Barnabas Poczos, Eric P. Xing&lt;br&gt;
&lt;em&gt;Description&lt;/em&gt;: Introduces Dragonfly, a scalable and robust Bayesian optimization framework for hyperparameter tuning.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Distributionally Ambiguous Optimization for Batch Bayesian Optimization&lt;/strong&gt;&lt;br&gt;
&lt;em&gt;Authors&lt;/em&gt;: Nikitas Rontsis, Michael A. Osborne, Paul J. Goulart&lt;br&gt;
&lt;em&gt;Description&lt;/em&gt;: Proposes distributionally ambiguous optimization for batch Bayesian optimization.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;The Kalai-Smorodinsky Solution for Many-Objective Bayesian Optimization&lt;/strong&gt;&lt;br&gt;
&lt;em&gt;Authors&lt;/em&gt;: Mickael Binois, Victor Picheny, Patrick Taillandier, Abderrahmane Habbal&lt;br&gt;
&lt;em&gt;Description&lt;/em&gt;: Applies the Kalai-Smorodinsky solution to many-objective Bayesian optimization.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Robust Reinforcement Learning with Bayesian Optimisation and Quadrature&lt;/strong&gt;&lt;br&gt;
&lt;em&gt;Authors&lt;/em&gt;: Supratik Paul, Konstantinos Chatzilygeroudis, Kamil Ciosek, Jean-Baptiste Mouret, Michael A. Osborne, Shimon Whiteson&lt;br&gt;
&lt;em&gt;Description&lt;/em&gt;: Integrates Bayesian optimization and quadrature for robust reinforcement learning.&lt;/p&gt;
&lt;/li&gt;
&lt;/ol&gt;
&lt;h2 class="heading" id="optimization-in-reinforcement-learning"&gt;
 Optimization in Reinforcement Learning&lt;span class="heading__anchor"&gt; &lt;a href="#optimization-in-reinforcement-learning"&gt;#&lt;/a&gt;&lt;/span&gt;
&lt;/h2&gt;&lt;p&gt;Papers focusing on optimization techniques for policy optimization and reinforcement learning.&lt;/p&gt;
&lt;ol&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Derivative-Free Methods for Policy Optimization: Guarantees for Linear Quadratic Systems&lt;/strong&gt;&lt;br&gt;
&lt;em&gt;Authors&lt;/em&gt;: Dhruv Malik, Ashwin Pananjady, Kush Bhatia, Koulik Khamaru, Peter L. Bartlett, Martin J. Wainwright&lt;br&gt;
&lt;em&gt;Description&lt;/em&gt;: Develops derivative-free methods for policy optimization in linear quadratic systems with guarantees.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Expected Policy Gradients for Reinforcement Learning&lt;/strong&gt;&lt;br&gt;
&lt;em&gt;Authors&lt;/em&gt;: Kamil Ciosek, Shimon Whiteson&lt;br&gt;
&lt;em&gt;Description&lt;/em&gt;: Introduces expected policy gradients for reinforcement learning optimization.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Importance Sampling Techniques for Policy Optimization&lt;/strong&gt;&lt;br&gt;
&lt;em&gt;Authors&lt;/em&gt;: Alberto Maria Metelli, Matteo Papini, Nico Montali, Marcello Restelli&lt;br&gt;
&lt;em&gt;Description&lt;/em&gt;: Proposes importance sampling techniques for efficient policy optimization in reinforcement learning.&lt;/p&gt;
&lt;/li&gt;
&lt;/ol&gt;
&lt;h2 class="heading" id="other-optimization-topics"&gt;
 Other Optimization Topics&lt;span class="heading__anchor"&gt; &lt;a href="#other-optimization-topics"&gt;#&lt;/a&gt;&lt;/span&gt;
&lt;/h2&gt;&lt;p&gt;Papers covering miscellaneous optimization topics, including dictionary learning, neural network verification, and differential privacy.&lt;/p&gt;
&lt;ol&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Learning with Fenchel-Young Losses&lt;/strong&gt;&lt;br&gt;
&lt;em&gt;Authors&lt;/em&gt;: Mathieu Blondel, André F.T. Martins, Vlad Niculae&lt;br&gt;
&lt;em&gt;Description&lt;/em&gt;: Introduces optimization with Fenchel-Young losses for structured prediction.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Branch and Bound for Piecewise Linear Neural Network Verification&lt;/strong&gt;&lt;br&gt;
&lt;em&gt;Authors&lt;/em&gt;: Rudy Bunel, Jingyue Lu, Ilker Turkaslan, Philip H.S. Torr, Pushmeet Kohli, M. Pawan Kumar&lt;br&gt;
&lt;em&gt;Description&lt;/em&gt;: Applies branch and bound techniques for piecewise linear neural network verification.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Conjugate Gradients for Kernel Machines&lt;/strong&gt;&lt;br&gt;
&lt;em&gt;Authors&lt;/em&gt;: Simon Bartels, Philipp Hennig&lt;br&gt;
&lt;em&gt;Description&lt;/em&gt;: Develops conjugate gradient methods for optimization in kernel machines.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Unique Sharp Local Minimum in L1-Minimization Complete Dictionary Learning&lt;/strong&gt;&lt;br&gt;
&lt;em&gt;Authors&lt;/em&gt;: Yu Wang, Siqi Wu, Bin Yu&lt;br&gt;
&lt;em&gt;Description&lt;/em&gt;: Analyzes unique sharp local minima in L1-minimization for complete dictionary learning.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Community-Based Group Graphical Lasso&lt;/strong&gt;&lt;br&gt;
&lt;em&gt;Authors&lt;/em&gt;: Eugen Pircalabelu, Gerda Claeskens&lt;br&gt;
&lt;em&gt;Description&lt;/em&gt;: Proposes a community-based group graphical Lasso for structured optimization.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Constrained Dynamic Programming and Supervised Penalty Learning Algorithms for Peak Detection in Genomic Data&lt;/strong&gt;&lt;br&gt;
&lt;em&gt;Authors&lt;/em&gt;: Toby Dylan Hocking, Guillem Rigaill, Paul Fearnhead, Guillaume Bourque&lt;br&gt;
&lt;em&gt;Description&lt;/em&gt;: Develops constrained dynamic programming and supervised penalty learning for peak detection in genomic data.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Loss Control with Rank-One Covariance Estimate for Short-Term Portfolio Optimization&lt;/strong&gt;&lt;br&gt;
&lt;em&gt;Authors&lt;/em&gt;: Zhao-Rong Lai, Liming Tan, Xiaotian Wu, Liangda Fang&lt;br&gt;
&lt;em&gt;Description&lt;/em&gt;: Applies rank-one covariance estimation for loss control in short-term portfolio optimization.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;A Unified Framework of Online Learning Algorithms for Training Recurrent Neural Networks&lt;/strong&gt;&lt;br&gt;
&lt;em&gt;Authors&lt;/em&gt;: Owen Marschall, Kyunghyun Cho, Cristina Savin&lt;br&gt;
&lt;em&gt;Description&lt;/em&gt;: Proposes a unified framework of online learning algorithms for training recurrent neural networks.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Chaining Meets Chain Rule: Multilevel Entropic Regularization and Training of Neural Networks&lt;/strong&gt;&lt;br&gt;
&lt;em&gt;Authors&lt;/em&gt;: Amir R. Asadi, Emmanuel Abbe&lt;br&gt;
&lt;em&gt;Description&lt;/em&gt;: Introduces multilevel entropic regularization for neural network training using chaining and chain rule.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Nesterov&amp;rsquo;s Acceleration for Approximate Newton&lt;/strong&gt;&lt;br&gt;
&lt;em&gt;Authors&lt;/em&gt;: Haishan Ye, Luo Luo, Zhihua Zhang&lt;br&gt;
&lt;em&gt;Description&lt;/em&gt;: Applies Nesterov’s acceleration to approximate Newton methods for optimization.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;New Insights and Perspectives on the Natural Gradient Method&lt;/strong&gt;&lt;br&gt;
&lt;em&gt;Authors&lt;/em&gt;: James Martens&lt;br&gt;
&lt;em&gt;Description&lt;/em&gt;: Provides new insights into the natural gradient method for optimization.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Complete Dictionary Learning via L4-Norm Maximization over the Orthogonal Group&lt;/strong&gt;&lt;br&gt;
&lt;em&gt;Authors&lt;/em&gt;: Yuexiang Zhai, Zitong Yang, Zhenyu Liao, John Wright, Yi Ma&lt;br&gt;
&lt;em&gt;Description&lt;/em&gt;: Develops complete dictionary learning via L4-norm maximization over the orthogonal group.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Empirical Risk Minimization in the Non-Interactive Local Model of Differential Privacy&lt;/strong&gt;&lt;br&gt;
&lt;em&gt;Authors&lt;/em&gt;: Di Wang, Marco Gaboardi, Adam Smith, Jinhui Xu&lt;br&gt;
&lt;em&gt;Description&lt;/em&gt;: Studies empirical risk minimization in the non-interactive local model of differential privacy.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Stable Regression: On the Power of Optimization over Randomization&lt;/strong&gt;&lt;br&gt;
&lt;em&gt;Authors&lt;/em&gt;: Dimitris Bertsimas, Ivan Paskov&lt;br&gt;
&lt;em&gt;Description&lt;/em&gt;: Analyzes the power of optimization over randomization in stable regression.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Fast Exact Matrix Completion: A Unified Optimization Framework for Matrix Completion&lt;/strong&gt;&lt;br&gt;
&lt;em&gt;Authors&lt;/em&gt;: Dimitris Bertsimas, Michael Lingzhi Li&lt;br&gt;
&lt;em&gt;Description&lt;/em&gt;: Proposes a unified optimization framework for fast exact matrix completion.&lt;/p&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;p&gt;&lt;strong&gt;Rank-Based Lasso - Efficient Methods for High-Dimensional Robust Model Selection&lt;/strong&gt;&lt;br&gt;
&lt;em&gt;Authors&lt;/em&gt;: Wojciech Rejchel, Małgorzata Bogdan&lt;br&gt;
&lt;em&gt;Description&lt;/em&gt;: Develops rank-based Lasso methods for high-dimensional robust model selection.&lt;/p&gt;
&lt;/li&gt;
&lt;/ol&gt;</description></item></channel></rss>