PaperPulse - AI/ML Summarization Platform

ArXiv, Feb 19, 2026

Optimal Unconstrained Self-Distillation in Ridge Regression: Strict Improvements, Precise Asymptotics, and One-Shot Tuning

Hien Dang, Pratik Patil et al.

TLDR: This paper demonstrates that self-distillation can significantly improve ridge regression performance by optimally mixing teacher predictions, providing precise asymptotic analyses and a practical one-shot tuning method.
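
For readers unfamiliar with the idea, the following is a minimal sketch of one common form of self-distillation in ridge regression, not the paper's exact procedure. It assumes the student is a ridge fit on a mix of the true labels and the teacher's predictions. The mixing weight `xi`, the helper names, and the `1/n` scaling are illustrative assumptions; "unconstrained" is read here as allowing `xi` outside `[0, 1]`.

```python
import numpy as np

def ridge_fit(X, y, lam):
    """Ridge coefficients: (X^T X / n + lam * I)^{-1} X^T y / n."""
    n, p = X.shape
    return np.linalg.solve(X.T @ X / n + lam * np.eye(p), X.T @ y / n)

def self_distilled_ridge(X, y, lam, xi):
    """Refit ridge on targets mixed with the teacher's predictions.

    xi is a hypothetical mixing weight; it may be any real number,
    including values outside [0, 1].
    """
    teacher = ridge_fit(X, y, lam)               # step 1: ordinary ridge
    mixed = (1.0 - xi) * y + xi * (X @ teacher)  # step 2: mix labels and predictions
    return ridge_fit(X, mixed, lam)              # step 3: student refit
```

Setting `xi = 0` recovers the plain ridge teacher. In this sketch `xi` would be chosen on held-out data; the one-shot tuning method mentioned in the summary is not reproduced here.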

ArXiv, Feb 19, 2026

genriesz: A Python Package for Automatic Debiased Machine Learning with Generalized Riesz Regression

Masahiro Kato

TLDR: genriesz is a Python package that automates debiased machine learning for estimating causal and structural parameters using generalized Riesz regression.

ArXiv, Feb 19, 2026

From Labor to Collaboration: A Methodological Experiment Using AI Agents to Augment Research Perspectives in Taiwan's Humanities and Social Sciences

Yi-Chih Huang

TLDR: This study proposes a collaborative AI workflow for humanities and social sciences research and uses Claude.ai data from Taiwan to validate its feasibility and effectiveness.

ArXiv, Feb 19, 2026

Asymptotic Smoothing of the Lipschitz Loss Landscape in Overparameterized One-Hidden-Layer ReLU Networks

Saveliy Baturin

TLDR: This paper shows that in overparameterized one-hidden-layer ReLU networks, the loss landscape becomes smoother and flatter as the network width increases, resulting in smaller energy gaps between local and global minima.

ArXiv, Feb 19, 2026

IRIS: Learning-Driven Task-Specific Cinema Robot Arm for Visuomotor Motion Control

Qilong Cheng, Matthew Mackay et al.

TLDR: IRIS is a cost-effective, learning-driven robotic camera system for cinematic motion control, using imitation learning to achieve smooth and repeatable camera movements.

ArXiv, Feb 19, 2026

Transforming Behavioral Neuroscience Discovery with In-Context Learning and AI-Enhanced Tensor Methods

Paimon Goulart, Jordan Steinhauser et al.

TLDR: This paper presents an AI-enhanced pipeline that combines In-Context Learning and tensor methods to improve data analysis in behavioral neuroscience. It is applied to studying fear generalization in mice, which may help in understanding PTSD.

ArXiv, Feb 19, 2026

LORA-CRAFT: Cross-layer Rank Adaptation via Frozen Tucker Decomposition of Pre-trained Attention Weights

Kasun Dewage, Marianna Pensky et al.

TLDR: CRAFT is a parameter-efficient fine-tuning method using Tucker decomposition on pre-trained attention weights, achieving competitive performance with minimal adaptation parameters.

ArXiv, Feb 19, 2026

Instructor-Aligned Knowledge Graphs for Personalized Learning

Abdulrahman AlRabah, Priyanka Kargupta et al.

TLDR: InstructKG is a framework that automatically constructs knowledge graphs from course materials to capture learning dependencies and aid personalized learning.

ArXiv, Feb 19, 2026

Universal Fine-Grained Symmetry Inference and Enforcement for Rigorous Crystal Structure Prediction

Shi Yin, Jinming Mu et al.

TLDR: This paper presents a novel approach to crystal structure prediction using large language models and constrained optimization to improve symmetry inference and enforce physical validity, achieving state-of-the-art results without relying on existing databases.

ArXiv, Feb 19, 2026

Catastrophic Forgetting Resilient One-Shot Incremental Federated Learning

Obaidullah Zaland, Zulfiqar Ahmad Khan et al.

TLDR: This paper introduces One-Shot Incremental Federated Learning (OSI-FL), a framework that addresses communication overhead and catastrophic forgetting in federated learning by using category-specific embeddings and selective sample retention.

ArXiv, Feb 19, 2026

Sign Lock-In: Randomly Initialized Weight Signs Persist and Bottleneck Sub-Bit Model Compression

Akira Sakai, Yuma Ichikawa

TLDR: The paper identifies that weight sign persistence is a bottleneck in sub-bit model compression and proposes methods to reduce sign flips while maintaining performance.

ArXiv, Feb 19, 2026

Deep Reinforcement Learning for Optimal Portfolio Allocation: A Comparative Study with Mean-Variance Optimization

Srijan Sood, Kassiani Papasotiriou et al.

TLDR: This study compares Deep Reinforcement Learning (DRL) and Mean-Variance Optimization (MVO) for portfolio allocation, showing DRL's strong performance across various financial metrics.

ArXiv, Feb 19, 2026

Retaining Suboptimal Actions to Follow Shifting Optima in Multi-Agent Reinforcement Learning

Yonghyeon Jo, Sunwoo Lee et al.

TLDR: The paper introduces Successive Sub-value Q-learning (S2Q), a method that improves adaptability in multi-agent reinforcement learning by retaining multiple high-value actions, outperforming existing algorithms.

ArXiv, Feb 19, 2026

Phase-Aware Mixture of Experts for Agentic Reinforcement Learning

Shengtian Yang, Yu Li et al.

TLDR: The paper introduces Phase-Aware Mixture of Experts (PA-MoE), which enhances reinforcement learning by letting experts specialize on complex tasks without being dominated by simpler ones.

ArXiv, Feb 19, 2026

MASPO: Unifying Gradient Utilization, Probability Mass, and Signal Reliability for Robust and Sample-Efficient LLM Reasoning

Xiaoliang Fu, Jiaye Lin et al.

TLDR: MASPO is a new framework that overcomes limitations in existing RLVR algorithms for large language models by optimizing gradient use, probability mass, and signal reliability, achieving better performance than current methods.

ArXiv, Feb 19, 2026

The Anxiety of Influence: Bloom Filters in Transformer Attention Heads

Peter Balogh

TLDR: Certain transformer attention heads in language models act as membership testers, identifying repeated tokens with high precision, similar to Bloom filters.
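
As background for the analogy, here is a minimal sketch of a classic Bloom filter used to flag repeated tokens in a sequence. It illustrates the data structure only and is not a model of how the attention heads in the paper compute anything. The class, the function `repeated_token_flags`, and the default sizes are illustrative assumptions.

```python
import hashlib

class BloomFilter:
    """Approximate set membership: no false negatives, rare false positives."""

    def __init__(self, num_bits=1024, num_hashes=3):
        self.num_bits = num_bits
        self.num_hashes = num_hashes
        self.bits = bytearray(num_bits)  # one byte per bit, for simplicity

    def _positions(self, item):
        # Derive num_hashes bit positions by salting a single hash function.
        for seed in range(self.num_hashes):
            digest = hashlib.sha256(f"{seed}:{item}".encode("utf-8")).digest()
            yield int.from_bytes(digest[:8], "big") % self.num_bits

    def add(self, item):
        for pos in self._positions(item):
            self.bits[pos] = 1

    def __contains__(self, item):
        return all(self.bits[pos] for pos in self._positions(item))

def repeated_token_flags(tokens, **kwargs):
    """For each token, report whether the filter says it appeared earlier."""
    seen = BloomFilter(**kwargs)
    flags = []
    for tok in tokens:
        flags.append(tok in seen)  # query before inserting the current token
        seen.add(tok)
    return flags
```

For the sequence `["the", "cat", "saw", "the", "cat"]`, the last two tokens are always flagged as repeats, since a Bloom filter never misses an item that was inserted. A first-time token can occasionally be flagged by mistake when its bit positions collide with earlier ones.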

ArXiv, Feb 19, 2026

Fine-Grained Uncertainty Quantification for Long-Form Language Model Outputs: A Comparative Study

Dylan Bouchard, Mohit Singh Chauhan et al.

TLDR: This study introduces a taxonomy for fine-grained uncertainty quantification in long-form language model outputs, revealing that claim-level scoring and uncertainty-aware decoding improve factuality in generated content.

ArXiv, Feb 19, 2026

KLong: Training LLM Agent for Extremely Long-horizon Tasks

Yue Liu, Zhiyuan Hu et al.

TLDR: KLong is a new LLM agent designed to tackle long-horizon tasks using a novel training method combining trajectory-splitting SFT and progressive RL, outperforming existing models on various benchmarks.

ArXiv, Feb 19, 2026

Deeper detection limits in astronomical imaging using self-supervised spatiotemporal denoising

Yuduo Guo, Hao Zhang et al.

TLDR: ASTERIS, a self-supervised denoising algorithm, leverages spatiotemporal data to deepen the detection limits of astronomical imaging by 1 magnitude and to identify previously undetectable features in deep space images.

ArXiv, Feb 19, 2026

Adaptive Decentralized Composite Optimization via Three-Operator Splitting

Xiaokai Chen, Ilya Kuruzov et al.

TLDR: The paper introduces an adaptive decentralized optimization method using three-operator splitting and local stepsize adjustments, achieving robust convergence for convex and strongly convex problems.
