Papers similar to 2606.01628

Similar to 2606.01628 · 20 results

cond-mat.mtrl-sci · cs.ET · cs.LG · Recent · Jun 1, 2026

Towards Automated Discovery: A Review of Generative Models, Multimodal Learning and Closed-Loop Workflows in Inverse Materials Design

Anand Babu, Rogério Almeida Gouvêa, Gian-Marco Rignanese

This review surveys advanced techniques—including generative models, multimodal learning, and closed-loop workflows—for automated inverse materials design, enabling the targeted discovery of novel cry…

cs.LG · cs.AI · Recent · May 31, 2026

Fine-Tuning Diffusion Models for Molecular Generation via Reinforcement Learning and Fast Sampling

Guang Lin, Shikui Tu, Lei Xu

The paper introduces FTDiff, a reinforcement learning fine-tuning framework that efficiently generates high-quality, drug-like molecules constrained by a target protein structure, outperforming existi…

cond-mat.mtrl-sci · cs.AI · cs.LG · Recent · May 28, 2026

What drives performance in molecular MPNNs? An operator-level factorial benchmark

Panyu Jiao, Shuizhou Chen, Yiheng Shen, Yuyang Wang +2 more

The paper introduces an operator-level factorial benchmark for molecular MPNNs, finding that message construction (specifically concatenation-based mixing) is the primary determinant of performance, r…

cs.AI · Recent · May 27, 2026

MolLingo: Molecule-Native Representations for LLM-Powered Scientific Agents

Thao Nguyen, Heng Ji

MolLingo is a multi-agent system that significantly improves automated molecular design by integrating domain-specific chemical reasoning and structural context into LLMs, outperforming state-of-the-a…

cs.LG · cs.AI · Recent · Jun 1, 2026

Learning Implicit Bias in Generative Spaces for Accelerating Protein Dynamics Emulation

Kaihui Cheng, Zhiqiang Cai, Wenkai Xiang, Zhihang Hu +3 more

The paper introduces a history-dependent bias to generative protein emulators, significantly improving the exploration of rare and diverse protein states compared to standard emulators.

q-bio.BM · cs.AI · Recent · May 29, 2026

AMix-2: Establishing Protein as a Native Modality in Large Language Models

Keyue Qiu, Yixin Wu, Lihao Wang, Yawen Ouyang +18 more

The paper introduces AMix-2, a novel protein-text foundation model that unifies protein understanding and sequence design by embedding both modalities in a shared token space, achieving state-of-the-a…

cs.LG · cs.AI · q-bio.QM · Recent · May 27, 2026

From Detection to Mechanism: Cross-Attention Graph Neural Networks Enable Drug-Drug Interaction Type Prediction: An Ablation Study with Acetylsalicylic Acid Validation

Juergen Dietrich

The paper introduces a cross-attention Graph Neural Network (CrossAtt) that significantly improves the prediction of drug-drug interaction (DDI) mechanism types, demonstrating that explicit modeling o…

cs.LG · cs.AI · Recent · Jun 1, 2026

Variational Learning for Insertion-based Generation

Yangtian Zhang, Zhe Wang, Arthur Gretton, Rex Ying +3 more

The paper introduces the Insertion Process (IP), a novel stochastic generative model that learns variable-length, non-monotonic sequence generation by explicitly modeling the insertion order of tokens…

cs.AI · Recent · Jun 1, 2026

Structure-Guided Adaptive Propagation for Protein-Protein Interaction Site Prediction

Enqiang Zhu, Yizi Liu, Yilong Luo, Yao Chen +2 more

The paper introduces SGAP-PPIS, a structure-guided adaptive propagation model that improves protein-protein interaction site prediction by allowing information diffusion to adapt based on a residue's…

cs.AI · Recent · May 28, 2026

EvoMD-LLM: Learning the Language of Species Evolution in Reactive Molecular Dynamics

Zhichen Tang, Zhengzheng Dang, Yulin Chen, Jixin Wu +2 more

EvoMD-LLM introduces a novel framework that models reactive molecular dynamics as a symbolic temporal language problem, enabling LLMs to accurately predict complex, time-evolving chemical processes.

cs.LG · cond-mat.mtrl-sci · physics.chem-ph · Recent · Jun 1, 2026

Speculative Sampling For Faster Molecular Dynamics

Arthur Kosmala, Stephan Günnemann, Meng Gao, Brandon Wood

The paper introduces Langevin Speculative Dynamics (LSD), a speculative sampling method that accelerates molecular dynamics simulations by using a fast draft model to propose steps, achieving signific…

cs.AI · q-bio.BM · Recent · May 30, 2026

Probe Before You Edit: Probing-Guided Molecular Optimization for LLM Agents in Structure-Based Drug Design

Zaifei Yang, Weiyu Chen, Yaqing Wang, James Kwok

The paper introduces PROBE, an optimization framework that guides LLM agents in structure-based drug design by performing controlled 'probe edits' to assess how molecular changes affect both binding a…

cs.AI · q-bio.QM · Recent · Jun 1, 2026

AgentPLM: Agentic Protein Language Models with Reasoning-Augmented Decoding for Protein Sequence Design

Sahil Rahman, Maxx Richard Rahman

AgentPLM introduces a novel framework that enhances protein language models by integrating external biophysical tools and a specialized policy optimization, enabling active, reasoning-based protein se…

cs.AI · cond-mat.mtrl-sci · Recent · May 29, 2026

Coupling Language Models with Physics-based Simulation for Synthesis of Inorganic Materials

Edward W. Staley, Tom Arbaugh, Michael Pekala, Alexander New +5 more

The paper proposes a novel hybrid framework that couples Large Language Models (LLMs) with simplified physics-based simulations to improve the synthesis planning of novel inorganic crystalline materia…

cs.LG · cs.AI · Recent · Jun 1, 2026

TN-SHAP-G: Graph-Structured Tensor Network Surrogates for Shapley Values and Interactions

Farzaneh Heidari, Guillaume Rabusseau

The paper introduces TN-SHAP-G, a novel framework that uses graph-structured tensor networks to efficiently approximate and compute Shapley values and interaction indices for black-box models, overcom…

cs.LG · cs.AI · Recent · May 27, 2026

Learning Compositional Latent Structure with Vector Networks

Niclas Pokel, Benjamin F. Grewe

The paper introduces the Vector Network (VN), a novel recurrent architecture that replaces fixed weight matrices with reusable weight atoms, enabling superior compositional generalization by making st…

cs.LG · Recent · Jun 3, 2026

BBOmix: A Tabular Benchmark for Hyperparameter Optimization of Unsupervised Biological Representation Learning

Luca Thale-Bombien, Jan Ewald, Ralf König, Aaron Klein

This paper introduces BBOmix, an open-source benchmark for unsupervised representation learning on real-world biological data.

cond-mat.mtrl-sci · cs.CE · cs.CL · Recent · May 29, 2026

A Padding Method for Enhanced Encoding of Inorganic Structures with Varying Chemical Compositions

Thang Dang, Haderbache Amir, Tzanakakis Alexandros, Yoshimoto Yuta

The paper introduces a novel padding method that leverages crystal symmetry to enhance the encoding of complex inorganic structures, significantly improving the generation of stable, novel materials.

cs.LG · math.ST · stat.ME · Recent · Jun 1, 2026

Network Learning with Semi-relaxed Gromov-Wasserstein

Charles Dufour, Ulysse Naepels, Leonardo V. Santoro

The paper proposes a semi-relaxed Gromov-Wasserstein objective to estimate the latent connectivity structure of large-scale networks, achieving statistically consistent and efficient recovery of the u…
