MMODELYST
Literature

Papers

Showing 1–68 of 68 notable papers
PaperTopicAuthorsPublishedHF ▲
Domino: Decoupling Causal Modeling from Autoregressive Drafting in Speculative DecodingCodeJianuo Huang +5May 28, 2026143
Your UnEmbedding Matrix is Secretly a Feature Lens for Text EmbeddingsCodeSonghao Wu +5Jun 5, 202691
ProRL: Effective Reinforcement Learning for Proactive Recommendation via Rectified Policy Gradient EstimationCodeHongru Hou +7May 27, 202687
Code2LoRA: Hypernetwork-Generated Adapters for Code Language Models under Software EvolutionCodeLiliana Hotsko +3Jun 4, 202686
FlashMemory-DeepSeek-V4: Lightning Index Ultra-Long Context via Lookahead Sparse AttentionCodeYan Wang +12Jun 8, 202661
KVarN: Variance-Normalized KV-Cache Quantization Mitigates Error Accumulation in Reasoning TasksCodeLorenz K. Muller +5Jun 2, 202659
TRL-Bench: Standardizing Cross-Paradigm Representation-Level Evaluation of Tabular EncodersCodeWei Pang +12Jun 8, 202650
How LoRA Remembers? A Parametric Memory Law for LLM FinetuningCodeZiwen Xu +6May 28, 202642
ThriftAttention: Selective Mixed Precision for Long-Context FP4 AttentionCodeJoe SharrattMay 21, 202641
MemTrace: Tracing and Attributing Errors in Large Language Model Memory SystemsCodeXinle Deng +17May 27, 202639
Draft-OPD: On-Policy Distillation for Speculative Draft ModelsCodeHaodi Lei +10May 28, 202633
VIA-SD: Verification via Intra-Model Routing for Speculative DecodingCodeYuchen Xian +3Jun 10, 202632
A Local Perturbation Theory for Cross-Domain Interference and Recovery in Multi-Domain RLCodeLei Yang +2Jun 1, 202627
Reinforcement Learning Elicits Contextual Learning of Unseen Language TranslationCodeHanxu Hu +4Jun 4, 202625
Self-Distilled Policy GradientCodeYifeng Liu +3Jun 2, 202624
When Should Models Change Their Minds? Contextual Belief Management in Large Language ModelsCodeHaoming Xu +8May 28, 202624
On Subquadratic Architectures: From Applications to PrinciplesCodeAnamaria-Roberta Hartl +8Jun 10, 202623
Long Live The Balance: Information Bottleneck Driven Tree-based Policy OptimizationCodeHao Jiang +9May 27, 202623
MIRA: Mid-training Rubric Anchoring for Source-Aware Data SelectionCodeHaowen Wang +10May 28, 202622
Xetrieval: Mechanistically Explaining Dense RetrievalCodeZhixin Cai +9May 28, 202621
Domain-Specific Data Synthesis for LLMs via Minimal Sufficient Representation LearningCodeTong Ye +8May 28, 202618
Grammar-Constrained Decoding Can Jailbreak LLMs into Generating Malicious CodeCodeYitong Zhang +2Jun 10, 202617
Filter, Then Reweight: Rethinking Optimization Granularity in On-Policy DistillationCodeYuying Li +8Jun 1, 202616
HRBench: Benchmarking and Understanding Thinking-Mode Switch Strategies in Hybrid-Reasoning LLMsCodeYansong Ning +4May 27, 202615
DEI: Diversity in Evolutionary Inference for Quality-Diversity SearchCodeJohn Donaghy +1May 26, 202614
CubePart: An Open-Vocabulary Part-Controllable 3D GeneratorCodeYiheng Zhu +11May 27, 202614
Towards Evaluation Engineering: An Empirical Study of ML Evaluation Harnesses in the WildCodeZhimin Zhao +4May 22, 202614
Lost in Sampling: Assessing Lexical Reachability in LLMs via the Word Coverage Score (WCS)CodeSamer Awad +5May 26, 202613
Confidence and Calibration of Activation Oracles for Reliable Interpretation of Language Model InternalsCodeFederico Torrielli +2May 25, 202612
Speculative Pipeline Decoding: Higher-Accruacy and Zero-Bubble Speculation via Pipeline ParallelismCodeYijiong Yu +4May 29, 202610
Verifiable Rewards Beyond Math and Code: Lightweight Corpus-Grounded Process Supervision for Factual Question AnsweringCodeShicheng Fan +5May 28, 202610
Thinking Before Constraining: A Unified Decoding Framework for Large Language ModelsCodeNgoc Trinh Hung Nguyen +5May 28, 202610
OPRD: On-Policy Representation DistillationCodeShenzhi Yang +9Jun 4, 20269
Reasoning Arena: Trace Tournaments When Verifiable Rewards Fall ShortCodeHan Zhou +4Jun 8, 20268
Answer Presence Drives RAG Rewriting GainsCodeYuejie Li +10Jun 4, 20268
Measuring the Depth of LLM Unlearning via Activation PatchingCodeJaeung Lee +2May 23, 20268
Models That Know How Evaluations Are Designed Score SaferCodeKatharina Deckenbach +3May 27, 20268
ZeroUnlearn: Few-Shot Knowledge Unlearning in Large Language ModelsCodeYujie Lin +4May 16, 20268
Trajectory-Refined DistillationCodeLi Jiang +3Jun 7, 20267
Latent Reasoning with Normalizing FlowsCodeGuancheng Tu +7Jun 4, 20267
LongAttnComp: Cross-Family Context Compression for Long-Context ReasoningCodeMengmeng Ji +3May 31, 20267
MAAT: Multi-phase Adapter-Aware Targeted UnlearningCodeSuryash Yagnik +5May 28, 20267
AlphaTransit: Learning to Design City-scale Transit RoutesCodeBibek Poudel +2May 27, 20267
Multi-view Consistent 3D Gaussian Head Avatars 'without' Multi-view GenerationCodeAviral Chharia +1May 24, 20267
DRIFT: Decoupled Rollouts and Importance-Weighted Fine-Tuning for Efficient Multi-Turn OptimizationCodeJian Mu +4May 29, 20266
The Shape of Addition: Geometric Structures of Arithmetic in Large Language ModelsCodeLiuyuan Wen +4May 29, 20265
MeshWeaver: Sparse-Voxel-Guided Surface Weaving for Autoregressive Mesh GenerationCodeJiale Xu +2Jun 3, 20265
BenchEvolver: Frontier Task Synthesis via Solution-Centric EvolutionCodeYangzhen Wu +12May 31, 20265
Efficient and Scalable Provenance Tracking for LLM-Generated Code SnippetsCodeAndrea Gurioli +4May 27, 20265
Understanding Data Temporality Impact on Large Language Models Pre-trainingCodeHippolyte Pilchen +4May 21, 20265
U-TTT: Towards Generalizable PET Image Denoising via Test-Time TrainingCodeZhiwen Yang +6Jun 9, 20264
IR3DE: A Linear Router for Large Language ModelsCodeEros Fanì +1Jun 4, 20264
Unified Panoramic Geometry Estimation via Multi-View Foundation ModelsCodeVukasin Bozic +6May 25, 20264
NSF-SciFy: Mining the NSF Awards Database for Scientific ClaimsCodeDelip Rao +3May 25, 20264
When is Your LLM Steerable?CodeChenrui Fan +4Jun 10, 20263
SparDA: Sparse Decoupled Attention for Efficient Long-Context LLM InferenceCodeYaosheng Fu +4Jun 3, 20263
Reinforcement Learning from Rich Feedback with Distributional DAggerCodeRishabh Agrawal +2Jun 3, 20263
Surflo: Consistent 3D Surface Flow Model with Global StateCodeAntoine Guédon +5Jun 11, 20262
Adaptive Multi-Resolution Procedural Knowledge Compression for Large Language ModelsCodeChangyue Wang +7Jun 10, 20262
Combinatorial Synthesis: Scaling Code RLVR via Atomic Decomposition and RecombinationCodeJiasheng Zheng +8May 29, 20262
BA-T: An Iterative Transformer for Two-View Bundle AdjustmentCodeGanlin Zhang +3Jun 2, 20262
Confidence-Adaptive SwiGLU for Mixture-of-ExpertsCodeShaohua Li +6May 30, 20262
A Stationary (and Therefore Compatible) Representation is All You NeedCodeNiccolò Biondi +3Jun 10, 20261
Empirical Study on the Characteristics and Evolution of AI-usage in GitHub Repositories: Evidence from Code CommentsCodeAbdullah Al Mujahid +2Jun 5, 20261
Measuring Model Robustness via Fisher Information: Spectral Bounds, Theoretical Guarantees, and Practical AlgorithmsCodeChong Zhang +4Jun 3, 20261
The Distillation Game: Adaptive Attacks & Efficient DefensesCodeYoussef Allouah +3May 21, 20261
Review Arcade: On the Human Alignment and Gameability of LLM ReviewsCodeHans Ole Hatzel +2May 27, 20261
BatteryMFormer: Multi-level Learning for Battery Degradation Trajectory ForecastingCodeRuifeng Tan +5May 26, 20261