Posts by Collection

portfolio

Portfolio item number 1

Short description of portfolio item number 1

Portfolio item number 2

Short description of portfolio item number 2

publications

Paper Title Number 1

Published in Journal 1, 2009

This paper is about the number 1. The number 2 is left for future work.

Recommended citation: Your Name, You. (2009). "Paper Title Number 1." Journal 1. 1(1). http://academicpages.github.io/files/paper1.pdf

Paper Title Number 2

Published in Journal 1, 2010

This paper is about the number 2. The number 3 is left for future work.

Recommended citation: Your Name, You. (2010). "Paper Title Number 2." Journal 1. 1(2). http://academicpages.github.io/files/paper2.pdf

Paper Title Number 3

Published in Journal 1, 2015

This paper is about the number 3. The number 4 is left for future work.

Recommended citation: Your Name, You. (2015). "Paper Title Number 3." Journal 1. 1(3). http://academicpages.github.io/files/paper3.pdf

LM-Searcher: Cross-domain Neural Architecture Search with LLMs via Unified Numerical Encoding

Published in EMNLP 2025, 2025

We propose LM-Searcher, a novel approach for cross-domain neural architecture search using LLMs via unified numerical encoding.

Recommended citation: Yuxuan Hu, Jihao Liu, Ke Wang, Jinliang Zhen, Weikang Shi, Manyuan Zhang, Qi Dou, Rui Liu, Aojun Zhou, Hongsheng Li (2025). "LM-Searcher: Cross-domain Neural Architecture Search with LLMs via Unified Numerical Encoding." EMNLP 2025. https://arxiv.org/abs/2509.05657

Mmsearch-plus: Benchmarking Provenance-aware Search for Multimodal Browsing Agents

Published in ICLR 2026, 2026

We introduce Mmsearch-plus, a benchmark for provenance-aware search in multimodal browsing agents.

Recommended citation: Xijia Tao, Yihua Teng, Xinxing Su, Xinyu Fu, Jihao Wu, Chaofan Tao, Ziru Liu, Haoli Bai, Rui Liu, Lingpeng Kong (2026). "Mmsearch-plus: Benchmarking Provenance-aware Search for Multimodal Browsing Agents." ICLR 2026. https://arxiv.org/abs/2508.21475

Pusa v1.0: Surpassing Wan-I2V with $500 Training Cost by Vectorized Timestep Adaptation

Published in ICLR 2026, 2026

We present Pusa v1.0, an Image-to-Video model that surpasses Wan-I2V with only $500 training cost using vectorized timestep adaptation.

Recommended citation: Yaofang Liu, Yumeng Ren, Aitor Artola, Yuxuan Hu, Xiaodong Cun, Xiaotong Zhao, Alan Zhao, Raymond H. Chan, Suiyun Zhang, Rui Liu, Dandan Tu, Jean-Michel Morel (2026). "Pusa v1.0: Surpassing Wan-I2V with $500 Training Cost by Vectorized Timestep Adaptation." ICLR 2026. https://arxiv.org/abs/2507.16116

MathCanvas: Intrinsic Visual Chain-of-Thought for Multimodal Mathematical Reasoning

Published in The 64th Annual Meeting of the Association for Computational Linguistics (ACL 2026), 2026

We present MathCanvas, the first unified multi-model that enables thinking while drawing auxiliary lines for multimodal mathematical reasoning.

Recommended citation: Weikang Shi, Aldrich Yu, Rongyao Fang, Houxing Ren, Ke Wang, Aojun Zhou, Changyao Tian, Xinyu Fu, Yuxuan Hu, Zimu Lu, Linjiang Huang, Si Liu, Rui Liu†, Hongsheng Li (2026). "MathCanvas: Intrinsic Visual Chain-of-Thought for Multimodal Mathematical Reasoning." ACL 2026. https://mathcanvas.github.io/

Beyond Confidence: Adaptive and Coherent Decoding for Diffusion Language Models

Published in ICML 2026, 2026

We present Beyond Confidence, a novel adaptive and coherent decoding method for diffusion language models.

Recommended citation: Kecheng Chen, Ziru Liu, Xijia Tao, Hui Liu, Xinyu Fu, Suiyun Zhang, Dandan Tu, Lingpeng Kong, Rui Liu, Haoliang Li (2026). "Beyond Confidence: Adaptive and Coherent Decoding for Diffusion Language Models." ICML 2026.

PhoStream: Benchmarking Real-World Streaming for Omnimodal Assistants in Mobile Scenarios

Published in ICML 2026, 2026

We present PhoStream, a benchmark for real-world streaming evaluation of omnimodal assistants in mobile scenarios.

Recommended citation: Xudong Lu, Huankang Guan, Yang Bo, Jinpeng Chen, Xintong Guo, Shuhan Li, Fang Liu, Peiwen Sun, Xueying Li, Wei Zhang, Xue Yang, Rui Liu, Hongsheng Li (2026). "PhoStream: Benchmarking Real-World Streaming for Omnimodal Assistants in Mobile Scenarios." ICML 2026.

SpaceVista: All-Scale Visual Spatial Reasoning from mm to km

Published in ICML 2026, 2026

We present SpaceVista, a unified framework for all-scale visual spatial reasoning from millimeter to kilometer scales.

Recommended citation: Peiwen Sun, Shiqiang Lang, Dongming Wu, Yi Ding, Kaituo Feng, Huadai Liu, Zhen Ye, Rui Liu, Yun-Hui Liu, Jianan Wang, Xiangyu Yue (2026). "SpaceVista: All-Scale Visual Spatial Reasoning from mm to km." ICML 2026.

teaching

Teaching experience 1

Undergraduate course, University 1, Department, 2014

This is a description of a teaching experience. You can use markdown like any other post.

Teaching experience 2

Workshop, University 1, Department, 2015

This is a description of a teaching experience. You can use markdown like any other post.

Rui Liu

Posts by Collection

portfolio

Portfolio item number 1

Portfolio item number 2

publications

Paper Title Number 1

Paper Title Number 2

Paper Title Number 3

LM-Searcher: Cross-domain Neural Architecture Search with LLMs via Unified Numerical Encoding

Mmsearch-plus: Benchmarking Provenance-aware Search for Multimodal Browsing Agents

Pusa v1.0: Surpassing Wan-I2V with $500 Training Cost by Vectorized Timestep Adaptation

MathCanvas: Intrinsic Visual Chain-of-Thought for Multimodal Mathematical Reasoning

Beyond Confidence: Adaptive and Coherent Decoding for Diffusion Language Models

PhoStream: Benchmarking Real-World Streaming for Omnimodal Assistants in Mobile Scenarios

SpaceVista: All-Scale Visual Spatial Reasoning from mm to km

talks

Talk 1 on Relevant Topic in Your Field

Tutorial 1 on Relevant Topic in Your Field

Talk 2 on Relevant Topic in Your Field

Conference Proceeding talk 3 on Relevant Topic in Your Field

teaching

Teaching experience 1

Teaching experience 2