publications

publications by categories in reversed chronological order. generated by jekyll-scholar.

2026

arXiv

Test-time Recursive Thinking: Self-Improvement without External Feedback

Yufan Zhuang, Chandan Singh, Liyuan Liu, and 5 more authors

arXiv, 2026

PDF

2025

NeurIPS

Text Generation Beyond Discrete Token Sampling

Yufan Zhuang, Liyuan Liu, Chandan Singh, and 2 more authors

Neural Information Processing Systems, 2025

PDF
ACL

Self-Taught Agentic Long Context Understanding

Yufan Zhuang, Xiaodong Yu, Jialian Wu, and 7 more authors

Annual Meeting of the Association for Computational Linguistics, 2025

PDF
ICLR

Vector-ICL: In-context Learning with Continuous Vector Representations

Yufan Zhuang, Chandan Singh, Liyuan Liu, and 2 more authors

International Conference on Learning Representations, 2025

PDF

2024

EMNLP

Data Contamination Can Cross Language Barriers

Feng Yao^*, Yufan Zhuang^*, Zihao Sun, and 3 more authors

Empirical Methods in Natural Language Processing, 2024

PDF
TMLR

Learning a Decision Tree Algorithm with Transformers

Yufan Zhuang, Liyuan Liu, Chandan Singh, and 2 more authors

Transactions on Machine Learning Research, 2024

PDF

2023

UniReps@NeurIPS

WavSpA: Wavelet Space Attention for Boosting Transformers’ Long Sequence Learning Ability

Yufan Zhuang, Zihan Wang, Fangbo Tao, and 1 more author

NeurIPS 1st UniReps Workshop, 2023

PDF
TOSEM

Incorporating Signal Awareness in Source Code Modeling: an Application to Vulnerability Detection

Sahil Suneja, Yufan Zhuang, Yunhui Zheng, and 3 more authors

ACM Transactions on Software Engineering and Methodology, 2023

PDF
EuroS&P

Code Vulnerability Detection via Signal-Aware Learning

Sahil Suneja, Yufan Zhuang, Yunhui Zheng, and 3 more authors

In 2023 IEEE 8th European Symposium on Security and Privacy (EuroS&P), 2023
Patent

Artificial intelligence model learning introspection

Sahil Suneja, Yufan Zhuang, Yunhui Zheng, and 2 more authors

2023
Patent

Training data augmentation via program simplification

Sahil Suneja, Yufan Zhuang, Yunhui Zheng, and 2 more authors

2023
Patent

Complexity based artificial intelligence model training

Sahil Suneja, Yufan Zhuang, Yunhui Zheng, and 2 more authors

2023

2022

Patent

Probing Model Signal Awareness

Yunhui Zheng, Sahil Suneja, Yufan Zhuang, and 2 more authors

2022
AAG

Sleeping Lion or Sick Man? Machine Learning Approaches to Deciphering Heterogeneous Images of Chinese in North America

Qiang Fu, Yufan Zhuang, Yushu Zhu, and 1 more author

Annals of the American Association of Geographers, 2022

PDF

2021

arXiv

Data-Driven AI Model Signal-Awareness Enhancement and Introspection

Sahil Suneja, Yufan Zhuang, Yunhui Zheng, and 2 more authors

arXiv preprint arXiv:2111.05827, 2021
arXiv

Software Vulnerability Detection via Deep Learning over Disaggregated Code Graph Representation

Yufan Zhuang, Sahil Suneja, Veronika Thost, and 3 more authors

arXiv preprint arXiv:2109.03341, 2021

PDF
SOCC Vision

Towards Reliable AI for Source Code Understanding

Sahil Suneja, Yunhui Zheng, Yufan Zhuang, and 2 more authors

ACM Symposium on Cloud Computing, 2021
FSE

Probing model signal-awareness via prediction-preserving input minimization

Sahil Suneja^*, Yunhui Zheng^*, Yufan Zhuang^*, and 2 more authors

ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software Engineering, 2021

PDF
BDR

Agreeing to disagree: choosing among eight topic-modeling methods

Qiang Fu, Yufan Zhuang, Jiaxin Gu, and 2 more authors

Big Data Research, 2021

PDF

2020

arXiv

Exploring software naturalness through neural language models

Luca Buratti, Saurabh Pujar, Mihaela Bornea, and 8 more authors

arXiv preprint arXiv:2006.12641, 2020

PDF
arXiv

Learning to map source code to software vulnerability using code-as-a-graph

Sahil Suneja, Yunhui Zheng, Yufan Zhuang, and 2 more authors

arXiv preprint arXiv:2006.08614, 2020

PDF

2019

BPOD@BigData

Search for K: assessing five topic-modeling approaches to 120,000 Canadian articles

Qiang Fu, Yufan Zhuang, Jiaxin Gu, and 3 more authors

In BPOD Workshop at 2019 IEEE International Conference on Big Data (Big Data), 2019

PDF