publications

publications by category in reverse chronological order. generated by jekyll-scholar.

2024

  1. arXiv
    Vector-ICL: In-context Learning with Continuous Vector Representations
    Yufan Zhuang, Chandan Singh, Liyuan Liu, Jingbo Shang, and Jianfeng Gao
    arXiv preprint arXiv:2410.05629, 2024
  2. EMNLP
    Data Contamination Can Cross Language Barriers
    Feng Yao*, Yufan Zhuang*, Zihao Sun, Sunan Xu, Animesh Kumar, and Jingbo Shang
    Empirical Methods in Natural Language Processing, 2024
  3. TMLR
    Learning a Decision Tree Algorithm with Transformers
    Yufan Zhuang, Liyuan Liu, Chandan Singh, Jingbo Shang, and Jianfeng Gao
    Transactions on Machine Learning Research, 2024

2023

  1. NeurIPS
    WavSpA: Wavelet Space Attention for Boosting Transformers’ Long Sequence Learning Ability
    Yufan Zhuang, Zihan Wang, Fangbo Tao, and Jingbo Shang
    NeurIPS 1st UniReps Workshop, 2023
  2. TOSEM
    Incorporating Signal Awareness in Source Code Modeling: an Application to Vulnerability Detection
    Sahil Suneja, Yufan Zhuang, Yunhui Zheng, Jim Laredo, Alessandro Morari, and Udayan Khurana
    ACM Transactions on Software Engineering and Methodology, 2023
  3. EuroS&P
    Code Vulnerability Detection via Signal-Aware Learning
    Sahil Suneja, Yufan Zhuang, Yunhui Zheng, Jim Laredo, Alessandro Morari, and Udayan Khurana
    In 2023 IEEE 8th European Symposium on Security and Privacy (EuroS&P), 2023
  4. Patent
    Artificial intelligence model learning introspection
    Sahil Suneja, Yufan Zhuang, Yunhui Zheng, Alessandro Morari, and Jim Alain Laredo
    2023
  5. Patent
    Training data augmentation via program simplification
    Sahil Suneja, Yufan Zhuang, Yunhui Zheng, Alessandro Morari, and Jim Alain Laredo
    2023
  6. Patent
    Complexity based artificial intelligence model training
    Sahil Suneja, Yufan Zhuang, Yunhui Zheng, Alessandro Morari, and Jim Alain Laredo
    2023

2022

  1. Patent
    Probing Model Signal Awareness
    Yunhui Zheng, Sahil Suneja, Yufan Zhuang, Alessandro Morari, and Jim Alain Laredo
    2022
  2. AAG
    Sleeping Lion or Sick Man? Machine Learning Approaches to Deciphering Heterogeneous Images of Chinese in North America
    Qiang Fu, Yufan Zhuang, Yushu Zhu, and Xin Guo
    Annals of the American Association of Geographers, 2022

2021

  1. arXiv
    Data-Driven AI Model Signal-Awareness Enhancement and Introspection
    Sahil Suneja, Yufan Zhuang, Yunhui Zheng, Jim Laredo, and Alessandro Morari
    arXiv preprint arXiv:2111.05827, 2021
  2. arXiv
    Software Vulnerability Detection via Deep Learning over Disaggregated Code Graph Representation
    Yufan Zhuang, Sahil Suneja, Veronika Thost, Giacomo Domeniconi, Alessandro Morari, and Jim Laredo
    arXiv preprint arXiv:2109.03341, 2021
  3. SOCC Vision
    Towards Reliable AI for Source Code Understanding
    Sahil Suneja, Yunhui Zheng, Yufan Zhuang, Alessandro Morari, and Jim Laredo
    ACM Symposium on Cloud Computing, 2021
  4. FSE
    Probing model signal-awareness via prediction-preserving input minimization
    Sahil Suneja*, Yunhui Zheng*, Yufan Zhuang*, Alessandro Morari, and Jim Laredo
    ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software Engineering, 2021
  5. BDR
    Agreeing to disagree: choosing among eight topic-modeling methods
    Qiang Fu, Yufan Zhuang, Jiaxin Gu, Yushu Zhu, and Xin Guo
    Big Data Research, 2021

2020

  1. arXiv
    Exploring software naturalness through neural language models
    Luca Buratti, Saurabh Pujar, Mihaela Bornea, Scott McCarley, Yunhui Zheng, Gaetano Rossiello, Alessandro Morari, Jim Laredo, Veronika Thost, Yufan Zhuang, and others
    arXiv preprint arXiv:2006.12641, 2020
  2. arXiv
    Learning to map source code to software vulnerability using code-as-a-graph
    Sahil Suneja, Yunhui Zheng, Yufan Zhuang, Jim Laredo, and Alessandro Morari
    arXiv preprint arXiv:2006.08614, 2020

2019

  1. BPOD@BigData
    Search for K: assessing five topic-modeling approaches to 120,000 Canadian articles
    Qiang Fu, Yufan Zhuang, Jiaxin Gu, Yushu Zhu, Huihui Qin, and Xin Guo
    In BPOD Workshop at 2019 IEEE International Conference on Big Data (Big Data), 2019