Publications

Publications by categories in reversed chronological order. Full list is available on my Google Scholar.

2023

  1. Large Language Model as Attributed Training Data Generator: A Tale of Diversity and Bias
    Yue Yu*, Yuchen Zhuang*, Jieyu Zhang*, Yu Meng, Alexander Ratner, Ranjay Krishna, Jiaming Shen, and Chao Zhang
    Proceedings of NeurIPS (D&B Track), 2023.
  2. ToolQA: A Dataset for LLM Question Answering with External Tools
    Yuchen Zhuang*, Yue Yu*, Kuan Wang*, Haotian Sun, and Chao Zhang
    Proceedings of NeurIPS (D&B Track), 2023.
  3. Robust Multi-Agent Reinforcement Learning via Adversarial Regularization: Theoretical Foundation and Stable Algorithms
    Alexander Bukharin, Yan Li, Yue Yu, Qingru Zhang, Zhehui Chen, Simiao Zuo, Chao Zhang, Songan Zhang, and Tuo Zhao
    Proceedings of NeurIPS, 2023.
  4. Cold-Start Data Selection for Better Few-shot Language Model Fine-tuning: A Prompt-based Uncertainty Propagation Approach
    Yue Yu, Rongzhi Zhang, Ran Xu, Jieyu Zhang, Jiaming Shen, and Chao Zhang
    Proceedings of ACL, 2023.
  5. ReGen: Zero-Shot Text Classification via Training Data Generation with Progressive Dense Retrieval
    Yue Yu, Yuchen Zhuang, Rongzhi Zhang, Yu Meng, Jiaming Shen, and Chao Zhang
    Proceedings of ACL Findings, 2023.
  6. Local Boosting for Weakly-Supervised Learning
    Rongzhi Zhang, Yue Yu, Jiaming Shen, Xiquan Cui, and Chao Zhang
    Proceedings of KDD, 2023.
  7. DyGen: Fine-Tuning Language Models with Noisy Labels by Dynamics-Enhanced Generative Modeling
    Yuchen Zhuang, Yue Yu, Lingkai Kong, Xiang Chen, and Chao Zhang
    Proceedings of KDD, 2023.
  8. R-Mixup: Riemannian Mixup for Biological Networks
    Xuan Kan, Zimu Li, Hejie Cui, Yue Yu, Ran Xu, Shaojun Yu, Zilong Zhang, Ying Guo, and Carl Yang
    Proceedings of KDD, 2023.
  9. Weakly-supervised Scientific Document Classification via Retrieval-Augmented Multi-stage Training
    Ran Xu*, Yue Yu*, Joyce C Ho, and Carl Yang
    Proceedings of SIGIR, 2023. (Short Paper)
  10. Neighborhood-regularized Self-Training for Learning with Few Labels
    Ran Xu, Yue Yu, Hejie Cui, Xuan Kan, Yanqiao Zhu, Joyce Ho, Chao Zhang, and Carl Yang
    Proceedings of AAAI, 2023. (Oral)
  11. Deep DAG Learning on Brain Networks for fMRI Analysis
    Yue Yu, Xuan Kan, Hejie Cui, Ran Xu, Yujia Zheng, Xiangchen Song, Yanqiao Zhu, Kun Zhang, Razieh Nabi, Ying Guo, Chao Zhang, and Carl Yang
    Proceedings of ISBI, 2023.
  12. A Survey on Knowledge Graphs for Healthcare: Resources, Application Progress, and Promise
    Hejie Cui, Jiaying Lu, Shiyu Wang, Ran Xu, Wenjing Ma, Shaojun Yu, Yue Yu, Xuan Kan, Tianfan Fu, Chen Ling, Joyce Ho, Fei Wang, and Carl Yang
    Proceedings of ICML (IMLH Workshop), 2023.

2022

  1. COCO-DR: Combating Distribution Shifts in Zero-Shot Dense Retrieval with Contrastive and Distributionally Robust Learning
    Yue Yu, Chenyan Xiong, Si Sun, Chao Zhang, and Arnold Overwijk
    Proceedings of EMNLP, 2022. (Oral)
  2. Reduce Catastrophic Forgetting of Dense Retrieval Training with Teleportation Negatives
    Si Sun, Chenyan Xiong, Yue Yu, Arnold Overwijk, Zhiyuan Liu, and Jie Bao
    Proceedings of EMNLP, 2022.
  3. ReSel: N-ary Relation Extraction from Scientific Text and Tables by Learning to Retrieve and Select
    Yuchen Zhuang, Yinghao Li, Junyang Zhang, Yue Yu, Yingjun Mou, Xiang Chen, Le Song, and Chao Zhang
    Proceedings of EMNLP, 2022. (Oral)
  4. AcTune: Uncertainty-Based Active Self-Training for Active Fine-Tuning of Pretrained Language Models
    Yue Yu, Lingkai Kong, Jieyu Zhang, Rongzhi Zhang, and Chao Zhang
    Proceedings of NAACL, 2022. (Oral)
  5. Self-Training with Differentiable Teacher
    Simiao Zuo*, Yue Yu*, Chen Liang, Haoming Jiang, Siawpeng Er, Chao Zhang, Tuo Zhao, and Hongyuan Zha
    Proceedings of NAACL Findings, 2022.
  6. Prompt-Based Rule Discovery and Boosting for Interactive Weakly-Supervised Learning
    Rongzhi Zhang, Yue Yu, Pranav Shetty, Le Song, and Chao Zhang
    Proceedings of ACL, 2022.
  7. Counterfactual and Factual Reasoning over Hypergraphs for Interpretable Clinical Predictions on EHR
    Ran Xu, Yue Yu, Chao Zhang, Mohammed K Ali, Joyce C Ho, and Carl Yang
    Proceedings of ML4H, 2022. (Best Paper Award)
  8. A survey on programmatic weak supervision
    Jieyu Zhang*, Cheng-Yu Hsieh*, Yue Yu*, Chao Zhang, and Alexander Ratner
    arXiv preprint arXiv:2202.05433, 2022.

2021

  1. Fine-Tuning Pre-trained Language Model with Weak Supervision: A Contrastive-Regularized Self-Training Approach
    Yue Yu*, Simiao Zuo*, Haoming Jiang, Wendi Ren, Tuo Zhao, and Chao Zhang
    Proceedings of NAACL, 2021. (Oral)
  2. SumGNN: multi-typed drug interaction prediction via efficient knowledge graph summarization
    Yue Yu*, Kexin Huang*, Chao Zhang, Lucas M Glass, Jimeng Sun, and Cao Xiao
    Bioinformatics, 2021.
  3. WRENCH: A Comprehensive Benchmark for Weak Supervision
    Jieyu Zhang, Yue Yu, Yinghao Li, Yujing Wang, Yaming Yang, Mao Yang, and Alexander Ratner
    Proceedings of NeurIPS (D&B Track), 2021. (Oral)

2020

  1. STEAM: Self-supervised taxonomy expansion with mini-paths
    Yue Yu, Yinghao Li, Jiaming Shen, Hao Feng, Jimeng Sun, and Chao Zhang
    Proceedings of KDD, 2020. (Oral)
  2. BOND: BERT-assisted open-domain named entity recognition with distant supervision
    Chen Liang*, Yue Yu*, Haoming Jiang*, Siawpeng Er, Ruijia Wang, Tuo Zhao, and Chao Zhang
    Proceedings of KDD, 2020. (Oral)
  3. SeqMix: Augmenting Active Sequence Labeling via Sequence Mixup
    Rongzhi Zhang, Yue Yu, and Chao Zhang
    Proceedings of EMNLP, 2020.
  4. Semantic-aware spatio-temporal app usage representation via graph convolutional network
    Yue Yu, Tong Xia, Huandong Wang, Jie Feng, and Yong Li
    Proceedings of IMWUT/UbiComp, 2020.
  5. Urban anomaly analytics: Description, detection, and prediction
    Mingyang Zhang, Tong Li, Yue Yu, Yong Li, Pan Hui, and Yu Zheng
    IEEE Transactions on Big Data, 2020.
  6. Understanding Urban Dynamics via State-Sharing Hidden Markov Model
    Tong Xia, Yong Li, Yue Yu, Fengli Xu, Qingmin Liao, and Depeng Jin
    IEEE Transactions on Knowledge and Data Engineering, 2020.

2019

  1. Understanding Urban Dynamics via State-sharing Hidden Markov Model
    Tong Xia*, Yue Yu*, Fengli Xu, Funing Sun, Diansheng Guo, Depeng Jin, and Yong Li
    Proceedings of WWW, 2019.
  2. Privacy-preserving cross-domain location recommendation
    Chen Gao, Chao Huang, Yue Yu, Huandong Wang, Yong Li, and Depeng Jin
    Proceedings of IMWUT/UbiComp, 2019.