2024

  • MCTS: A Multi-Reference Chinese Text Simplification Dataset
    Ruining Chong, Luming Lu, Liner Yang, Jinran Nie, Zhenghao Liu, Shuo Wang, Shuhan Zhou, Yaoxin Li, Erhong Yang
    Proceedings of LREC-COLING 2024
    [paper][code]

  • OMGEval: An Open Multilingual Generative Evaluation Benchmark for Large Language Models
    Yang Liu, Meng Xu, Shuo Wang, Liner Yang, Haoyu Wang, Zhenghao Liu, Cunliang Kong, Yun Chen, Yang Liu, Maosong Sun, Erhong Yang
    arXiv 2024
    [arXiv][code]
  • UltraLink: An Open-Source Knowledge-Enhanced Multilingual Supervised Fine-tuning Dataset
    Haoyu Wang, Shuo Wang, Yukun Yan, Xujia Wang, Zhiyu Yang, Yuzhuang Xu, Zhenghao Liu, Liner Yang, Ning Ding, Xu Han, Zhiyuan Liu, Maosong Sun
    arXiv 2024
    [arXiv][code]
  • Cross-domain Chinese Sentence Pattern Parsing
    Jingsi Yu, Cunliang Kong, Liner Yang, Meishan Zhang, Lin Zhu, Chenhui Xie, Yujie Wang, Haozhe Lin, Maosong Sun, Erhong Yang
    arXiv 2024
    [arXiv]
  • From Text to CQL: Bridging Natural Language and Corpus Search Engine
    Luming Lu, Jiyuan An, Yujie Wang, Liner Yang, Cunliang Kong, Zhenghao Liu, Shuo Wang, Haozhe Lin, Mingwei Fang, Yaping Huang, Erhong Yang
    arXiv 2024
    [arXiv]

2023

  • Lexical Complexity Controlled Sentence Generation
    Jinran Nie, Liner Yang, Yun Chen, Cunliang Kong, Junhui Zhu, Erhong Yang
    Proceedings of CCL 2023
    Winner of the CCL 2023 Best English Paper Award
    [paper][code]
  • Is Chinese Spelling Check Ready? Understanding the Correction Behavior in Real-World Scenarios
    Liner Yang, Xin Liu, Tianxin Liao, Zhenghao Liu, Mengyan Wang, Xuezhi Fang, Erhong Yang
    AI Open
    [paper]
  • End-to-end Hard Constrained Text Generation via Incrementally Predicting Segments
    Jinran Nie, Xuancheng Huang, Yang Liu, Cunliang Kong, Xin Liu, Liner Yang, Erhong Yang
    Knowledge-Based Systems
    [link]
  • Cost-efficient Crowdsourcing for Span-based Sequence Labeling: Worker Selection and Data Augmentation
    Yujie Wang, Chao Huang, Liner Yang, Zhixuan Fang, Yaping Huang, Yang Liu, Erhong Yang
    [paper]
  • Leveraging Prefix Transfer for Multi-Intent Text Revision
    Ruining Chong, Cunliang Kong, Liu Wu, Zhenghao Liu, Ziye Jin, Liner Yang, Yange Fan, Hanghang Fan, Erhong Yang
    The 61st Annual Meeting of the Association for Computational Linguistics (ACL 2023)
    [paper]
  • 汉语学习者文本多维标注语料库建设
    王莹莹, 孔存良, 杨麟儿, 胡韧奋, 杨尔弘, 孙茂松
    语言文字应用, 2023, (1): 88-100
    [paper]
  • 高频术语视角下计算机辅助语言学习领域的热点研究
    朱君辉, 王晓菀
    中国科技术语, 2023
    [paper]
  • 句式结构树库的自动构建研究
    谢晨晖, 胡正升, 杨麟儿, 廖田昕, 杨尔弘
    中文信息学报, 2023, 37(2): 15-25
    [paper]
  • 基于片段预测的词汇约束文本生成
    聂锦燃, 杨麟儿, 杨尔弘
    中文信息学报, 2023, 37(8): 150-158
    [paper]

2022

  • Lexical Complexity Controlled Sentence Generation
    Jinran Nie, Liner Yang, Yun Chen, Cunliang Kong, Junhui Zhu, Erhong Yang
    arXiv 2022
    [arXiv]
  • 文心语料库检索平台的研制
    朱君辉, 刘鑫, 杨麟儿, 师佳璐, 刘鹏远, 杨尔弘
    第十二届全国语言文字应用学术研讨会
    [paper] [slides] [demo] [talk]
  • 汉语语法点特征及其在二语文本难度自动分级研究中的应用
    朱君辉, 刘鑫, 杨麟儿, 王鸿滨, 杨尔弘
    语言文字应用, 2022, (3): 87-99
    [paper]
  • 句式结构树库的自动构建研究
    谢晨晖, 胡正升, 杨麟儿, 廖田昕, 杨尔弘
    The 21st China National Conference on Computational Linguistics (CCL 2022)
    Winner of the CCL 2022 Best Chinese Paper Award
    [paper] [slides] [code] [demo]
  • 汉语增强依存句法自动转换研究
    余婧思, 师佳璐, 杨麟儿, 肖丹, 杨尔弘
    The 21st China National Conference on Computational Linguistics (CCL 2022)
    [paper] [poster] [code] [demo]
  • COMPILING: A Benchmark Dataset for Chinese Complexity Controllable Definition Generation
    Jiaxin Yuan, Cunliang Kong, Chenhui Xie, Liner Yang, Erhong Yang
    The 21st China National Conference on Computational Linguistics (CCL 2022)
    [paper] [arXiv] [poster]
  • Multitasking Framework for Unsupervised Simple Definition Generation
    Cunliang Kong, Yun Chen, Hengyuan Zhang, Liner Yang, Erhong Yang
    The 60th Annual Meeting of the Association for Computational Linguistics (ACL 2022)
    [paper] [arXiv] [code]
  • CTAP for Chinese: A Linguistic Complexity Feature Automatic Calculation Platform
    Yue Cui, Junhui Zhu, Liner Yang, Xuezhi Fang, Xiaobin Chen, Yujie Wang, Erhong Yang
    The 13th Language Resources and Evaluation Conference (LREC 2022)
    [paper] [code] [toolkit]
  • BLCU-ICALL at SemEval-2022 Task 1: Cross-Attention Multitasking Framework for Definition Modeling
    Cunliang Kong, Yujie Wang, Ruining Chong, Liner Yang, Hengyuan Zhang, Erhong Yang, Yaping Huang
    The 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (NAACL-HLT 2022)
    [paper] [arXiv] [code]
  • Controllable Data Synthesis Method for Grammatical Error Correction
    Liner Yang, Chencheng Wang, Yun Chen, Yongping Du, Erhong Yang
    Frontiers of Computer Science, 2022
    [link] [arXiv]
  • 汉语学习者依存句法树库构建
    师佳璐, 罗昕宇, 杨麟儿, 肖丹, 胡正升, 王一君, 袁佳欣, 余婧思, 杨尔弘
    中文信息学报, 2022, 36(1): 39-46
    [paper] [link]
  • LitMind Dictionary: An Open-Source Online Dictionary
    Cunliang Kong, Xuezhi Fang, Liner Yang, Yun Chen, Erhong Yang
    arXiv 2022
    [blog] [arXiv] [code]

2021

  • YACLC: A Chinese Learner Corpus with Multidimensional Annotation
    Yingying Wang, Cunliang Kong, Liner Yang, Yijun Wang, Xiaorong Lu, Renfen Hu, Shan He, Zhenghao Liu, Yun Chen, Erhong Yang, Maosong Sun
    arXiv 2021
    [blog] [arXiv] [data]
  • Neural Quality Estimation with Multiple Hypotheses for Grammatical Error Correction
    Zhenghao Liu, Xiaoyuan Yi, Maosong Sun, Liner Yang, Tat-Seng Chua
    The 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (NAACL-HLT 2021)
    [paper] [arXiv] [code]
  • Few-Shot Domain Adaptation for Grammatical Error Correction via Meta-Learning
    Shengsheng Zhang, Yaping Huang, Yun Chen, Liner Yang, Chencheng Wang, Erhong Yang
    arXiv 2021
    [arXiv]
  • 基于BERT与柱搜索的中文释义生成
    范齐楠, 孔存良, 杨麟儿, 杨尔弘
    中文信息学报, 2021, 35(11): 80-90
    [paper] [link]
  • 面向汉语作为第二语言学习的个性化语法纠错
    张生盛, 庞桂娜, 杨麟儿, 王辰成, 杜永萍, 杨尔弘, 黄雅平
    中文信息学报, 2021, 35(12): 28-35
    [paper] [link]
  • 2020,流行语里的中国与世界
    杨尔弘, 崔悦, 朱君辉, 师佳璐
    语言生活皮书—中国语言生活状况报告 (2021), 2021:216-223
    [paper] [link]
  • 2020年科技焦点名词
    崔悦
    中国科技术语, 2021, 23(01): 28
    [paper] [link]

2020

  • Incorporating Sememes into Chinese Definition Modeling
    Liner Yang, Cunliang Kong, Yun Chen, Yang Liu, Qinan Fan, Erhong Yang
    IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2020
    [link] [arXiv] [code]
  • Toward Cross-Lingual Definition Generation for Language Learners
    Cunliang Kong, Liner Yang, Tianzuo Zhang, Qinan Fan, Zhenghao Liu, Yun Chen, Erhong Yang
    arXiv 2020
    [arXiv]
  • 汉语中介语的依存句法标注规范及标注实践
    肖丹, 杨尔弘, 张明慧, 陆天荧, 杨麟儿
    中文信息学报, 2020, 34(11): 19-28
    [paper] [link]
  • 基于Transformer增强架构的中文语法纠错方法
    王辰成, 杨麟儿, 王莹莹, 杜永萍, 杨尔弘
    中文信息学报, 2020, 34(6): 106-114
    [paper] [link]
  • 基于门控化上下文感知网络的词语释义生成方法
    张海同, 孔存良, 杨麟儿, 何姗, 杜永萍, 杨尔弘
    中文信息学报, 2020, 34(7): 105-112
    [paper] [link]
  • 二语习得视角下“被”搭配的跨语料库对比研究
    方雪至, 谢永慧, 崔悦, 杨尔弘
    The 21st Chinese Lexical Semantics Workshop (CLSW 2020)
    [paper]
  • 2019,流行语里的中国与世界
    杨尔弘, 陆天荧, 崔悦, 方雪至
    语言生活皮书—中国语言生活状况报告 (2020), 2020:229-236
    [paper] [link]
  • 2019年科技焦点名词
    陆天荧, 崔悦, 方雪至
    中国科技术语, 2020, 22(01): 75-80
    [paper] [link]