Main Conference - Long Papers

  • Re-Examining System-Level Correlations of Automatic Summarization Evaluation Metrics Daniel Deutsch, Rotem Dror, Dan Roth
  • MOVER: Mask, Over-generate and Rank for Hyperbole Generation Yunxiang Zhang, Xiaojun Wan
  • Aligning to Normative Values in Morally Informed Game Environments Prithviraj Ammanabrolu, Liwei Jiang, Maarten Sap, Hanna Hajishirzi, Yejin Choi
  • Diagnosing Vision-and-Language Navigation: What Really Matters Wanrong Zhu, Yuankai Qi, Pradyumna Narayana, Kazoo Sone, Sugato Basu, Xin Eric Wang, Qi Wu, Miguel Eckstein, William Yang Wang
  • HiURE: Hierarchical Exemplar Contrastive Learning for Unsupervised Relation Extraction Shuliang Liu, Xuming Hu, Chenwei Zhang, Shu’ang Li, Lijie Wen, Philip S. Yu
  • Time Waits for No One! Analysis and Challenges of Temporal Misalignment Kelvin Luu, Daniel Khashabi, Suchin Gururangan, Karishma Mandyam, Noah Smith
  • Hate Speech and Counter Speech Detection: Conversational Context Does Matter Xinchen Yu, Eduardo Blanco, Lingzi Hong
  • Non-Autoregressive Chinese ASR Error Correction with Phonological Training Zheng Fang, Ruiqing Zhang, Zhongjun He, Hua Wu, Yanan Cao
  • Annotators with Attitudes: How Annotator Beliefs And Identities Bias Toxic Language Detection Maarten Sap, Swabha Swayamdipta, Laura Vianna, Xuhui Zhou, Yejin Choi, Noah Smith
  • Explaining Dialogue Evaluation Metrics using Adversarial Behavioral Analysis Baber Khalid, SUNGJIN LEE
  • You Don’t Know My Favorite Color: Preventing Dialogue Representations from Revealing Speakers’ Private Personas Haoran Li, Yangqiu Song, Lixin Fan
  • Learning to Win Lottery Tickets in BERT Transfer via Task-agnostic Mask Training Yuanxin Liu, Fandong Meng, Zheng Lin, Peng Fu, Yanan Cao, Weiping Wang, Jie Zhou
  • A Holistic Framework for Analyzing the COVID-19 Vaccine Debate Maria Leonor Pacheco, Tunazzina Islam, Monal Mahajan, Andrey Shor, Ming Yin, Lyle Ungar, Dan Goldwasser
  • Less is More: Learning to Refine Dialogue History for Personalized Dialogue Generation Hanxun Zhong, Zhicheng Dou, Yutao Zhu, Hongjin Qian, Ji-Rong Wen
  • What do Toothbrushes do in the Kitchen? How Transformers Think our World is Structured Alexander Henlein, Alexander Mehler
  • Neighbors Are Not Strangers: Improving Non-Autoregressive Translation under Low-Frequency Lexical Constraints Chun Zeng, Jiangjie Chen, Tianyi Zhuang, Rui Xu, Hao Yang, Qin Ying, shimin tao, Yanghua Xiao
  • Exploiting Inductive Bias in Transformers for Unsupervised Disentanglement of Syntax and Semantics with VAEs Ghazi Felhi, Joseph Le Roux, Djamé Seddah
  • LaMemo: Language Modeling with Look-Ahead Memory Haozhe Ji, Rongsheng Zhang, Zhenyu Yang, Zhipeng Hu, Minlie Huang
  • Few-Shot Document-Level Relation Extraction Nicholas Popovic, Michael Färber
  • Template-free Prompt Tuning for Few-shot NER Ruotian Ma, Xin Zhou, Tao Gui, Yiding Tan, Linyang Li, Qi Zhang, Xuanjing Huang
  • Hyperbolic Relevance Matching for Neural Keyphrase Extraction Mingyang Song, Yi Feng, Liping Jing
  • DialSummEval: Revisiting Summarization Evaluation for Dialogues Mingqi Gao, Xiaojun Wan
  • CoMPM: Context Modeling with Speaker’s Pre-trained Memory Tracking for Emotion Recognition in Conversation Joosung Lee, Wooin Lee
  • CONFIT: Toward Faithful Dialogue Summarization with Linguistically-Informed Contrastive Fine-tuning Xiangru Tang, Arjun Nair, Borui Wang, Bingyao Wang, Jai Amit Desai, Aaron Wade, Haoran Li, Asli Celikyilmaz, Yashar Mehdad, Dragomir Radev
  • Shedding New Light on the Language of the Dark Web Youngjin Jin, Eugene Jang, Yongjae Lee, Seungwon Shin, Jin-Woo Chung
  • Identifying Implicitly Abusive Remarks about Identity Groups using a Linguistically Informed Approach Michael Wiegand, Elisabeth Eder, Josef Ruppenhofer
  • Cross-Lingual Event Detection via Optimized Adversarial Training Luis Fernando Guzman-Nateras, Minh Van Nguyen, Thien Huu Nguyen
  • DEMix Layers: Disentangling Domains for Modular Language Modeling Suchin Gururangan, Mike Lewis, Ari Holtzman, Noah Smith, Luke Zettlemoyer
  • Nearest Neighbor Knowledge Distillation for Neural Machine Translation Zhixian Yang, Renliang Sun, Xiaojun Wan
  • Cryptocoin Bubble Detection: A New Dataset, Task & Hyperbolic Models Ramit Sawhney, Shivam Agarwal, Vivek Mittal, Paolo Rosso, Vikram Nanda, Sudheer Chava
  • IDPG: An Instance-Dependent Prompt Generation Method Zhuofeng Wu, Sinong Wang, Jiatao Gu, Rui Hou, Yuxiao Dong, V.G.Vinod Vydiswaran, Hao Ma
  • Few-shot Subgoal Planning with Language Models Lajanugen Logeswaran, Yao Fu, Moontae Lee, Honglak Lee
  • Automatic Multi-Label Prompting: Simple and Interpretable Few-Shot Classification Han Wang, Canwen Xu, Julian McAuley
  • ConfliBERT: A Pre-trained Language Model for Political Conflict and Violence Yibo Hu, MohammadSaleh Hosseini, Erick Skorupa Parolin, Javier Osorio, Latifur Khan, Patrick Brandt, Vito D’Orazio
  • Extreme Zero-Shot Learning for Extreme Text Classification Yuanhao Xiong, Wei-Cheng Chang, Cho-Jui Hsieh, Hsiang-Fu Yu, Inderjit S Dhillon
  • Overcoming Catastrophic Forgetting During Domain Adaptation of Seq2seq Language Generation Dingcheng Li, Zheng Chen, Eunah Cho, Jie Hao, Xiaohu Liu, Xing Fan, Chenlei Guo, Yang Liu
  • CORWA: A Citation-Oriented Related Work Annotation Dataset Xiangci Li, Biswadip Mandal, Jessica Ouyang
  • When Does Syntax Mediate Neural Language Model Performance? Evidence from Dropout Probes Mycal Tucker, Tiwalayo Eisape, Peng Qian, Roger P. Levy, Julie Shah
  • Maximum Bayes Smatch Ensemble Distillation for AMR Parsing Young-Suk Lee, Ramon Fernandez Astudillo, Hoang Thanh Lam, Tahira Naseem, Radu Florian, Salim Roukos
  • ExSum: From Local Explanations to Model Understanding Yilun Zhou, Marco Tulio Ribeiro, Julie Shah
  • QuALITY: Question Answering with Long Input Texts, Yes! Richard Yuanzhe Pang, Alicia Vail Parrish, Nitish Joshi, Nikita Nangia, Jason Phang, Angelica Chen, Vishakh Padmakumar, Johnny L Ma, Jana Thompson, He He, Samuel R. Bowman
  • Visual Commonsense in Pretrained Unimodal and Multimodal Models Chenyu Zhang, Benjamin Van Durme, Elias Stengel-Eskin, Zhuowan Li
  • Original or Translated? A Causal Analysis of the Impact of Translationese on Machine Translation Performance Jingwei Ni, Zhijing Jin, Markus Freitag, Mrinmaya Sachan, Bernhard Schölkopf
  • Is “my favorite new movie” my favorite movie? Probing the Understanding of Recursive Noun Phrases QING LYU, Hua Zheng, Daoxin Li, Li Zhang, Marianna Apidianaki, Chris Callison-Burch
  • Syn2Vec: Synset Colexification Graphs for Lexical Semantic Similarity John Harvill, Roxana Girju, Mark A. Hasegawa-Johnson
  • TreeMix: Compositional Constituency-based Data Augmentation for Natural Language Understanding Le Zhang, Zichao Yang, Diyi Yang
  • Dual-Channel Evidence Fusion for Fact Verification over Texts and Tables Nan Hu, Zirui Wu, Yuxuan Lai, Xiao Liu, Yansong Feng
  • Semantically Informed Slang Interpretation Zhewei Sun, Richard Zemel, Yang Xu
  • Partner Personas Generation for Dialogue Response Generation Hongyuan Lu, Wai Lam, Hong Cheng, Helen M. Meng
  • Sketching as a Tool for Understanding and Accelerating Self-attention for Long Sequences Yifan Chen, Qi Zeng, Dilek Hakkani-Tur, Di Jin, Heng Ji, Yun Yang
  • On the Effect of Pretraining Corpora on In-context Few-shot Learning by a Large-scale Language Model Seongjin Shin, Sang-Woo Lee, Hwijeen Ahn, Sungdong Kim, HyoungSeok Kim, Boseop Kim, Kyunghyun Cho, Gichang Lee, Woomyoung Park, Jung-Woo Ha, Nako Sung
  • KALA: Knowledge-Augmented Language Model Adaptation Minki Kang, Jinheon Baek, Sung Ju Hwang
  • DynamicTOC: Persona-based Table of Contents for Consumption of Long Documents Himanshu Maheshwari, Nethraa Sivakumar, Shelly Jain, Tanvi Karandikar, Vinay Aggarwal, Navita Goyal, Sumit Shekhar
  • Cross-modal Contrastive Learning for Speech Translation Rong Ye, Mingxuan Wang, Lei Li
  • Modeling Multi-Granularity Hierarchical Features for Relation Extraction Xinnian Liang, Shuangzhi Wu, Mu Li, Zhoujun Li
  • A Corpus for Understanding and Generating Moral Stories Jian Guan, Ziqi Liu, Minlie Huang
  • JointLK: Joint Reasoning with Language Models and Knowledge Graphs for Commonsense Question Answering Yueqing Sun, Qi Shi, Le Qi, Yu Zhang
  • Robust (Controlled) Table-to-Text Generation with Structure-Aware Equivariance Learning Fei Wang, Zhewei Xu, Pedro Szekely, Muhao Chen
  • A Two-Stream AMR-enhanced Model for Document-level Event Argument Extraction Runxin Xu, Peiyi Wang, Tianyu Liu, Shuang Zeng, Baobao Chang, Zhifang Sui
  • An Enhanced Span-based Decomposition Method for Few-Shot Sequence Labeling Peiyi Wang, Runxin Xu, Tianyu Liu, Qingyu Zhou, Yunbo Cao, Baobao Chang, Zhifang Sui
  • A Double-Graph Based Framework for Frame Semantic Parsing Ce Zheng, Xudong Chen, Runxin Xu, Baobao Chang
  • RAAT: Relation-Augmented Attention Transformer for Relation Modeling in Document-Level Event Extraction Yuan Liang, Zhuoxuan Jiang, di yin, Bo Ren
  • SkillSpan: Hard and Soft Skill Extraction from English Job Postings Mike Zhang, Kristian Nørgaard Jensen, Sif Dam Sonniks, Barbara Plank
  • Jam or Cream First? Modeling Ambiguity in Neural Machine Translation with SCONES Felix Stahlberg, Shankar Kumar
  • DUCK: Rumour Detection on Social Media by Modelling User and Comment Propagation Networks LIN TIAN, Xiuzhen Zhang, Jey Han Lau
  • Mitigating Toxic Degeneration with Empathetic Data: Exploring the Relationship Between Toxicity and Empathy Charles Welch, Allison Lahnala, Béla Neuendorf, Lucie Flek
  • SSEGCN: Syntactic and Semantic Enhanced Graph Convolutional Network for Aspect-based Sentiment Analysis Zheng Zhang, Zili Zhou, Yanna Wang
  • A Balanced Data Approach for Evaluating Cross-Lingual Transfer: Mapping the Linguistic Blood Bank Dan Malkin, Gabriel Stanovsky
  • Generalized Quantifiers as a Source of Error in Multilingual NLU Benchmarks Ruixiang Cui, Daniel Hershcovich, Anders Søgaard
  • Interactive Symbol Grounding with Complex Referential Expressions Rimvydas Rubavicius, Alex Lascarides
  • Domain-Oriented Prefix-Tuning: Towards Efficient and Generalizable Fine-tuning for Zero-Shot Dialogue Summarization Lulu Zhao, Fujia Zheng, Weihao Zeng, Keqing He, Weiran Xu, Huixing Jiang, Wei Wu, Yanan Wu
  • Reducing Disambiguation Biases in NMT by Leveraging Explicit Word Sense Information Niccolò Campolungo, Tommaso Pasini, Denis Emelin, Roberto Navigli
  • Match made by BERT? Towards Interpretable Paper-Reviewer Assignments in NLP Terne Sasha Thorn Jakobsen, Anna Rogers
  • Dynamic Programming in Rank Space: Scaling Structured Inference with Low-Rank HMMs and PCFGs Songlin Yang, Wei Liu, Kewei Tu
  • Learning as Conversation: Dialogue Systems Reinforced for Information Acquisition Pengshan Cai, Hui Wan, Fei Liu, Mo Yu, hong yu, Sachindra Joshi
  • Lifelong Pretraining: Continually Adapting Language Models to Emerging Corpora Xisen Jin, Dejiao Zhang, Henghui Zhu, Wei Xiao, Shang-Wen Li, Xiaokai Wei, Andrew Arnold, Xiang Ren
  • EPiDA: An Easy Plug-in Data Augmentation Framework for High Performance Text Classification Minyi Zhao, Lu Zhang, Yi Xu, Jiandong Ding, Jihong Guan, Shuigeng Zhou
  • A Study of the Attention Abnormality in Trojaned BERTs Weimin Lyu, Songzhu Zheng, Tengfei Ma, Chao Chen
  • Quantifying Adaptability in Pre-trained Language Models with 500 Tasks Belinda Z. Li, Jane A. Yu, Madian Khabsa, Luke Zettlemoyer, Alon Halevy, Jacob Andreas
  • Disentangling Indirect Answers to Yes-No Questions in Real Conversations Krishna Chaitanya Sanagavarapu, Jathin Pranav Singaraju, Anusha Kakileti, Anirudh Kaza, Aaron Abraham Mathews, Helen Li, Nathan Raul Brito, Eduardo Blanco
  • Massive-scale Decoding for Text Generation using Lattices Jiacheng Xu, Siddhartha Jonnalagadda, Greg Durrett
  • Entity Linking via Explicit Mention-Mention Coreference Modeling Dhruv Agarwal, Rico Angell, Nicholas Monath, Andrew McCallum
  • GenIE: Generative Information Extraction Martin Josifoski, Nicola De Cao, Maxime Peyrard, Fabio Petroni, Robert West
  • Symbolic Knowledge Distillation: from General Language Models to Commonsense Models Peter West, Chandra Bhagavatula, Jack Hessel, Jena D. Hwang, Liwei Jiang, Ronan Le Bras, Ximing Lu, Sean Welleck, Yejin Choi
  • Measure and Improve Robustness in NLP Models: A Survey Xuezhi Wang, Haohan Wang, Diyi Yang
  • Using Paraphrases to Study Properties of Contextual Embeddings Laura Burdick, Jonathan K Kummerfeld, Rada Mihalcea
  • Disentangling Categorization in Multi-agent Emergent Communication Washington Garcia, Hamilton Scott Clouse, Kevin R. B. Butler
  • SURF: Semantic-level Unsupervised Reward Function for Machine Translation Atijit Anuchitanukul, Julia Ive
  • Connecting the Dots between Audio and Text without Parallel Data through Visual Knowledge Transfer Yanpeng Zhao, Jack Hessel, Youngjae Yu, Ximing Lu, Rowan Zellers, Yejin Choi
  • CHAI: A CHatbot AI for Task-Oriented Dialogue with Offline Reinforcement Learning Siddharth Verma, Justin Fu, Sherry Yang, Sergey Levine
  • Multi-Vector Models with Textual Guidance for Fine-Grained Scientific Document Similarity Sheshera Mysore, Arman Cohan, Tom Hope
  • Testing the Ability of Language Models to Interpret Figurative Language Emmy Liu, Chenxuan Cui, Kenneth Zheng, Graham Neubig
  • Compositional Task-Oriented Parsing as Abstractive Question Answering Wenting Zhao, Konstantine Arkoudas, Weiqi Sun, Claire Cardie
  • What kind of company do words keep? Revisiting the distributional semantics of J.R. Firth & Zellig Harris Mikael Brunila, Jack LaViolette
  • Imagination-Augmented Natural Language Understanding Yujie Lu, Wanrong Zhu, Xin Eric Wang, Miguel Eckstein, William Yang Wang
  • Linguistic Frameworks Go Toe-to-Toe at Neuro-Symbolic Language Modeling Jakob Prange, Nathan Schneider, Lingpeng Kong
  • Joint Extraction of Entities, Relations, and Events via Modeling Inter-Instance and Inter-Label Dependencies Minh Van Nguyen, Bonan Min, Franck Dernoncourt, Thien Huu Nguyen
  • Improving Compositional Generalization with Latent Structure and Data Augmentation Linlu Qiu, Peter Shaw, Panupong Pasupat, Paweł Krzysztof Nowak, Tal Linzen, Fei Sha, Kristina Toutanova
  • Consolidating Answers in Question Answering Systems Wenxuan Zhou, Qiang Ning, Heba Elfardy, Kevin Small, Muhao Chen
  • FNet: Mixing Tokens with Fourier Transforms James Lee-Thorp, Joshua Ainslie, Ilya Eckstein, Santiago Ontanon
  • TVShowGuess: Character Comprehension in Stories as Speaker Guessing Yisi Sang, Xiangyang Mou, Mo Yu, Shunyu Yao, Jing Li, Jeffrey Stanton
  • Generating Authentic Adversarial Examples beyond Meaning-preserving with Doubly Round-trip Translation Siyu Lai, Zhen Yang, Fandong Meng, Xue Zhang, Yufeng Chen, Jinan Xu, Jie Zhou
  • ProQA: Structural Prompt-based Pre-training for Unified Question Answering Wanjun Zhong, Yifan Gao, Ning Ding, Yujia Qin, Zhiyuan Liu, Ming Zhou, Jiahai Wang, Jian Yin, Nan Duan
  • Generative Cross-Domain Data Augmentation for Aspect and Opinion Co-Extraction Junjie Li, Jianfei Yu, Rui Xia
  • DiffCSE: Difference-based Contrastive Learning for Sentence Embeddings Yung-Sung Chuang, Rumen Dangovski, Hongyin Luo, Yang Zhang, Shiyu Chang, Marin Soljacic, Shang-Wen Li, Scott Yih, Yoon Kim, James R. Glass
  • Practice Makes a Solver Perfect: Data Augmentation for Math Word Problem Solvers Vivek Kumar, Rishabh Maheshwary, Vikram Pudi
  • Revisit Overconfidence for OOD Detection: Reassigned Contrastive Learning with Adaptive Class-dependent Threshold Yanan Wu, Keqing He, Yuanmeng Yan, QiXiang Gao, Zhiyuan Zeng, Fujia Zheng, Lulu Zhao, Huixing Jiang, Wei Wu, Weiran Xu
  • COGMEN: COntextualized GNN based Multimodal Emotion recognitioN Abhinav Joshi, Ashwani Bhat, Ayush Jain, Atin Vikram Singh, Ashutosh Modi
  • KCD: Knowledge Walks and Textual Cues Enhanced Political Perspective Detection in News Media Wenqian Zhang, Shangbin Feng, Zilong Chen, Zhenyu Lei, Jundong Li, Minnan Luo
  • Emp-RFT: Empathetic Response Generation via Recognizing Feature Transitions between Utterances Wongyu Kim, Youbin Ahn, Donghyun Kim, Kyong-Ho Lee
  • Early Rumor Detection Using Neural Hawkes Process with a New Benchmark Dataset Fengzhu ZENG, Wei Gao
  • Joint Learning-based Heterogeneous Graph Attention Network for Timeline Summarization Jingyi You, Dongyuan Li, Hidetaka Kamigaito, Kotaro Funakoshi, Manabu Okumura
  • Connecting Loss Difference with Equal Opportunity for Fair Models Aili Shen, Xudong Han, Trevor Cohn, Timothy Baldwin, Lea Frermann
  • Unsupervised Stem-based Cross-lingual Part-of-Speech Tagging for Morphologically Rich Low-Resource Languages Ramy Eskander, Cass Lowry, Sujay Khandagale, Judith Lynn Klavans, Maria Polinsky, Smaranda Muresan
  • Robust Self-Augmentation for Named Entity Recognition with Meta Reweighting Linzhi Wu, Pengjun Xie, Jie Zhou, Meishan Zhang, Ma Chunping, Guangwei Xu, Min Zhang
  • Bilingual Tabular Inference: A Case Study on Indic Languages Chaitanya Agarwal, Vivek Gupta, Anoop Kunchukuttan, Manish Shrivastava
  • A Complex KBQA System using Multiple Reasoning Paths Yu Wang, Vijay Srinivasan, Hongxia Jin
  • WECHSEL: Effective initialization of subword embeddings for cross-lingual transfer of monolingual language models Benjamin Minixhofer, Fabian Paischer, Navid Rekabsaz
  • DocEE: A Large-Scale and Fine-grained Benchmark for Document-level Event Extraction MeiHan Tong, Bin Xu, Shuai Wang, Meihuan Han, Yixin Cao, Jiangqi Zhu, Siyu Chen, Lei Hou, Juanzi Li
  • On Transferability of Prompt Tuning for Natural Language Processing Yusheng Su, Xiaozhi Wang, Yujia Qin, Chi-Min Chan, Yankai Lin, Huadong Wang, Kaiyue Wen, Zhiyuan Liu, Peng Li, Juanzi Li, Lei Hou, Maosong Sun, Jie Zhou
  • Bi-SimCut: A Simple Strategy for Boosting Neural Machine Translation Pengzhi Gao, Zhongjun He, Hua Wu, Haifeng Wang
  • Knowledge Inheritance for Pre-trained Language Models Yujia Qin, Yankai Lin, Jing Yi, Jiajie Zhang, Xu Han, Zhengyan Zhang, Yusheng Su, Zhiyuan Liu, Peng Li, Maosong Sun, Jie Zhou
  • TRUE: Re-evaluating Factual Consistency Evaluation Or Honovich, Roee Aharoni, Jonathan Herzig, Hagai Taitelbaum, Doron Kukliansy, Vered Cohen, Thomas Scialom, Idan Szpektor, Avinatan Hassidim, Yossi Matias
  • Dynamic Multistep Reasoning based on Video Scene Graph for Video Question Answering JianGuo Mao, Wenbin Jiang, Xiangdong Wang, Zhifan Feng, Yajuan Lyu, Hong Liu, Yong Zhu
  • EASE: Entity-Aware Contrastive Learning of Sentence Embedding Sosuke Nishikawa, Ryokan Ri, Ikuya Yamada, Yoshimasa Tsuruoka, Isao Echizen
  • From spoken dialogue to formal summary: An utterance rewriting for dialogue summarization Yue Fang, Hainan Zhang, Hongshen Chen, Zhuoye Ding, Bo Long, Yanyan Lan, Yanquan Zhou
  • Residue-Based Natural Language Adversarial Attack Detection Vyas Raina, Mark Gales
  • A Computational Acquisition Model for Multimodal Word Categorization Uri Berger, Gabriel Stanovsky, Omri Abend, Lea Frermann
  • On the Effectiveness of Sentence Encoding for Intent Detection Meta-Learning Tingting Ma, Qianhui Wu, Zhiwei Yu, Tiejun Zhao, Chin-Yew Lin
  • Can Rationalization Improve Robustness? Howard Chen, Jacqueline He, Karthik R Narasimhan, Danqi Chen
  • One Reference Is Not Enough: Diverse Distillation with Reference Selection for Non-Autoregressive Translation Chenze Shao, Xuanfu Wu, Yang Feng
  • GMN: Generative Multi-modal Network for Practical Document Information Extraction Haoyu Cao, Jiefeng Ma, Antony Guo, Yiqing Hu, Hao Liu, Deqiang Jiang, Yinsong Liu, Bo Ren
  • Models in the Loop: Aiding Crowdworkers with Generative Annotation Assistants Max Bartolo, Tristan Thrush, Sebastian Riedel, Pontus Stenetorp, Robin Jia, Douwe Kiela
  • Adaptable Adapters Nafise Sadat Moosavi, Quentin Delfosse, Kristian Kersting, Iryna Gurevych
  • Maize: Effective and Efficient Retrieval via Lightweight Late Interaction Keshav Santhanam, Omar Khattab, Jon Saad-Falcon, Christopher Potts, Matei Zaharia
  • Multi2WOZ: A Robust Multilingual Dataset and Conversational Pretraining for Task-Oriented Dialog Chia-Chien Hung, Anne Lauscher, Ivan Vulić, Simone Paolo Ponzetto, Goran Glavaš
  • FRUIT: Faithfully Reflecting Updated Information in Text Robert L. Logan IV, Alexandre Tachard Passos, Sameer Singh, Ming-Wei Chang
  • Learning the Ordering of Coordinate Compounds and Elaborate Expressions in Hmong, Lahu, and Chinese Chenxuan Cui, Katherine J. Zhang, David R Mortensen
  • Contrastive Representation Learning for Cross-Document Coreference Resolution of Events and Entities Benjamin Hsu, Graham Horwood
  • PROMPT WAYWARDNESS: The Curious Case of Discretized Interpretation of Continuous Prompts Daniel Khashabi, Xinxi Lyu, Sewon Min, Lianhui Qin, Kyle Richardson, Sameer Singh, Sean Welleck, Hannaneh Hajishirzi, Tushar Khot, Ashish Sabharwal, Yejin Choi
  • When is BERT Multilingual? Isolating Crucial Ingredients for Cross-lingual Transfer Ameet Deshpande, Partha Talukdar, Karthik R Narasimhan
  • Benchmarking Intersectional Biases in NLP John P. Lalor, Yi Yang, Kendall Smith, Nicole Forsgren, Ahmed Abbasi
  • Sonnet Generation by Training on Non-poetic Texts with Discourse-level Coherence and Poetic Features Yufei Tian, Nanyun Peng
  • Improving In-Context Few-Shot Learning via Self-Supervised Training Mingda Chen, Jingfei Du, Ramakanth Pasunuru, Todor Mihaylov, Srini Iyer, Veselin Stoyanov, Zornitsa Kozareva
  • Bidimensional Leaderboards: Generate and Evaluate Language Hand in Hand Jungo Kasai, Keisuke Sakaguchi, Ronan Le Bras, Lavinia Dunagan, Jacob Daniel Morrison, Alexander Fabbri, Yejin Choi, Noah Smith
  • ElitePLM: An Empirical Study on General Language Ability Evaluation of Pretrained Language Models Junyi Li, Tianyi Tang, Zheng Gong, Lixin Yang, Zhuohao Yu, Zhipeng Chen, Jingyuan Wang, Xin Zhao, Ji-Rong Wen
  • Learning to Transfer Prompts for Text Generation Junyi Li, Tianyi Tang, Jian-Yun Nie, Ji-Rong Wen, Xin Zhao
  • DocAMR: Multi-Sentence AMR Representation and Evaluation Tahira Naseem, Austin Blodgett, Sadhana Kumaravel, Tim O’Gorman, Young-Suk Lee, Jeffrey Flanigan, Ramon Fernandez Astudillo, Radu Florian, Salim Roukos, Nathan Schneider
  • Lifting the Curse of Multilinguality by Pre-training Modular Transformers Jonas Pfeiffer, Naman Goyal, Xi Victoria Lin, Xian Li, James Cross, Sebastian Riedel, Mikel Artetxe
  • Transparent Human Evaluation for Image Captioning Jungo Kasai, Keisuke Sakaguchi, Lavinia Dunagan, Jacob Daniel Morrison, Ronan Le Bras, Yejin Choi, Noah Smith
  • TWEETSPIN: Fine-grained Propaganda Detection in Social Media Using Multi-View Representations Prashanth Vijayaraghavan, Soroush Vosoughi
  • On the Use of Bert for Automated Essay Scoring: Joint Learning of Multi-Scale Essay Representation Yongjie Wang, Chuan Wang, Ruobing Li, Hui Lin
  • Multimodal Dialogue State Tracking Hung Le, Nancy F. Chen, Steven HOI
  • VGNMN: Video-grounded Neural Module Networks for Video-Grounded Dialogue Systems Hung Le, Nancy F. Chen, Steven HOI
  • CHEF: A Pilot Chinese Dataset for Evidence-Based Fact-Checking Xuming Hu, Zhijiang Guo, GuanYu Wu, Aiwei Liu, Lijie Wen, Philip S. Yu
  • Persona-Guided Planning for Controlling the Protagonist’s Persona in Story Generation Zhexin Zhang, Jiaxin Wen, Jian Guan, Minlie Huang
  • Fine-grained Location Extraction via Curriculum Learning Pei Chen, Haotian Xu, Cheng Zhang, Ruihong Huang
  • LUNA: Learning Slot-Turn Alignment for Dialogue State Tracking Yifan Wang, Jing Zhao, Junwei Bao, Chaoqun Duan, Youzheng Wu, Xiaodong He
  • Stylized Knowledge-Grounded Dialogue Generation via Disentangled Template Rewriting Qingfeng Sun, Can Xu, Huang Hu, Yujing Wang, Jian Miao, Xiubo Geng, Yining Chen, Fei Xu, Daxin Jiang
  • Towards Efficient NLP: A Standard Evaluation and A Strong Baseline Xiangyang Liu, Tianxiang Sun, JunLiang He, Jiawen Wu, Lingling Wu, Xinyu Zhang, Hao Jiang, Zhao Cao, Xuanjing Huang, Xipeng Qiu
  • Clues Before Answers: Generation-Enhanced Multiple-Choice QA Zixian Huang, Ao Wu, Jiaying Zhou, Yu Gu, Yue Zhao, Gong Cheng
  • Unsupervised Paraphrasability Prediction for Compound Nominalizations John Sie Yuen Lee, Ho Hung Lim, Carol Webster
  • FactGraph: Evaluating Factuality in Summarization with Semantic Graph Representations Leonardo F. R. Ribeiro, Mengwen Liu, Iryna Gurevych, Markus Dreyer, Mohit Bansal
  • Neural Language Taskonomy: Which NLP Tasks are the most Predictive of fMRI Brain Activity? SUBBA REDDY OOTA, Veeral Agarwal, JASHN ARORA, mounika marreddy, Manish Gupta, Bapi Raju Surampudi
  • Curriculum: A Broad-Coverage Benchmark for Linguistic Phenomena in Natural Language Understanding Zeming Chen, Qiyue Gao
  • A Dataset for N-ary Relation Extraction of Drug Combinations Aryeh Tiktinsky, Vijay Viswanathan, Danna Niezni, Dana Meron Azagury, Yosi Shamay, Hillel Taub-Tabib, Tom Hope, Yoav Goldberg
  • ITA: Image-Text Alignments for Multi-Modal Named Entity Recognition Xinyu Wang, Min Gui, Yong Jiang, Zixia Jia, Nguyen Bach, Tao Wang, Zhongqiang Huang, Kewei Tu
  • Efficient Constituency Tree based Encoding for Natural Language to Bash Translation Shikhar Bharadwaj, Shirish Shevade
  • Enhance Incomplete Utterance Restoration by Joint Learning Token Extraction and Text Generation Shumpei Inoue, Tsungwei Liu, Son Hong Nguyen, Minh-Tien Nguyen
  • NeuS: Neutral Multi-News Summarization for Mitigating Framing Bias Nayeon Lee, Yejin Bang, Tiezheng YU, Andrea Madotto, Pascale Fung
  • MuCGEC: a Multi-Reference Multi-Source Evaluation Dataset for Chinese Grammatical Error Correction Yue Zhang, Zhenghua Li, Zuyi Bao, Jiacheng Li, Bo Zhang, Chen Li, Fei Huang, Min Zhang
  • Boosted Dense Retriever Patrick Lewis, Barlas Oguz, Wenhan Xiong, Fabio Petroni, Scott Yih, Sebastian Riedel
  • Analyzing Encoded Concepts in Transformer Language Models Hassan Sajjad, Nadir Durrani, Fahim Dalvi, Firoj Alam, Abdul Rafae Khan, Jia Xu
  • Should We Rely on Entity Mentions for Relation Extraction? Debiasing Relation Extraction with Counterfactual Analysis Yiwei Wang, Muhao Chen, Wenxuan Zhou, Yujun Cai, Yuxuan Liang, Dayiheng Liu, Baosong Yang, Juncheng Liu, Bryan Hooi
  • A Few Thousand Translations Go a Long Way! Leveraging Pre-trained Models for African News Translation David Ifeoluwa Adelani, Jesujoba Oluwadara Alabi, Angela Fan, Julia Kreutzer, Xiaoyu Shen, Machel Reid, Dana Ruiter, Dietrich Klakow, Peter Nabende, Ernie Chang, Tajuddeen Gwadabe, Freshia Sackey, Bonaventure F. P. Dossou, Chris Chinenye Emezue, Colin Leong, Michael Beukman, Shamsuddeen Hassan Muhammad, Guyo Dub Jarso, Oreen Yousuf, Andre Niyongabo Rubungo, Gilles HACHEME, Eric Peter Wairagala, Muhammad Umair Nasir, Benjamin Ayoade Ajibade, Tunde Oluwaseyi Ajayi, Yvonne Wambui Gitau, Jade Abbott, Mohamed Ahmed, Millicent Ochieng, Anuoluwapo Aremu, Perez Ogayo, Jonathan Mukiibi, Fatoumata Ouoba Kabore, Godson Koffi KALIPE, Derguene Mbaye, Allahsera Auguste Tapo, Victoire Memdjokam Koagne, Edwin Munkoh-Buabeng, Valencia Wagner, Idris Abdulmumin, Ayodele Awokoya
  • Document-Level Event Argument Extraction by Leveraging Redundant Information and Closed Boundary Loss Hanzhang Zhou, Kezhi Mao
  • Features or Spurious Artifacts? Data-centric Baselines for Fair and Robust Hate Speech Detection Alan Ramponi, Sara Tonelli
  • Low Resource Style Transfer via Domain Adaptive Meta Learning Xiangyang Li, Xiang Long, Yu Xia, Sujian Li
  • Progressive Class Semantic Matching for Semi-supervised Text Classification Haiming Xu, Lingqiao Liu, Ehsan M Abbasnejad
  • Domain Confused Contrastive Learning for Unsupervised Domain Adaptation Quanyu Long, Tianze Luo, Wenya Wang, Sinno Pan
  • Interpretable Proof Generation via Iterative Backward Reasoning Hanhao Qu, Yu Cao, Jun Gao, Liang Ding, Ruifeng Xu
  • PPL-MCTS: Constrained Textual Generation Through Discriminator-Guided MCTS Decoding Antoine Chaffin, Vincent Claveau, Ewa Kijak
  • Triggerless Backdoor Attack for NLP Tasks with Clean Labels Leilei Gan, Jiwei Li, Tianwei Zhang, Xiaoya Li, Yuxian Meng, Fei Wu, Yi Yang, Shangwei Guo, Chun Fan
  • Are All the Datasets in Benchmark Necessary? A Pilot Study of Dataset Evaluation for Text Classification Yang Xiao, Jinlan Fu, See-Kiong Ng, Pengfei Liu
  • Document-Level Relation Extraction with Sentences Importance Estimation and Focusing Wang Xu, Kehai Chen, Lili Mou, Tiejun Zhao
  • Improving Entity Disambiguation by Reasoning over a Knowledge Base Tom Ayoola, Joseph Fisher, Andrea Pierleoni
  • Learning to Borrow– Relation Representation for Without-Mention Entity-Pairs for Knowledge Graph Completion Huda Hakami, Mona Hakami, Angrosh Mandya, Danushka Bollegala
  • Do Trajectories Encode Verb Meaning? Dylan Ebert, Chen Sun, Ellie Pavlick
  • Selective Differential Privacy for Language Modeling Weiyan Shi, Aiqi Cui, Evan Li, Ruoxi Jia, Zhou Yu
  • Robust Conversational Agents against Imperceptible Toxicity Triggers Ninareh Mehrabi, Ahmad Beirami, Fred Morstatter, Aram Galstyan
  • Enhancing Knowledge Selection for Grounded Dialogues via Document Semantic Graphs Sha Li, Mahdi Namazifar, Di Jin, Mohit Bansal, Heng Ji, Yang Liu, Dilek Hakkani-Tur
  • MetaICL: Learning to Learn In Context Sewon Min, Mike Lewis, Luke Zettlemoyer, Hannaneh Hajishirzi
  • Dynamic Gazetteer Integration in Multilingual Models for Cross-Lingual and Cross-Domain Named Entity Recognition Besnik Fetahu, Anjie Fang, Oleg Rokhlenko, Shervin Malmasi
  • Falsesum: Generating Document-level NLI Examples for Recognizing Factual Inconsistency in Summarization Prasetya Ajie Utama, Joshua Bambrick, Nafise Sadat Moosavi, Iryna Gurevych
  • Multi-Domain Targeted Sentiment Analysis Orith Toledo-Ronen, Matan Orbach, Yoav Katz, Noam Slonim
  • Gender Bias in Masked Language Models for Multiple Languages Masahiro Kaneko, Aizhan Imankulova, Danushka Bollegala, Naoaki Okazaki
  • Federated Learning with Noisy User Feedback Rahul Sharma, Anil Ramakrishna, Ansel MacLaughlin, Anna Rumshisky, Jimit Majmudar, Clement Chung, Salman Avestimehr, Rahul Gupta
  • Don’t sweat the small stuff, classify the rest: Sample Shielding to protect text classifiers against adversarial attacks Jonathan Rusert, Padmini Srinivasan
  • Re2G: Retrieve, Rerank, Generate Michael Glass, Gaetano Rossiello, Md Faisal Mahbub Chowdhury, Ankita Naik, Pengshan Cai, Alfio Gliozzo
  • Learning to Retrieve Passages without Supervision Ori Ram, Gal Shachaf, Omer Levy, Jonathan Berant, Amir Globerson
  • Necessity and Sufficiency for Explaining Text Classifiers: A Case Study in Hate Speech Detection Esma Balkir, Isar Nejadgholi, Kathleen C. Fraser, Svetlana Kiritchenko
  • Learning To Retrieve Prompts for In-Context Learning Ohad Rubin, Jonathan Herzig, Jonathan Berant
  • Unified Semantic Typing with Meaningful Label Inference James Y. Huang, Bangzheng Li, Jiashu Xu, Muhao Chen
  • A Structured Span Selector for Span Prediction Tasks Tianyu Liu, Yuchen Eleanor Jiang, Ryan D Cotterell, Mrinmaya Sachan
  • How Gender Debiasing Affects Internal Model Representations, and Why It Matters Hadas Orgad, Seraphina Goldfarb-Tarrant, Yonatan Belinkov
  • QAFactEval: Improved QA-Based Factual Consistency Evaluation for Summarization Alexander Fabbri, Chien-Sheng Wu, Wenhao Liu, Caiming Xiong
  • Training Mixed-Domain Translation Models via Federated Learning Peyman Passban, Tanya Roosta, Rahul Gupta, ankit Chadha, Clement Chung
  • Interactive Query-Assisted Summarization via Deep Reinforcement Learning Ori Shapira, Ramakanth Pasunuru, Mohit Bansal, Ido Dagan, Yael Amsterdamer
  • Text Style Transfer via Optimal Transport Nasim Nouri
  • AnswerSumm: A Manually-Curated Dataset and Pipeline for Answer Summarization Alexander Fabbri, Xiaojian Wu, Srini Iyer, Haoran Li, Mona T. Diab
  • What do tokens know about their characters and how do they know it? Ayush Kaushal, Kyle Mahowald
  • WiC = TSV = WSD: On the Equivalence of Three Semantic Tasks Bradley Hauer, Grzegorz Kondrak
  • Combating the curse of multilinguality in cross-lingual WSD via the application of sparsified contextualized word representations Gábor Berend
  • A Shoulder to Cry on: Towards A Motivational Virtual Assistant for Assuaging Mental Agony Tulika Saha, Saichethan Miriyala Reddy, Anindya Sundar Das, Sriparna Saha, Pushpak Bhattacharyya
  • Does Summary Evaluation Survive Translation to Other Languages? Spencer Braun, Oleg Vasilyev, Neslihan Iskender, John Bohannon
  • LITE: Intent-based Task Representation Learning Using Weak Supervision Naoki Otani, Michael Gamon, Sujay Kumar Jauhar, Mei Yang, Sri Raghu Malireddi, Oriana Riva
  • SAIS: Supervising and Augmenting Intermediate Steps for Document-Level Relation Extraction Yuxin Xiao, Zecheng Zhang, Yuning Mao, Carl Yang, Jiawei Han
  • Towards Understanding Large-Scale Discourse Structures in Pre-Trained and Fine-Tuned Language Models Patrick Huber, Giuseppe Carenini
  • Sparse Distillation: Speeding Up Text Classification by Using Bigger Student Models Qinyuan Ye, Madian Khabsa, Mike Lewis, Sinong Wang, Xiang Ren, Aaron Jaech
  • GPL: Generative Pseudo Labeling for Unsupervised Domain Adaptation of Dense Retrieval Kexin Wang, Nandan Thakur, Nils Reimers, Iryna Gurevych
  • Do Prompt-Based Models Really Understand the Meaning of Their Prompts? Albert Webson, Ellie Pavlick
  • MINION: a Large-Scale and Diverse Dataset for Multilingual Event Detection Amir Pouran Ben Veyseh, Minh Van Nguyen, Franck Dernoncourt, Thien Huu Nguyen
  • End-to-End Chinese Speaker Identification: Formulation, Annotation, and Methods Dian Yu, Ben Zhou, Dong Yu
  • Learning to Express in Knowledge-Grounded Conversation Xueliang Zhao, Tingchen Fu, Chongyang Tao, Wei Wu, Dongyan Zhao, Rui Yan
  • Modularized Transfer Learning with Multiple Knowledge Graphs for Zero-shot Commonsense Reasoning Yu Jin Kim, Beong-woo Kwak, Youngwook Kim, Reinald Kim Amplayo, seung-won hwang, Jinyoung Yeo
  • Evidentiality-guided Generation for Knowledge-Intensive NLP Tasks Akari Asai, Matt Gardner, Hanna Hajishirzi
  • On Systematic Style Differences between Unsupervised and Supervised MT and an Application for High-Resource Machine Translation Kelly Marchisio, Markus Freitag, David Grangier
  • Generic and Trend-aware Curricula for Relation Extraction in Text Graphs Nidhi Vakil, Hadi Amiri
  • Locally Aggregated Feature Attribution on Natural Language Understanding Sheng Zhang, Jin Wang, Haitao Jiang, Rui Song
  • Sentence-Level Resampling for Named Entity Recognition Xiaochen Wang, Yue Wang
  • Building a Role Specified Open-Domain Dialogue System Leveraging Large-Scale Language Models Sanghwan Bae, Donghyun Kwak, Sungdong Kim, Donghoon Ham, Soyoung Kang, Sang-Woo Lee, Woomyoung Park
  • KroneckerBERT: Significant Compression of Pre-trained Language Models Through Kronecker Decomposition and Knowledge Distillation Marzieh S. Tahaei, Ella Charlaix, Vahid Partovi Nia, Ali Ghodsi, Mehdi Rezagholizadeh
  • D2U: Distance-to-Uniform Learning for Out-of-Scope Detection Eyup Halit Yilmaz, Cagri Toraman
  • Modeling Exemplification in Long-form Question Answering via Retrieval Shufan Wang, Fangyuan Xu, Laure Thompson, Eunsol Choi, Mohit Iyyer
  • Don’t Take It Literally: An Edit-Invariant Sequence Loss for Text Generation Guangyi Liu, Zichao Yang, Tianhua Tao, Xiaodan Liang, Junwei Bao, Zhen Li, Xiaodong He, Shuguang Cui, Zhiting Hu
  • Unsupervised Cross-Lingual Transfer of Structured Predictors without Source Data Kemal Kurniawan, Lea Frermann, Philip Schulz, Trevor Cohn
  • CS1QA: A Dataset for Code-based Question Answering in an Introductory Programming Course Changyoon Lee, Yeon Seonwoo, Alice Oh
  • Event Schema Induction with Double Graph Autoencoders Xiaomeng Jin, Manling Li, Heng Ji
  • Multi-Relational Graph Transformer for Automatic Short Answer Grading Rajat Agarwal, Varun Khurana, Karish Grover, Mukesh Mohania, Vikram Goyal
  • Even the Simplest Baseline Needs Careful Re-investigation: A Case Study on XML-CNN Si-An Chen, Jie-Jyun Liu, Tsung-Han Yang, Hsuan-Tien Lin, Chih-Jen Lin
  • Simple Local Attentions Remain Competitive for Long-Context Tasks Wenhan Xiong, Barlas Oguz, Anchit Gupta, Xilun Chen, Diana Liskovich, Omer Levy, Scott Yih, Yashar Mehdad
  • Frustratingly Easy System Combination for Grammatical Error Correction Muhammad Reza Qorib, Seung-Hoon Na, Hwee Tou Ng
  • All You May Need for VQA are Image Captions Soravit Changpinyo, Doron Kukliansy, Idan Szpektor, Xi Chen, Nan Ding, Radu Soricut
  • MGIMN: Multi-Grained Interactive Matching Network for Few-shot Text Classification Jianhai Zhang, Mieradilijiang Maimaiti, Gao Xing, Yuanhang Zheng, Ji Zhang
  • Hero-Gang Neural Model For Named Entity Recognition Jinpeng Hu, Yaling Shen, Yang Liu, Xiang Wan, Tsung-Hui Chang
  • Bridging the Gap between Language Models and Cross-Lingual Sequence Labeling Nuo Chen, Linjun Shou, MING GONG, Jian Pei, Daxin Jiang
  • DEGREE: A Data-Efficient Generation-Based Event Extraction Model I-Hung Hsu, Kuan-Hao Huang, Elizabeth Boschee, Scott Miller, Prem Natarajan, Kai-Wei Chang, Nanyun Peng
  • MultiCite: Modeling realistic citations requires moving beyond the single-sentence single-label setting Anne Lauscher, Brandon Ko, Bailey Kuehl, Sophie Johnson, Arman Cohan, David Jurgens, Kyle Lo
  • The Devil is in the Details: On the Pitfalls of Vocabulary Selection in Neural Machine Translation Tobias Domhan, Eva Hasler, Ke Tran, Sony Trenous, Bill Byrne, Felix Hieber
  • Intent Detection and Discovery from User Logs via Deep Semi-Supervised Contrastive Clustering Rajat Kumar, Mayur Patidar, VAIBHAV VARSHNEY, Lovekesh Vig, Gautam Shroff
  • RSTGen: Imbuing Fine-Grained Interpretable Control into Long-FormText Generators Rilwan Akanni Adewoyin, Ritabrata Dutta, Yulan He
  • TIE: Topological Information Enhanced Structural Reading Comprehension on Web Pages Zihan Zhao, Lu Chen, Ruisheng Cao, Hongshen Xu, Xingyu Chen, Kai Yu
  • Non-Autoregressive Machine Translation: It’s Not as Fast as it Seems Jindřich Helcl, Barry Haddow, Alexandra Birch
  • Proposition-Level Clustering for Multi-Document Summarization Ori Ernst, Avi Caciularu, Ori Shapira, Ramakanth Pasunuru, Mohit Bansal, Jacob Goldberger, Ido Dagan
  • A Study of Syntactic Multi-Modality in Non-Autoregressive Machine Translation Kexun Zhang, Rui Wang, Xu Tan, Junliang Guo, Yi Ren, Tao Qin, Tie-Yan Liu
  • ValCAT: Generating Variable-Length Contextualized Adversarial Transformations using Encoder-Decoder Chuyun Deng, Mingxuan Liu, Yue Qin, Jia Zhang, Hai-Xin Duan, Donghong Sun
  • Representation Learning for Conversational Data using Discourse Mutual Information Maximization Bishal Santra, Sumegh Roychowdhury, Aishik Mandal, Vasu Gurram, Atharva Naik, Manish Gupta, Pawan Goyal
  • MuPAD: A Chinese Multi-Domain Predicate-Argument Dataset Yahui Liu, Haoping Yang, Chen Gong, Qingrong Xia, Zhenghua Li, Min Zhang
  • Measuring Fairness with Biased Rulers: A Comparative Study on Bias Metrics for Pre-trained Language Models Pieter Delobelle, Ewoenam Kwaku Tokpo, Toon Calders, Bettina Berendt
  • Improving Constituent Representation with Hypertree Neural Networks Hao Zhou, Gongshen Liu, Kewei Tu
  • Improving Multi-Document Summarization through Referenced Flexible Extraction with Credit-Awareness Yun-Zhu Song, Yi-Syuan Chen, Hong-Han Shuai
  • OPERA: Operation-Pivoted Discrete Reasoning over Text Yongwei Zhou, Junwei Bao, Chaoqun Duan, Haipeng Sun, jiahui liang, Yifan Wang, Jing Zhao, Youzheng Wu, Xiaodong He, Tiejun Zhao
  • Guiding Visual Question Generation Nihir Vedd, Zixu Wang, Marek Rei, yishu miao, Lucia Specia
  • Implicit n-grams Induced by Recurrence Xiaobing Sun, Wei Lu
  • MoEBERT: from BERT to Mixture-of-Experts via Importance-Guided Adaptation Simiao Zuo, Qingru Zhang, Chen Liang, Pengcheng He, Tuo Zhao, Weizhu Chen
  • Aspect Is Not You Need: No-aspect Differential Sentiment Framework for Aspect-based Sentiment Analysis Jiahao Cao, Rui Liu, Huailiang Peng, Lei Jiang, Xu Bai
  • Disentangled Learning of Stance and Aspect Topics for Vaccine Attitude Detection in Social Media Lixing Zhu, Gabriele Pergola, Zheng Fang, Robert Procter, Yulan He
  • BlonDe: An Automatic Evaluation Metric for Document-level Machine Translation Yuchen Eleanor Jiang, Tianyu Liu, Shuming Ma, Dongdong Zhang, Rico Sennrich, Ryan Cotterell, Mrinmaya Sachan
  • Beyond Emotion: A Multi-Modal Dataset for Human Desire Understanding Ao Jia, Yu He, Yazhou Zhang, Sagar Uprety, Dawei Song, Christina Lioma
  • Graph and Attention Based Fact Verification and Heterogeneous COVID-19 Claims Dataset Miguel Arana-Catania, Elena Kochkina, Arkaitz Zubiaga, Maria Liakata, Robert Procter, Yulan He
  • Many Hands Make Light Work: Using Essay Traits to Automatically Score Essays Rahul Kumar, Sandeep Mathias, Sriparna Saha, Pushpak Bhattacharyya
  • Forecasting COVID-19 Caseloads Using Unsupervised Embedding Clusters of Social Media Posts Felix Drinkall, Stefan Zohren, Janet B. Pierrehumbert
  • Go Back in Time: Generating Flashbacks in Stories with Event Plots and Temporal Prompts Rujun Han, Hong Chen, Yufei Tian, Nanyun Peng
  • Label Anchored Contrastive Learning for Language Understanding Zhenyu Zhang, Yuming Zhao, Meng Chen, Xiaodong He
  • AcTune: Uncertainty-Aware Active Self-Training for Active Fine-Tuning of Pretrained Language Models Yue Yu, Lingkai Kong, Jieyu Zhang, Rongzhi Zhang, Chao Zhang
  • Quality-Aware Decoding for Neural Machine Translation Patrick Fernandes, António Farinhas, Ricardo Rei, José G. C. de Souza, Perez Ogayo, Graham Neubig, Andre Martins
  • Learning to Selectively Learn for Weakly Supervised Paraphrase Generation with Model-based Reinforcement Learning Haiyan Yin, Dingcheng Li, Ping Li
  • On the Economics of Multilingual Few-shot Learning: Modeling the Cost-Performance Trade-offs of Machine Translated and Manual Data Kabir Ahuja, Monojit Choudhury, Sandipan Dandapat
  • Hatemoji: A Test Suite and Adversarially-Generated Dataset for Benchmarking and Detecting Emoji-based Hate Hannah Rose Kirk, Bertram Vidgen, Paul Rottger, Tristan Thrush, Scott A. Hale
  • Efficient Hierarchical Domain Adaptation for Pretrained Language Models Alexandra Chronopoulou, Matthew E Peters, Jesse Dodge
  • Commonsense and Named Entity Aware Knowledge Grounded Dialogue Generation Deeksha varshney, Akshara Prabhakar, Asif Ekbal
  • Quantifying Synthesis and Fusion and their Impact on Machine Translation Arturo Oncevay, Duygu Ataman, Niels van Berkel, Barry Haddow, Alexandra Birch, Johannes Bjerva
  • Theory-Grounded Measurement of U.S. Social Stereotypes in English Language Models Yang Trista Cao, Anna Sotnikova, Hal Daumé III, Rachel Rudinger, Linda Zou
  • Context-Aware Abbreviation Expansion Using Large Language Models Shanqing Cai, Subhashini Venugopalan, Katrin Tomanek, Ajit Narayanan, Meredith Ringel Morris, Michael Brenner
  • MultiSpanQA: A Dataset for Multi-Span Question Answering Haonan Li, Martin Tomko, Maria Vasardani, Timothy Baldwin
  • DISAPERE: A Dataset for Discourse Structure in Peer Review Discussions Neha Nayak Kennard, Tim O’Gorman, Rajarshi Das, Akshay Sharma, Chhandak Bagchi, Matthew Clinton, Pranay Kumar Yelugam, Hamed Zamani, Andrew McCallum
  • Cross-Domain Detection of GPT-2-Generated Technical Text Juan Diego Rodríguez, Todd Hay, David Gros, Zain Shamsi, Ravi Srinivasan
  • Towards a Progression-Aware Autonomous Dialogue Agent Abraham Sanders, Tomek Strzalkowski, Mei Si, Albert Chang
  • Unsupervised Slot Schema Induction for Task-oriented Dialog Dian Yu, Mingqiu Wang, Yuan Cao, Izhak Shafran, Laurent El Shafey, Hagen Soltau
  • Database Search Results Disambiguation for Task-Oriented Dialog Systems Kun Qian, Satwik Kottur, Ahmad Beirami, Shahin Shayandeh, Paul A. Crook, Alborz Geramifard, Zhou Yu, Chinnadhurai Sankar
  • Probing via Prompting and Pruning Jiaoda Li, Mrinmaya Sachan, Ryan D Cotterell
  • CoSe-Co: Text Conditioned Generative CommonSense Contextualizer Rachit Bansal, Milan Aggarwal, Sumit Bhatia, Jivat Neet Kaur, Balaji Krishnamurthy
  • DREAM: Improving Situational QA by First Elaborating the Situation Yuling Gu, Bhavana Dalvi, Peter Clark
  • Masked Part-Of-Speech Model: Does modeling long context help unsupervised POS-tagging? Xiang Zhou, Shiyue Zhang, Mohit Bansal
  • Inducing and Using Alignments for Transition-based AMR Parsing Andrew Drozdov, Jiawei Zhou, Radu Florian, Andrew McCallum, Tahira Naseem, Yoon Kim, Ramon Fernandez Astudillo
  • EmpHi: Generating Empathetic Responses with Human-like Intents MAO YAN CHEN, Siheng Li, Yujiu Yang
  • ScAN: Suicide Attempt and Ideation Events Dataset Bhanu Pratap Singh Rawat, Samuel Kovaly, Hong Yu, Wilfred Pigeon
  • FactPEGASUS: Factuality-Aware Pre-training and Fine-tuning for Abstractive Summarization David Wan, Mohit Bansal
  • DocTime: A Document-level Temporal Dependency Graph Parser Puneet Mathur, Vlad I Morariu, Verena Kaynig-Fittkau, Jiuxiang Gu, Franck Dernoncourt, Quan Hung Tran, Ani Nenkova, Dinesh Manocha, Rajiv Jain
  • When a sentence does not introduce a discourse entity, Transformer-based models still often refer to it Sebastian Schuster, Tal Linzen
  • KAT: A Knowledge Augmented Transformer for Vision-and-Language Liangke Gui, Borui Wang, Qiuyuan Huang, Alexander G Hauptmann, Yonatan Bisk, Jianfeng Gao
  • Provably Confidential Language Modelling Xuandong Zhao, Lei Li, Yu-Xiang Wang
  • OmniTab: Pretraining with Natural and Synthetic Data for Few-shot Table-based Question Answering Zhengbao Jiang, Yi Mao, Pengcheng He, Graham Neubig, Weizhu Chen
  • CoSIm: Commonsense Reasoning for Counterfactual Scene Imagination Hyounghun Kim, Abhay Zala, Mohit Bansal
  • CompactIE: Compact Facts in Open Information Extraction Farima Fatahi Bayat, Nikita Bhutani, H. Jagadish
  • WALNUT: A Benchmark on Semi-weakly Supervised Learning for Natural Language Understanding Guoqing Zheng, Giannis Karamanolakis, Kai Shu, Ahmed Hassan Awadallah
  • Textless Speech-to-Speech Translation on Real Data Ann Lee, Hongyu Gong, Paul-Ambroise Duquenne, Holger Schwenk, Peng-Jen Chen, Changhan Wang, Sravya Popuri, Juan Pino, Jiatao Gu, Wei-Ning Hsu
  • GRAM: Fast Fine-tuning of Pre-trained Language Models for Content-based Collaborative Filtering Yoonseok Yang, Kyu Seok Kim, Minsam Kim, Juneyoung Park
  • Teaching BERT to Wait: Balancing Accuracy and Latency for Streaming Disfluency Detection Angelica Chen, Vicky Zayats, Daniel David Walker, Dirk Padfield
  • Explaining Toxic Text via Knowledge Enhanced Text Generation Rohit Sridhar, Diyi Yang
  • NeuroLogic A*esque Decoding: Constrained Text Generation with Lookahead Heuristics Ximing Lu, Sean Welleck, Peter West, Liwei Jiang, Jungo Kasai, Daniel Khashabi, Ronan Le Bras, Lianhui Qin, Youngjae Yu, Rowan Zellers, Noah Smith, Yejin Choi
  • Learning Dialogue Representations from Consecutive Utterances Zhihan Zhou, Dejiao Zhang, Wei Xiao, Nicholas Dingwall, Xiaofei Ma, Andrew Arnold, Bing Xiang
  • Long-term Control for Dialogue Generation: Methods and Evaluation Ramya Ramakrishnan, Hashan Buddhika Narangodage, Mauro Schilman, Kilian Q Weinberger, Ryan McDonald
  • On the Use of External Data for Spoken Named Entity Recognition Ankita Pasad, Felix Wu, Suwon Shon, Karen Livescu, Kyu Han
  • Fuse It More Deeply! A Variational Transformer with Layer-Wise Latent Variable Inference for Text Generation Jinyi Hu, Xiaoyuan Yi, Wenhao Li, Maosong Sun, Xing Xie
  • Meta Learning for Natural Language Processing: A Survey Hung-yi Lee, Shang-Wen Li, Thang Vu
  • Reframing Human-AI Collaboration for Generating Free-Text Explanations Sarah Wiegreffe, Jack Hessel, Swabha Swayamdipta, Mark Riedl, Yejin Choi
  • Non-Autoregressive Neural Machine Translation with Consistency Regularization Optimized Variational Framework Minghao Zhu, Junli Wang, Chungang Yan
  • Disentangled Action Recognition with Knowledge-bases Zhekun Luo, Shalini Ghosh, Devin Guillory, Keizo Kato, Trevor Darrell, Huijuan Xu
  • Cross-document Misinformation Detection based on Event Graph Reasoning Xueqing Wu, Kung-Hsiang Huang, Yi Fung, Heng Ji
  • Fine-tuning Pre-trained Language Models for Few-shot Intent Detection: Supervised Pre-training and Isotropization Haode Zhang, Haowen Liang, Yuwei Zhang, Li-Ming Zhan, Xiao-Ming Wu, Xiaolei Lu, Albert Y.S. Lam
  • On the Robustness of Reading Comprehension Models to Entity Renaming Jun Yan, Yang Xiao, Sagnik Mukherjee, Bill Yuchen Lin, Robin Jia, Xiang Ren
  • Towards Robust and Semantically Organised Latent Representations for Unsupervised Text Transfer Sharan Narasimhan, Suvodip Dey, Maunendra Sankar Desarkar
  • On Synthetic Data for Back Translation Jiahao Xu, Yubin Ruan, Wei Bi, Guoping Huang, Shuming Shi, Lihui Chen, Lemao Liu
  • Diversifying Neural Dialogue Generation via Negative Distillation Yiwei Li, Shaoxiong Feng, Bin Sun, Kan Li
  • Ask Me Anything in Your Native Language Nikita Sorokin, Dmitry Abulkhanov, Irina Piontkovskaya, Valentin Malykh
  • Understand before Answer: Improve Temporal Reading Comprehension via Precise Question Understanding Hao Huang, Xiubo Geng, Guodong Long, Daxin Jiang
  • Empathic Machines: Using Intermediate Features as Levers to Emulate Emotions in Text-To-Speech Systems Saiteja Kosgi, Sarath Sivaprasad, Niranjan Pedanekar, Anil Kumar Nelakanti, Vineet Gandhi
  • TSTR: Too Short to Represent, Summarize with Details! Intro-Guided Extended Summary Generation Sajad Sotudeh, Nazli Goharian
  • Seed-Guided Topic Discovery with Out-of-Vocabulary Seeds Yu Zhang, Yu Meng, Xuan Wang, Sheng Wang, Jiawei Han
  • GlobEnc: Quantifying Global Token Attribution by Incorporating the Whole Encoder Layer in Transformers Ali Modarressi, Mohsen Fayyaz, Yadollah Yaghoobzadeh, Mohammad Taher Pilehvar
  • Cooperative Self-training of Machine Reading Comprehension Hongyin Luo, Shang-Wen Li, Mingye Gao, Seunghak Yu, James R. Glass
  • Political Ideology and Polarization: A Multi-dimensional Approach Barea Sinno, Bernardo Oviedo, Katherine Atwell, Malihe Alikhani, Junyi Jessy Li
  • CERES: Pretraining of Graph-Conditioned Transformer for Semi-Structured Session Data Rui Feng, Chen Luo, Qingyu Yin, Bing Yin, Tuo Zhao, Chao Zhang
  • Knowledge-Grounded Dialogue Generation with a Unified Knowledge Representation Yu Li, Baolin Peng, yelong shen, Yi Mao, Lars Liden, Zhou Yu, Jianfeng Gao
  • NewsEdits: A Dataset of News Article Revision Histories and a Novel Document-Level Reasoning Challenge Alexander Spangher, Xiang Ren, Jonathan May, Nanyun Peng
  • Batch-Softmax Contrastive Loss for Pairwise Sentence Scoring Tasks Anton Chernyavskiy, Dmitry Ilvovsky, Pavel Kalinin, Preslav Nakov
  • Enhancing Self-Attention with Knowledge-Assisted Attention Maps Jiangang Bai, Yujing Wang, Hong Sun, Ruonan Wu, Tianmeng Yang, Pengfei Tang, Wei Shen, Defu Cao, Mingliang Zhang, Yaming Yang, Jing Bai, Yunhai Tong, Hao Sun, Ruofei Zhang
  • Semantic Diversity in Dialogue with Natural Language Inference Katherine Stasaski, Marti Hearst
  • Learning Natural Language Generation from Scratch with Truncated Reinforcement Learning Alice Martin, Guillaume Quispe, Charles Ollion, Sylvain Le Corff, Florian Strub, Olivier Pietquin
  • Social Norms Guide Reference Resolution Mitchell Abrams, Matthias Scheutz
  • Putting the Con in Context: Identifying Deceptive Actors in the Game of Mafia Samee Omotayo Ibraheem, Gaoyue Zhou, John DeNero
  • SwahBERT: Language Model of Swahili Gati L Martin, Medard Medard Mswahili, Young-Seob Jeong, Jiyoung Woo

Main Conference - Short Papers

  • Embarrassingly Simple Performance Prediction for Abductive Natural Language Inference Emīls Kadiķis, Vaibhav Srivastav, Roman Klinger
  • MCSE: Multimodal Contrastive Learning of Sentence Embeddings Miaoran Zhang, Marius Mosbach, David Ifeoluwa Adelani, Michael A. Hedderich, Dietrich Klakow
  • Investigating Crowdsourcing Protocols for Evaluating the Factual Consistency of Summaries Xiangru Tang, Alexander Fabbri, Haoran Li, Ziming Mao, Griffin Thomas Adams, Borui Wang, Asli Celikyilmaz, Yashar Mehdad, Dragomir Radev
  • Consistency Training with Virtual Adversarial Discrete Perturbation Jungsoo Park, Gyuwan Kim, Jaewoo Kang
  • Conceptualizing Treatment Leakage in Text-based Causal Inference Adel Daoud, Connor Thomas Jerzak, Richard Johansson
  • Label Definitions Improve Semantic Role Labeling Li Zhang, Ishan Jindal, Yunyao Li
  • Contrastive Learning for Prompt-based Few-shot Language Learners Yiren Jian, Chongyang Gao, Soroush Vosoughi
  • Embedding Hallucination for Few-shot Language Fine-tuning Yiren Jian, Chongyang Gao, Soroush Vosoughi
  • Few-Shot Semantic Parsing with Language Models Trained On Code Richard Shin, Benjamin Van Durme
  • Modeling Explicit Task Interactions in Document-Level Joint Entity and Relation Extraction Liyan Xu, Jinho D. Choi
  • On the Origin of Hallucinations in Conversational Models: Is it the Datasets or the Models? Nouha Dziri, Sivan Milton, Mo Yu, Osmar Zaiane, Siva Reddy
  • Meet Your Favorite Character: Open-domain Chatbot Mimicking Fictional Characters with only a Few Utterances Seungju Han, Beomsu Kim, Jin Yong Yoo, Seokjun Seo, Sangbum Kim, Enkhbayar Erdenee, Buru Chang
  • Models In a Spelling Bee: Language Models Implicitly Learn the Character Composition of Tokens Itay Itzhak, Omer Levy
  • Exact Paired Permutation Testing Algorithms for NLP Systems Ran Zmigrod, Tim Vieira, Ryan D Cotterell
  • Mining Clues from Incomplete Utterance: A Query-enhanced Network for Incomplete Utterance Rewriting Shuzheng Si, Shuang Zeng, Baobao Chang
  • Partial-input baselines show that NLI models can ignore context, but they don’t. Neha Srikanth, Rachel Rudinger
  • How Do Construct-Driven vs. Construct-Agnostic Counterfactuals Affect the Robustness of Social Computing Models? Indira Sen, Mattia Samory, Claudia Wagner, Isabelle Augenstein
  • Learning to Generate Examples for Semantic Processing Tasks Danilo Croce, Simone Filice, Giuseppe Castellucci, Roberto Basili
  • Does Pre-training Induce Systematic Inference? How Masked Language Models Acquire Commonsense Knowledge Ian Porada, Alessandro Sordoni, Jackie CK Cheung
  • Show, Don’t Tell: Demonstrations Outperform Descriptions for Schema-Guided Task-Oriented Dialogue Raghav Gupta, Harrison Lee, Jeffrey Zhao, Yuan Cao, Abhinav Rastogi, Yonghui Wu
  • Learning Cross-Lingual IR from an English Retriever Yulong Li, Martin Franz, Md Arafat Sultan, Bhavani Iyer, Young-Suk Lee, Avirup Sil
  • A Follower-aware Speaker Model For Vision-and-Language Navigation Zi-Yi Dou, Nanyun Peng
  • Uninformative Input Features and Counterfactual Invariance: Two Perspectives on Spurious Correlations in Natural Language Jacob Eisenstein
  • Causal Distillation for Language Models Zhengxuan Wu, Atticus Geiger, Joshua Rozner, Elisa Kreiss, Hanson Lu, Thomas Icard, Christopher Potts, Noah Goodman
  • Grapheme-to-Phoneme Conversion for Thai using Neural Regression Models Tomohiro Yamasaki
  • A Data Cartography based MixUp for Pre-trained Language Models Seo Yeon Park, Cornelia Caragea
  • Improving negation detection with negation-focused pre-training Thinh Hung Truong, Timothy Baldwin, Trevor Cohn, Karin Verspoor
  • AISFG: Abundant Information Slot Filling Generator Yang Yan, Junda Ye, Zhongbao Zhang, Liwen Wang
  • Collective Self-Labeling for Passage Retrieval Jihyuk Kim, Minsoo Kim, seung-won hwang
  • Leaner and Faster: Two-Stage Model Compression for Lightweight Text-Image Retrieval Siyu Ren, Kenny Q. Zhu
  • Generative Biomedical Entity Linking via Knowledge Base-Guided Pre-training and Synonyms-Aware Fine-tuning Hongyi Yuan, Zheng Yuan, Sheng Yu
  • Towards Debiasing Translation Artifacts KOEL DUTTA CHOWDHURY, Rricha Jalota, Cristina España-Bonet, Josef van Genabith
  • Is Neural Topic Modelling Better than Clustering? An Empirical Study on Clustering with Contextual Embeddings for Topics Zihan Zhang, Meng Fang, Ling Chen, Mohammad Reza Namazi Rad
  • Does it really generalize well on unseen data? Systematic Evaluation of Relational Triple Extraction Methods Juhyuk Lee, Min-Joong Lee, June Yong Yang, Eunho Yang
  • Quantifying Language Variation Acoustically with Few Resources Martijn Bartelds, Martijn Wieling
  • Reproducibility Beyond the NLP Research Community: A Study on User Experience Shane Storks, Keunwoo Peter Yu, Joyce Chai
  • ChapterBreak: A Challenge Dataset for Long-Range Language Models Simeng Sun, Katherine Thai, Mohit Iyyer
  • How Conservative are Language Models? Adapting to the Introduction of Gender-Neutral Pronouns Stephanie Brandl, Ruixiang Cui, Anders Søgaard
  • Exposing the Limits of Video-Text Models through Contrast Sets Jae Sung Park, Sheng Shen, Ali Farhadi, Trevor Darrell, Yejin Choi, Anna Rohrbach
  • Improving Neural Models for Radiology Report Retrieval with Lexicon-based Automated Annotation Luyao Shi, Tanveer Syeda-mahmood, Tyler Baldwin
  • UserIdentifier: Implicit User Representations for Simple and Effective Personalized Sentiment Analysis Fatemehsadat Mireshghallah, Vaishnavi Shrivastava, Milad Shokouhi, Taylor Berg-Kirkpatrick, Robert Sim, Dimitrios Dimitriadis
  • Recognition of They/Them as Singular Personal Pronouns in Coreference Resolution Connor Baumler, Rachel Rudinger
  • Tricks for Training Sparse Translation Models Dheeru Dua, Shruti Bhosale, Vedanuj Goswami, James Cross, Mike Lewis, Angela Fan
  • Global Entity Disambiguation with BERT Ikuya Yamada, Koki Washio, Hiroyuki Shindo, Yuji Matsumoto
  • Privacy-Preserving Text Classification on BERT Embeddings with Homomorphic Encryption Garam Lee, Jai Hyun Park, Minsoo Kim, seung-won hwang, Jung Hee Cheon
  • Incorporating Centering Theory into Entity Coreference Resolution Haixia Chai, Michael Strube
  • Modal Dependency Parsing via Language Model Priming Jiarui Yao, Nianwen Xue, Bonan Min
  • The USMLE® Step 2 Clinical Skills Patient Note Corpus Victoria Yaneva, Janet Mee, Le An Ha, Polina Harik, Michael Jodoin, Alex J Mechaber
  • Question-Evidence Similarity Learning for Long-Context Question Answering Avi Caciularu, Ido Dagan, Jacob Goldberger, Arman Cohan
  • Using Natural Sentence Prompts for Understanding Biases in Language Models Sarah Alnegheimish, Alicia Guo, Yi Sun
  • Data Augmentation with Dual Training for Offensive Span Detection Nasim Nouri
  • Exploring the Role of Task Transferability in Large-Scale Multi-Task Learning Vishakh Padmakumar, Leonard Lausen, Miguel Ballesteros, Sheng Zha, He He, George Karypis
  • Paragraph-based Transformer Pretraining for Multi-Sentence Inference Luca Di Liello, Siddhant Garg, Luca Soldaini, Alessandro Moschitti
  • Cheat Codes to Quantify Missing Source Information in Neural Machine Translation Proyag Pal, Kenneth Heafield
  • A Weakly Supervised Approach to Evaluating Single-Document Summarization via Negative Sampling Forrest Sheng Bao, Ge Luo, Hebi Li, Cen Chen, Yinfei Yang, Youbiao He, Minghui Qiu
  • On the Diversity and Limits of Human Explanations Chenhao Tan
  • Word Tour: One-dimensional Word Embeddings via the Traveling Salesman Problem Ryoma Sato
  • Reference-free Summarization Evaluation via Semantic Correlation and Compression Ratio Yizhu Liu, Qi Jia, Kenny Q. Zhu
  • Extending Multi-Text Sentence Fusion Resources via Pyramid Annotations Daniela Brook Weiss, Paul Roit, Ori Ernst, Ido Dagan
  • Combining Humor and Sarcasm for Improving Political Parody Detection Xiao Ao, Danae Sanchez Villegas, Daniel Preotiuc-Pietro, Nikolaos Aletras
  • BAD-X: Bilingual Adapters Improve Zero-Shot Cross-Lingual Transfer Marinela Parovic, Goran Glavaš, Ivan Vulić, Anna Korhonen
  • CIAug: Equipping Interpolative Augmentation with Curriculum Learning Ramit Sawhney, Megh Thakkar, Shrey Pandit, Ritesh Singh Soun, Sarvagya Malaviya, Yuval Pinter
  • Same Neurons, Different Languages: Probing Morphosyntax in Multilingual Pre-trained Models Karolina Stanczak, Edoardo Ponti, Lucas Torroba Hennigen, Ryan D Cotterell, Isabelle Augenstein
  • SKILL: Structured Knowledge Infusion for Large Language Models Fedor Moiseev, Zhe Dong, Enrique Alfonseca, Martin Jaggi
  • Twitter-COMMs: Detecting Climate, COVID, and Military Multimodal Misinformation Giscard Biamby, Grace Luo, Trevor Darrell, Anna Rohrbach
  • Relation-Specific Attentions over Entity Mentions for Enhanced Document-Level Relation Extraction Jiaxin Yu, Deqing Yang, Shuyu Tian
  • Pretrained Models for Multilingual Federated Learning Orion Weller, Marc Marone, Vladimir Braverman, Dawn Lawrie, Benjamin Van Durme
  • Sort by Structure: Language Model Ranking as Dependency Probing Max Müller-Eberstein, Rob van der Goot, Barbara Plank
  • Yes, No or IDK: The Challenge of Unanswerable Yes/No Questions Elior Sulem, Jamaal Hay, Dan Roth
  • $\textscAmbiPun$: Generating Humorous Puns with Ambiguous Context Anirudh Mittal, Yufei Tian, Nanyun Peng
  • Socially Aware Bias Measurements for Hindi Language Representations Vijit Malik, Sunipa Dev, Akihiro Nishi, Nanyun Peng, Kai-Wei Chang
  • On Curriculum Learning for Commonsense Reasoning Adyasha Maharana, Mohit Bansal
  • Abstraction not Memory: BERT and the English Article System Harish Tayyar Madabushi, Dagmar Divjak, Petar Milin
  • Generating Repetitions with Appropriate Repeated Words Toshiki Kawamoto, Hidetaka Kamigaito, Kotaro Funakoshi, Manabu Okumura
  • PARADISE: Exploiting Parallel Data for Multilingual Sequence-to-Sequence Pretraining Machel Reid, Mikel Artetxe
  • Easy Adaptation to Mitigate Gender Bias in Multilingual Text Classification Xiaolei Huang
  • Analyzing Modality Robustness in Multimodal Sentiment Analysis Devamanyu Hazarika, Yingting Li, Bo Cheng, Shuai Zhao, Roger Zimmermann, Soujanya Poria
  • EmRel: Joint Representation of Entities and Embedded Relations for Multi-triple Extraction Benfeng Xu, Quan Wang, Yajuan Lyu, Yabing Shi, Yong Zhu, Jie Gao, Zhendong Mao
  • Building Multilingual Machine Translation Systems That Serve Arbitrary XY Translations Akiko Eriguchi, Shufang Xie, Tao Qin, Hany Hassan
  • A Word is Worth A Thousand Dollars: Adversarial Attack on Tweets Fools Stock Prediction Yong Xie, Dakuo Wang, Pin-Yu Chen, Jinjun Xiong, Sijia Liu, Oluwasanmi O Koyejo
  • A Robustly Optimized BMRC for Aspect Sentiment Triplet Extraction Shu Liu, Kaiwen Li, Zuhe Li
  • SUBS: Subtree Substitution for Compositional Semantic Parsing Jingfeng Yang, Le Zhang, Diyi Yang
  • LEA: Meta Knowledge-Driven Self-Attentive Document Embedding for Few-Shot Text Classification Seungki Hong, Tae Young Jang
  • ErAConD: Error Annotated Conversational Dialog Dataset for Grammatical Error Correction Xun Yuan, Derek Pham, Sam Davidson, Zhou Yu
  • Language Model Augmented Monotonic Attention for Simultaneous Translation Sathish Reddy Indurthi, Mohd Abbas Zaidi, Beomseok Lee, Nikhil Kumar Lakumarapu, Sangha Kim

Special Theme Papers

  • On the Machine Learning of Ethical Judgments from Natural Language Zeerak Talat, Hagen Blix, Josef Valvoda, Maya Indira Ganesh, Ryan D Cotterell, Adina Williams
  • User-Centric Gender Rewriting Bashar Alhafni, Nizar Habash, Houda Bouamor
  • Machine-in-the-Loop Rewriting for Creative Image Captioning Vishakh Padmakumar, He He
  • Explaining Why: How Instructions and User Interfaces Impact Annotator Rationales When Labeling Text Data Jamar L. Sullivan, Will Brackenbury, Andrew McNutt, Kevin Bryson, Kwam Byll, Yuxin Chen, Michael Littman, Chenhao Tan, Blase Ur
  • Automatic Correction of Human Translations Jessy Lin, Geza Kovacs, Aditya Shastry, Joern Wuebker, John DeNero
  • An Exploration of Post-Editing Effectiveness in Text Summarization Vivian Lai, Alison Smith-Renner, Ke Zhang, Ruijia Cheng, Wenjuan Zhang, Joel R. Tetreault, Alejandro Jaimes
  • Mapping the Design Space of Human-AI Interaction in Text Summarization Ruijia Cheng, Alison Smith-Renner, Ke Zhang, Joel R. Tetreault, Alejandro Jaimes
  • User-driven research of medical Note Generation software Tom Knoll, Francesco Moramarco, Alex Papadopoulos Korfiatis, Rachel Young, Claudia Ruffini, Mark Perera, Christian Perstl, Ehud Reiter, Anya Belz, Aleksandar Savkov
  • The Why and The How: A Survey on Natural Language Interaction in Visualization Henrik Voigt, Ozge Alacam, Monique Meuschke, Kai Lawonn, Sina Zarrieß
  • Deconstructing NLG Evaluation: Evaluation Practices, Assumptions, and Their Implications Kaitlyn Zhou, Su Lin Blodgett, Adam Trischler, Hal Daumé III, Kaheer Suleman, Alexandra Olteanu
  • Towards Process-Oriented, Modular, and Versatile Question Generation that Meets Educational Needs Xu Wang, Simin Fan, Jessica Houghton, Lu Wang
  • Do Deep Neural Nets Display Human-like Attention in Short Answer Scoring? Zijie Zeng, XINYU LI, Dragan Gasevic, Guanliang Chen
  • Two Contrasting Data Annotation Paradigms for Subjective NLP Tasks Paul Rottger, Bertie Vidgen, Dirk Hovy, Janet B. Pierrehumbert
  • What Makes a Good and Useful Summary? Incorporating Users in Automatic Summarization Research Maartje Ter Hoeve, Julia Kiseleva, Maarten de Rijke

Findings

  • Cross-Domain Classification of Moral Values Enrico Liscio, Alin Eugen Dondera, Andrei Geadau, Catholijn M Jonker, Pradeep Kumar Murukannaiah
  • ID10M: Idiom Identification in 10 Languages Simone Tedeschi, Federico Martelli, Roberto Navigli
  • Query2Particles: Knowledge Graph Reasoning with Particle Embeddings Jiaxin Bai, Zihao Wang, Hongming Zhang, Yangqiu Song
  • RoViST: Learning Robust Metrics for Visual Storytelling Eileen Wang, Caren Han, Josiah Poon
  • Explore More Guidance: A Task-aware Instruction Network for Sign Language Translation Enhanced with Data Augmentation Yong Cao, Wei Li, Xianzhi Li, Min Chen, Guangyong Chen, Long Hu, Zhengdao Li, Kai Hwang
  • Negative Sample is Negative in Its Own Way: Tailoring Negative Sentences for Image-Text Retrieval Zhihao Fan, zhongyu wei, Zejun Li, Siyuan Wang, Xuanjing Huang, Jianqing Fan
  • Label Refinement via Contrastive Learning for Distantly-Supervised Named Entity Recognition Huaiyuan Ying, Shengxuan Luo, Tiantian Dang, Sheng Yu
  • EA$^2$E: Improving Consistency with Event Awareness for Document-Level Argument Extraction Qi Zeng, Qiusi Zhan, Heng Ji
  • Learning from Bootstrapping and Stepwise Reinforcement Reward: A Semi-Supervised Framework for Text Style Transfer Zhengyuan Liu, Nancy F. Chen
  • Bridging the Gap between Training and Inference: Multi-Candidate Optimization for Diverse Neural Machine Translation Huan Lin, Baosong Yang, Liang Yao, Dayiheng Liu, Haibo Zhang, jun xie, Min Zhang, Jinsong Su
  • AdapterBias: Parameter-efficient Token-dependent Representation Shift for Adapters in NLP Tasks Chin-Lun Fu, Zih-Ching Chen, Yun-Ru Lee, Hung-yi Lee
  • TANet: Thread-Aware Pretraining for Abstractive Conversational Summarization Ze Yang, Christian WANG, Zhoujin Tian, Wei Wu, Zhoujun Li
  • KETOD: Knowledge-Enriched Task-Oriented Dialogue Zhiyu Chen, Bing Liu, Seungwhan Moon, Chinnadhurai Sankar, Paul A. Crook, William Yang Wang
  • Zero-Shot Event Detection Based on Ordered Contrastive Learning and Prompt-Based Prediction Senhui Zhang, Tao Ji, Wendi Ji, Xiaoling Wang
  • DialoKG: Knowledge-Structure Aware Task-Oriented Dialogue Generation Md Rashad Al Hasan Rony, Ricardo Usbeck, Jens Lehmann
  • Detect Rumors in Microblog Posts for Low-Resource Domains via Adversarial Contrastive Learning Hongzhan Lin, Jing Ma, Liangliang Chen, Zhiwei Yang, Mingfei Cheng, Guang Chen
  • Weakly Supervised Text-to-SQL Parsing through Question Decomposition Tomer Wolfson, Daniel Deutch, Jonathan Berant
  • MTG: A Benchmark Suite for Multilingual Text Generation Yiran Chen, Zhenqiao Song, Xianze Wu, Danqing Wang, Jingjing Xu, Jiaze Chen, Hao Zhou, Lei Li
  • TaCL: Improving BERT Pre-training with Token-aware Contrastive Learning Yixuan Su, Fangyu Liu, Zaiqiao Meng, Tian Lan, Lei Shu, Ehsan Shareghi, Nigel Collier
  • AMRize, then Parse! Enhancing AMR Parsing with PseudoAMR Data Liang Chen, Peiyi Wang, Runxin Xu, Tianyu Liu, Zhifang Sui, Baobao Chang
  • Latent Group Dropout for Multilingual and Multidomain Machine Translation Minh-Quang PHAM, François Yvon, Josep Crego
  • RCL: Relation Contrastive Learning for Zero-Shot Relation Extraction Shusen Wang, Bosen Zhang, Yajing Xu, Yanan Wu, Bo Xiao
  • Textual Entailment for Event Argument Extraction: Zero- and Few-Shot with Multi-Source Learning Oscar Sainz, Itziar Gonzalez-Dios, Oier Lopez de Lacalle, Bonan Min, Eneko Agirre
  • Learning Discriminative Representations for Open Relation Extraction with Instance Ranking and Label Calibration Shusen Wang, Bin Duan, Yanan Wu, Yajing Xu
  • The Case for a Single Model that can Both Generate Continuations and Fill-in-the-Blank Daphne Ippolito, Liam Dugan, Emily Reif, Ann Yuan, Andy Coenen, Chris Callison-Burch
  • CCQA: A New Web-Scale Question Answering Dataset for Model Pre-Training Patrick Huber, Armen Aghajanyan, Barlas Oguz, Dmytro Okhonko, Scott Yih, Sonal Gupta, Xilun Chen
  • Unsupervised Domain Adaptation for Question Generation with DomainData Selection and Self-training Peide Zhu, Claudia Hauff
  • Am I Me or You? State-of-the-Art Dialogue Models Cannot Maintain an Identity Kurt Shuster, Jack Urbanek, Arthur Szlam, Jason E Weston
  • Context-Aware Language Modeling for Goal-Oriented Dialogue Systems Charlie Victor Snell, Mengjiao Yang, Justin Fu, Yi Su, Sergey Levine
  • Jointly Learning Guidance Induction and Faithful Summary Generation via Conditional Variational Autoencoders Wang Xu, Tiejun Zhao
  • Continual Machine Reading Comprehension via Uncertainty-aware Fixed Memory and Adversarial Domain Adaptation Zhijing Wu, Hua Xu, Jingliang Fang
  • Denoising Neural Network for News Recommendation with Positive and Negative Implicit Feedback Yunfan Hu, Zhaopeng Qiu, Xian Wu
  • Analytical Reasoning of Text Wanjun Zhong, Siyuan Wang, Duyu Tang, Zenan Xu, Daya Guo, Yining Chen, Jiahai Wang, Jian Yin, Ming Zhou, Nan Duan
  • Weakly Supervised Text Classification using Supervision Signals from a Language Model Ziqian Zeng, Weimin Ni, Tianqing Fang, Xiang Li, Xinran Zhao, Yangqiu Song
  • CLMLF:A Contrastive Learning and Multi-Layer Fusion Method for Multimodal Sentiment Detection Zhen Li, Bing Xu, Conghui Zhu, Tiejun Zhao
  • LiST: Lite Prompted Self-training Makes Efficient Few-shot Learners Yaqing Wang, Subhabrata Mukherjee, Xiaodong Liu, Jing Gao, Ahmed Hassan Awadallah, Jianfeng Gao
  • Improve Discourse Dependency Parsing with Contextualized Representations Yifei Zhou, Yansong Feng
  • A Dog Is Passing Over The Jet? A Text-Generation Dataset for Korean Commonsense Reasoning and Evaluation Jaehyung Seo, Seounghoon Lee, Chanjun Park, Yoonna Jang, Hyeonseok Moon, Sugyeong Eo, Seonmin Koo, Heuiseok Lim
  • A Label-Aware Autoregressive Framework for Cross-Domain NER Jinpeng Hu, He Zhao, Dan dan Guo, Xiang Wan, Tsung-Hui Chang
  • D2GCLF: Document-to-Graph Classifier for Legal Document Classification Qiqi Wang, Kaiqi Zhao, Robert Amor, Benjamin Liu, Ruofan Wang
  • Specializing Pre-trained Language Models for Better Relational Reasoning via Network Pruning Siyu Ren, Kenny Q. Zhu
  • On the Limitations of Dataset Balancing: The Lost Battle Against Spurious Correlations Roy Schwartz, Gabriel Stanovsky
  • Multi-stage Distillation Framework for Cross-Lingual Semantic Similarity Matching Kunbo Ding, Weijie Liu, Yuejian Fang, Zhe Zhao, Qi Ju, Xuefeng Yang, Rong Tian, Zhu Tao, Haoyan Liu, Han Guo, Xingyu Bai, Weiquan Mao, Yudong Li, Weigang Guo, Taiqiang Wu, Ningyuan Sun
  • BORT: Back and Denoising Reconstruction for End-to-End Task-Oriented Dialog Haipeng Sun, Junwei Bao, Youzheng Wu, Xiaodong He
  • CL-ReLKT: Cross-lingual Language Knowledge Transfer for Multilingual Retrieval Question Answering Peerat Limkonchotiwat, Wuttikorn Ponwitayarat, Can Udomcharoenchaikit, Ekapol Chuangsuwanich, Sarana Nutanong
  • Towards Job-Transition-Tag Graph for a Better Job Title Representation Learning Jun ZHU, CELINE HUDELOT
  • Semantic-Preserving Abstractive Text Summarization with Siamese Generative Adversarial Net Xin Sheng, Linli Xu, Yinlong Xu, Deqiang Jiang, Bo Ren
  • Balancing Multi-Domain Corpora Learning for Open-Domain Response Generation Yujie Xing, Jinglun Cai, Nils Barlaug, Peng Liu, Jon Atle Gulla
  • Controllable Sentence Simplification via Operation Classification Liam Cripwell, Joël Legrand, Claire Gardent
  • Learning Structural Information for Syntax-Controlled Paraphrase Generation Erguang Yang, Chenglin Bai, Deyi Xiong, Yujie Zhang, Yao Meng, Jinan Xu, Yufeng Chen
  • Capturing Conversational Interaction for Question Answering via Global History Reasoning Jin Qian, Bowei Zou, Mengxing Dong, Xiao Li, AiTi Aw, Yu Hong
  • Learning to Execute Actions or Ask Clarification Questions Zhengxiang Shi, Yue Feng, Aldo Lipani
  • Por Qué Não Utiliser Alla Språk? Mixed Training with Gradient Optimization in Few-Shot Cross-Lingual Transfer Haoran Xu, Kenton Murray
  • Beyond Distributional Hypothesis: Let Language Models Learn Meaning-Text Correspondence M.J Jang, Frank Martin Mtumbuka, Thomas Lukasiewicz
  • Challenges in Generalization in Open Domain Question Answering Linqing Liu, Patrick Lewis, Sebastian Riedel, Pontus Stenetorp
  • NLU++: A Multi-Label, Slot-Rich, Generalisable Dataset for Natural Language Understanding in Task-Oriented Dialogue Inigo Casanueva, Ivan Vulić, Georgios P. Spithourakis, Paweł Budzianowski
  • Uncertainty-Aware Cross-Lingual Transfer with Pseudo Partial Labels Shuo Lei, Xuchao Zhang, Jianfeng He, Fanglan Chen, Chang-Tien Lu
  • What kinds of errors do reference resolution models make and what can we learn from them? Jorge Sánchez, Mauricio Mazuecos, Hernán Maina, Luciana Benotti
  • Great Power, Great Responsibility: Recommendations for Reducing Energy for Training Language Models Joseph McDonald, Baolin Li, Nathan C. Frey, Devesh Tiwari, Vijay Gadepally, Siddharth Samsi
  • Event Detection for Suicide Understanding Luis Fernando Guzman-Nateras, Viet Dac Lai, Amir Pouran Ben Veyseh, Franck Dernoncourt, Thien Huu Nguyen
  • BehancePR: A Punctuation Restoration Dataset for Livestreaming Video Transcript Viet Dac Lai, Amir Pouran Ben Veyseh, Franck Dernoncourt, Thien Huu Nguyen
  • XLTime: A Cross-Lingual Knowledge Transfer Framework for Temporal Expression Extraction Yuwei Cao, William Groves, Tanay Kumar Saha, Joel R. Tetreault, Alex Jaimes, Hao Peng, Philip S. Yu
  • Make The Most of Prior Data: A Solution for Interactive Text Summarization with Preference Feedback Duy-Hung Nguyen, Nguyen Viet Dung Nghiem, Bao-Sinh Nguyen, Tien Dung Le, Minh-Tien Nguyen, Shahab Sabahi, Hung Le
  • A Timestep aware Sentence Embedding and Acme Coverage for Brief but Informative Title Generation Quanbin Wang, XieXiong Lin, Feng Wang
  • METGEN: A Module-based Entailment Tree Generation Framework for Answer Explanation Ruixin Hong, Hongming Zhang, Xintong Yu, Changshui Zhang
  • Delving Deep into Regularity: A Simple but Effective Method for Chinese Named Entity Recognition Yingjie Gu, Xiaoye Qu, Zhefeng Wang, Yi ZHENG, Baoxing Huai, Nicholas Jing Yuan
  • Cross-Lingual Cross-Modal Consolidation for Effective Multilingual Video Corpus Moment Retrieval Jiaheng Liu, Tan Yu, Hanyu Peng, Mingming Sun, Ping Li
  • SeaD: End-to-end Text-to-SQL Generation with Schema-aware Denoising Kuan Xu, Yongbo Wang, Yongliang Wang, Zihao Wang, Zujie Wen, Yang Dong
  • HUE: Pretrained Model and Dataset for Understanding Hanja Documents of Ancient Korea Haneul Yoo, Jiho Jin, Juhee Son, JinYeong Bak, Kyunghyun Cho, Alice Oh
  • Learn from Relation Information: Towards Prototype Representation Rectification for Few-Shot Relation Extraction Yang Liu, Jinpeng Hu, Xiang Wan, Tsung-Hui Chang
  • Exploiting Numerical-Contextual Knowledge to Improve Numerical Reasoning in Question Answering Jeonghwan Kim, Junmo Kang, Kyung-min Kim, Giwon Hong, Sung-Hyon Myaeng
  • Exploring the Universal Vulnerability of Prompt-based Learning Paradigm Lei Xu, Yangyi Chen, Ganqu Cui, Hongcheng Gao, Zhiyuan Liu
  • Crake: Causal-Enhanced Table-Filler for Question Answering over Large Scale Knowledge Base Minhao Zhang, Ruoyu Zhang, Yanzeng Li, Lei Zou
  • Minimally-Supervised Relation Induction from Pre-trained Language Model Lu Sun, Yongliang Shen, Weiming Lu
  • When do Contrastive Word Alignments Improve Many-to-many Neural Machine Translation? Zhuoyuan Mao, Chenhui Chu, Raj Dabre, Haiyue Song, Zhen Wan, Sadao Kurohashi
  • Detecting Narrative Elements in Informational Text Effi Levi, Guy Mor, Tamir Sheafer, Shaul Shenhav
  • Analyzing the Intensity of Complaints on Social Media MING FANG, Shi Zong, Jing Li, Xinyu Dai, Shujian Huang, Jiajun Chen
  • $Great~Truths~are ~Always ~Simple:$ A Rather Simple Knowledge Encoder for Enhancing the Commonsense Reasoning Capacity of Pre-Trained Models Jinhao Jiang, Kun Zhou, Ji-Rong Wen, Xin Zhao
  • Identifying and Mitigating Spurious Correlations for Improving Robustness in NLP Models Tianlu Wang, Rohit Sridhar, Diyi Yang, Xuezhi Wang
  • Revisiting Generative Commonsense Reasoning: A Pre-Ordering Approach Chao Zhao, Faeze Brahman, Tenghao Huang, Snigdha Chaturvedi
  • GraphCache: Message Passing as Caching for Sentence-Level Relation Extraction Yiwei Wang, Muhao Chen, Wenxuan Zhou, Yujun Cai, Yuxuan Liang, Bryan Hooi
  • Zero-shot Entity Linking with Less Data G P Shrivatsa Bhargav, Dinesh Khandelwal, Saswati Dana, Dinesh Garg, Pavan Kapanipathi, Salim Roukos, Alexander Gray, L Venkata Subramaniam
  • A Dual-Channel Framework for Sarcasm Recognition by Detecting Sentiment Conflict Yiyi Liu, Yequan Wang, Aixin Sun, Xuying Meng, Jing Li, Jiafeng Guo
  • Post-Training Dialogue Summarization using Pseudo-Paraphrasing Qi Jia, Yizhu Liu, Haifeng Tang, Kenny Q. Zhu
  • EVI: Multilingual Spoken Dialogue Tasks and Dataset for Knowledge-Based Enrolment, Verification, and Identification Georgios P. Spithourakis, Ivan Vulić, Michał Lis, Inigo Casanueva, Paweł Budzianowski
  • Pruning Adatperfusion with Lottery Ticket Hypothesis Jiarun Wu, Qingliang Chen, Zeguan Xiao, Yuliang Gu, Mengsi Sun
  • The Role of Context in Detecting Previously Fact-Checked Claims Shaden Shaar, Firoj Alam, Giovanni Da San Martino, Preslav Nakov
  • Good Visual Guidance Make A Better Extractor: Hierarchical Visual Prefix for Multimodal Entity and Relation Extraction Xiang Chen, Ningyu Zhang, Lei Li, Yunzhi Yao, Shumin Deng, Chuanqi Tan, Fei Huang, Luo Si, Huajun Chen
  • Dependency Position Encoding for Relation Extraction Qiushi Guo, Xin Wang, Dehong Gao
  • KD-VLP: Improving End-to-End Vision-and-Language Pretraining with Object Knowledge Distillation Yongfei Liu, Chenfei Wu, Shao-Yen Tseng, Vasudev Lal, Xuming He, Nan Duan
  • DISARM: Detecting the Victims Targeted by Harmful Memes Shivam Sharma, Md Shad Akhtar, Preslav Nakov, Tanmoy Chakraborty
  • Hierarchical Transformers Are More Efficient Language Models Piotr Nawrot, Szymon Tworkowski, Michał Tyrolski, Lukasz Kaiser, Yuhuai Wu, Christian Szegedy, Henryk Michalewski
  • White-box Testing of NLP models with Mask Neuron Coverage Arshdeep Sekhon, Yangfeng Ji, Matthew Dwyer, Yanjun Qi
  • UniK-QA: Unified Representations of Structured and Unstructured Knowledge for Open-Domain Question Answering Barlas Oguz, Xilun Chen, Vladimir Karpukhin, Stan Peshterliev, Dmytro Okhonko, Michael Sejr Schlichtkrull, Sonal Gupta, Yashar Mehdad, Scott Yih
  • Domain-matched Pre-training Tasks for Dense Retrieval Barlas Oguz, Kushal Lakhotia, Anchit Gupta, Patrick Lewis, Vladimir Karpukhin, Aleksandra Piktus, Xilun Chen, Sebastian Riedel, Scott Yih, Sonal Gupta, Yashar Mehdad
  • Efficient Few-Shot Fine-Tuning for Opinion Summarization Arthur Brazinskas, Ramesh Nallapati, Mohit Bansal, Markus Dreyer
  • Temporal Attention for Language Models Guy D. Rosin, Kira Radinsky
  • MixQG: Neural Question Generation with Mixed Answer Types Lidiya Murakhovs’ka, Chien-Sheng Wu, Philippe Laban, Tong Niu, Wenhao Liu, Caiming Xiong
  • BitextEdit: Automatic Bitext Editing for Improved Low-Resource Machine Translation Eleftheria Briakou, Sida Wang, Luke Zettlemoyer, Marjan Ghazvininejad
  • Exploring Neural Models for Query-Focused Summarization Jesse Vig, Alexander Fabbri, Wojciech Maciej Kryscinski, Chien-Sheng Wu, Wenhao Liu
  • Pathway2Text: Dataset and Method for Biomedical Pathway Description Generation Junwei Yang, Zequn Liu, Ming Zhang, Sheng Wang
  • All Information is Valuable: Question Matching over Full Information Transmission Network Le Qi, Yu Zhang, Qingyu Yin, Guidong Zheng, wen junjie, Jinlong Li, Ting Liu
  • Improving Few-Shot Image Classification Using Machine- and User-Generated Natural Language Descriptions Kosuke Nishida, Kyosuke Nishida, Shuichi Nishioka
  • Learn To Remember: Transformer with Recurrent Memory for Document-level Machine Translation Yukun Feng, Feng Li, Ziang Song, Boyuan Zheng, Philipp Koehn
  • Unbiased Math Word Problems Benchmark for Mitigating Solving Bias ZhiCheng Yang, Jinghui Qin, Jiaqi Chen, Xiaodan Liang
  • RAIL-KD: RAndom Intermediate Layer Mapping for Knowledge Distillation Md Akmal Haidar, NITHIN ANCHURI, Mehdi Rezagholizadeh, Abbas Ghaddar, Philippe Langlais, Pascal Poupart
  • Empowering parameter-efficient transfer learning by recognizing the kernel structure in self-attention Yifan Chen, Devamanyu Hazarika, Mahdi Namazifar, Yang Liu, Di Jin, Dilek Hakkani-Tur
  • Low-resource Entity Set Expansion: A Comprehensive Study on User-generated Text Yutong Shao, Nikita Bhutani, Sajjadur Rahman, Estevam Hruschka
  • ALLSH: Active Learning Guided by Local Sensitivity and Hardness Shujian Zhang, Chengyue Gong, Xingchao Liu, Pengcheng He, Weizhu Chen, Mingyuan Zhou
  • BanglaBERT: Language Model Pretraining and Benchmarks for Low-Resource Language Understanding Evaluation in Bangla Abhik Bhattacharjee, Tahmid Hasan, Kazi Samin Mubasshir, Md Saiful Islam, Wasi Uddin Ahmad, Anindya Iqbal, M. Sohel Rahman, Rifat Shahriyar
  • To Answer or Not To Answer? Improving Machine Reading Comprehension Model with Span-based Contrastive Learning Yunjie Ji, Liangyu Chen, Chenxiao Dou, Baochang Ma, Xiangang Li
  • Syntax Controlled Knowledge Graph-to-Text Generation with Order and Semantic Consistency Jin Liu, Chongfeng Fan, Fengyu Zhou, Huijuan Xu
  • A Survey on Stance Detection for Mis- and Disinformation Identification Momchil Hardalov, Arnav Arora, Preslav Nakov, Isabelle Augenstein
  • FAtNet: Cost-Effective Approach Towards Mitigating the Linguistic Bias in Speaker Verification Systems Divya V Sharma, Arun Balaji Buduru
  • Retrieval-Augmented Multilingual Keyphrase Generation with Retriever-Generator Iterative Training Yifan Gao, Qingyu Yin, zheng li, Rui Meng, Tong Zhao, Bing Yin, Irwin King, Michael Lyu
  • End-to-end Spoken Conversational Question Answering: Task, Dataset and Model Chenyu You, Nuo Chen, Fenglin Liu, Shen Ge, Xian Wu, Yuexian Zou
  • Towards Computationally Feasible Deep Active Learning Akim Tsvigun, Artem Shelmanov, Gleb Kuzmin, Leonid Sanochkin, Daniil Larionov, Gleb Gennadjevich Gusev, Manvel Avetisian, Leonid Zhukov
  • DecBERT: Enhancing the Language Understanding of BERT with Causal Attention Masks Ziyang Luo, Yadong Xi, Jing Ma, Zhiwei Yang, Xiaoxi Mao, Changjie Fan, Rongsheng Zhang
  • Dangling-Aware Entity Alignment with Mixed High-Order Proximities Juncheng Liu, Zequn Sun, Bryan Hooi, Yiwei Wang, Dayiheng Liu, Baosong Yang, Xiaokui Xiao, Muhao Chen
  • Improving Code-Switching Dependency Parsing with Semi-Supervised Auxiliary Tasks Şaziye Betül Özateş, Arzucan Özgür, Tunga Gungor, Özlem Çetinoğlu
  • Speeding Up Entmax Maxat Tezekbayev, Vassilina Nikoulina, Matthias Gallé, Zhenisbek Assylbekov
  • OTExtSum: Extractive Text Summarisation with Optimal Transport Peggy Tang, Kun Hu, Rui Yan, Lei Zhang, Junbin Gao, Zhiyong Wang
  • Prompt Augmented Generative Replay via Supervised Contrastive Learning for Lifelong Intent Detection VAIBHAV VARSHNEY, Mayur Patidar, Rajat Kumar, Lovekesh Vig, Gautam Shroff
  • Phrase-level Textual Adversarial Attack with Label Preservation Yibin Lei, Yu Cao, Dianqi Li, Tianyi Zhou, Meng Fang, Mykola Pechenizkiy
  • Seeing the wood for the trees: a contrastive regularization method for the low-resource Knowledge Base Question Answering Junping Liu, Shijie Mei, Xinrong Hu, Xun Yao, JACK Yang, Yi Guo
  • RGL: A Simple yet Effective Relation Graph Augmented Prompt-based Tuning Approach for Few-Shot Learning Yaqing Wang, Xin Tian, Haoyi Xiong, Yueyang Li, Zeyu Chen, Sheng Guo, Dejing Dou
  • CODE-MVP: Learning to Represent Source Code from Multiple Views with Contrastive Pre-Training Xin Wang, Yasheng Wang, Yao Wan, Jiawei Wang, Pingyi Zhou, Li Li, Hao Wu, Jin Liu
  • A Self-supervised Joint Training Framework for Document Reranking Xiaozhi Zhu, Tianyong Hao, Sijie Cheng, Fu Lee Wang, Hai Liu
  • ‘Diversity and Uncertainty in Moderation’’ are the Key to Data Selection for Multilingual Few-shot Transfer Shanu Kumar, Sandipan Dandapat, Monojit Choudhury
  • Probing the Role of Positional Information in Vision-Language Models Philipp J. Rösch, Jindřich Libovický
  • Masked Summarization to Generate Factually Inconsistent Summaries for Improved Factual Consistency Checking Hwanhee Lee, Kang Min Yoo, Joonsuk Park, Hwaran Lee, Kyomin Jung
  • Restoring Hebrew Diacritics Without a Dictionary Elazar Gershuni, Yuval Pinter
  • MWP-BERT: Numeracy-Augmented Pre-training for Math Word Problem Solving Zhenwen Liang, Jipeng Zhang, Lei Wang, Wei QIN, Yunshi Lan, Jie Shao, Xiangliang Zhang
  • QLEVR: A Diagnostic Dataset for Quantificational Language and Elementary Visual Reasoning Zechen Li, Anders Søgaard
  • MM-Claims: A Dataset for Multimodal Claim Detection in Social Media Gullal Singh Cheema, Sherzod Hakimov, Abdul Sittar, Eric Müller-Budack, Christian Otto, Ralph Ewerth
  • SHARP: Search-Based Adversarial Attack for Structured Prediction Liwen Zhang, Zixia Jia, Wenjuan Han, Zilong Zheng, Kewei Tu
  • Self-Training with Differentiable Teacher Simiao Zuo, Yue Yu, Chen Liang, Haoming Jiang, Siawpeng Er, Chao Zhang, Tuo Zhao, Hongyuan Zha
  • An Information-Theoretic Approach and Dataset for Probing Gender Stereotypes in Multilingual Masked Language Models Victor Steinborn, Philipp Dufter, Haris Jabbar, Hinrich Schütze
  • Improving Contextual Representation with Gloss Regularized Pre-training Yu Lin, Zhecheng An, Peihao Wu, Zejun MA
  • Learning Rich Representation of Keyphrases from Text Mayank Kulkarni, Debanjan Mahata, Ravneet Singh Arora, Rajarshi Bhowmik
  • Efficient Learning of Multiple NLP Tasks via Collective Weight Factorization on BERT Christos Charalampos Papadopoulos, Yannis Panagakis, Manolis Koubarakis, Mihalis Nicolaou
  • The Limits of Word Level Differential Privacy Justus Mattern, Benjamin Weggenmann, Florian Kerschbaum
  • Attention Fusion: a light yet efficient late fusion mechanism for task adaptation in NLU Jin Cao, Chandana Satya Prakash, Wael Hamza
  • Empathetic Persuasion: Reinforcing Empathy and Persuasiveness in Dialogue Systems Azlaan Mustafa Samad, Kshitij Mishra, Mauajama Firdaus, Asif Ekbal
  • Measuring and Improving Compositional Generalization in Text-to-SQL via Component Alignment Yujian Gan, Xinyun Chen, Qiuping Huang, Matthew Purver
  • Learning to Embed Multi-Modal Contexts for Situated Conversational Agents Yunseon Choi, Oh Joon Kwon, Haeju Lee, Kee-Eung Kim, Jinhyeon Kim, Ran Han, Yoonhyung Kim, Youngjune Lee, Minho Park, Kangwook Lee, Haebin Shin
  • MultiNER: A Multilingual, Multi-Genre and Fine-Grained Dataset for Named Entity Recognition Simone Tedeschi, Roberto Navigli
  • Permutation Invariant Strategy Using Transformer Encoders for Table Understanding Sarthak Dash, Sugato Bagchi, Nandana Mihindukulasooriya, Alfio Gliozzo
  • A Generative Language Model for Few-shot Aspect-Based Sentiment Analysis Ehsan Hosseini-Asl, Wenhao Liu, Caiming Xiong
  • LM-CORE: Language Models with Contextually Relevant External Knowledge Jivat Neet Kaur, Sumit Bhatia, Milan Aggarwal, Rachit Bansal, Balaji Krishnamurthy
  • Challenging America: Modeling language in longer time scales Jakub Pokrywka, Filip Graliński, Krzysztof Jassem, Karol Kaczmarek, Krzysztof Jan Jurkiewicz, Piotr Wierzchon
  • LongT5: Efficient Text-To-Text Transformer for Long Sequences Mandy Guo, Joshua Ainslie, David Uthus, Santiago Ontanon, Jianmo Ni, Yun-Hsuan Sung, Yinfei Yang
  • A Versatile Adaptive Curriculum Learning Framework for Task-oriented Dialogue Policy Learning Yang Yang Zhao, Hua Qin, Wang Zhenyu, Changxi Zhu, Shihan Wang
  • Data Augmentation for Low-Resource Dialogue Summarization Joshua Maynez, Yongtai Liu, Shashi Narayan, Gonçalo Simões
  • Entity Cloze By Date: Understanding what LMs know about unseen entities Yasumasa Onoe, Michael JQ Zhang, Eunsol Choi, Greg Durrett
  • LMTurk: Few-Shot Learners as Crowdsourcing Workers in a Language-Model-as-a-Service Framework Mengjie Zhao, Fei Mi, Yasheng Wang, Minglei Li, Xin Jiang, Qun Liu, Hinrich Schuetze
  • Opponent Modeling in Negotiation Dialogues by Related Data Adaptation Kushal Chawla, Gale Lucas, Jonathan May, Jonathan Gratch
  • Language Models for Code-switch Detection of te reo Māori and English in a Low-resource Setting Jesin James, Vithya Yogarajan, Isabella Shields, Catherine Watson, Peter Keegan, Keoni Mahelona, Peter-Lucas Jones
  • CLEAR: Improving Vision-Language Navigation with Cross-Lingual, Environment-Agnostic Representations Jialu Li, Hao Tan, Mohit Bansal
  • CoCoA-MT: A Dataset and Benchmark for Contrastive Controlled MT with Application to Formality Maria Nadejde, Anna Currey, Benjamin Hsu, Xing Niu, Georgiana Dinu, Marcello Federico
  • StATIK: Structure and Text for Inductive Knowledge Graph Completion Elan Sopher Markowitz, Keshav Balasubramanian, Mehrnoosh Mirtaheri, Murali Annavaram, Aram Galstyan, Greg Ver Steeg
  • Instilling Type Knowledge in Language Models via Multi-Task QA Shuyang Li, Mukund Sridhar, Chandana Satya Prakash, Jin Cao, Wael Hamza, Julian McAuley
  • Penn-Helsinki Parsed Corpus of Early Modern English: First Parsing Results and Analysis Seth Kulick, Neville Ryant, Beatrice Santorini
  • Anti-Overestimation Dialogue Policy Learning for Task-Completion Dialogue System Chang Tian, Wenpeng Yin, Marie-Francine Moens
  • On Measuring Social Biases in Prompt-Based Learning Afra Feyza Akyürek, Sejin Paik, Muhammed Yusuf Kocyigit, Seda Akbiyik, Şerife Leman Runyun, Derry Wijaya
  • Modeling Ideological Salience and Framing in Polarized Online Groups with Graph Neural Networks and Structured Sparsity Valentin Hofmann, Xiaowen Dong, Janet B. Pierrehumbert, Hinrich Schuetze
  • Improving the Faithfulness of Abstractive Summarization via Entity Coverage Control Haopeng Zhang, Semih Yavuz, Wojciech Maciej Kryscinski, Kazuma Hashimoto, Yingbo Zhou
  • Fine-grained Image Captioning with CLIP Reward Jaemin Cho, Seunghyun Yoon, Ajinkya Kale, Franck Dernoncourt, Trung Bui, Mohit Bansal
  • Harmless Transfer Learning for Item Embeddings Chengyue Gong, xiaocong du, Dhruv Choudhary, Bhargav Bhushanam, qiang liu, Arun Kejariwal
  • A Question-Answer Driven Approach to Reveal Affirmative Interpretations from Verbal Negations Md Mosharaf Hossain, Luke Holman, Anusha Kakileti, Tiffany Iris Kao, Nathan Raul Brito, Aaron Abraham Mathews, Eduardo Blanco
  • Video-based Multimodal Intent Discovery Adyasha Maharana, Quan Hung Tran, Seunghyun Yoon, Franck Dernoncourt, Trung Bui, Walter W Chang, Mohit Bansal
  • Entailment Tree Explanations via Iterative Retrieval-Generation Reasoner Danilo Neves Ribeiro, Shen Wang, Xiaofei Ma, Henghui Zhu, Rui Dong, Xinchi Chen, Peng Xu, zhiheng huang, Andrew Arnold, Dan Roth
  • Improving Few-Shot Relation Classification by Prototypical Representation Learning with Definition Text Li Zhenzhen, Yuyang Zhang, Jian-Yun Nie, Dongsheng Li
  • Literature-Augmented Clinical Outcome Prediction Aakanksha Naik, Sravanthi Parasa, Sergey Feldman, Lucy Lu Wang, Tom Hope
  • DOCmT5: Document-Level Pre-training of Multilingual Language Models Chia-Hsuan Lee, Aditya Siddhant, Viresh Ratnakar, Melvin Johnson
  • Few-Shot Self-Rationalization with Natural Language Prompts Ana Marasovic, Iz Beltagy, Doug Downey, Matthew E Peters
  • From Cognitive to Computational Modeling: Text-based Risky Decision-Making Guided by Fuzzy Trace Theory Jaron Mar, Jiamou Liu
  • Extracting Temporal Event Relation with Syntax-guided Graph Transformer SHUAICHENG ZHANG, Qiang Ning, Lifu Huang
  • TEAM: A multitask learning based Taxonomy Expansion approach for Attach and Merge Bornali Phukon, Anasua Mitra, Ranbir Singh Sanasam, Priyankoo Sarmah
  • Learning to repair: Repairing model output errors after deployment using a dynamic memory of feedback Niket Tandon, Aman Madaan, Peter Clark, Yiming Yang
  • PCEE-BERT: Accelerating BERT Inference via Patient and Confident Early Exiting Zhen Zhang, Wei Zhu, Jinfan Zhang, Peng Wang, Rize Jin, Tae-Sun Chung
  • Hierarchical Relation-Guided Type-Sentence Alignment for Long-Tail Relation Extraction with Distant Supervision Yang Li, Guodong Long, Tao Shen, Jing Jiang
  • Exploring the Value of Multi-View Learning for Session-Aware Query Representation Diego Ortiz, Jose G Moreno, Gilles Hubert, Karen Pinel-Sauvagnat, Lynda Tamine Tamine
  • A Framework to Generate High-quality Datapoints for Multiple Novel Intent Detection Ankan Mullick, Sukannya Purkayastha, Pawan Goyal, Niloy Ganguly
  • Zero-shot Cross-lingual Conversational Semantic Role Labeling Han Wu, Haochen Tan, Kun Xu, Shuqi LIU, Lianwei Wu, Linqi Song
  • PerKGQA: Question Answering over Personalized Knowledge Graphs Ritam Dutt, Kasturi Bhattacharjee, Rashmi Gangadharaiah, Dan Roth, Carolyn Rose
  • FreeTransfer-X: Safe and Annotation-Free Cross-Lingual Transfer for Different Networks Yinpeng Guo, Liangyou Li, Xin Jiang, Qun Liu
  • Lacuna Reconstruction: Self-supervised Pre-training for Low-Resource Historical Document Transcription Nikolai Vogler, Jonathan Parkes Allen, Matthew Thomas Miller, Taylor Berg-Kirkpatrick
  • SemAttack: Natural Textual Attacks via Different Semantic Spaces Boxin Wang, Chejian Xu, Xiangyu Liu, Yu Cheng, Bo Li
  • FedNLP: Benchmarking Federated Learning Methods for Natural Language Processing Tasks Bill Yuchen Lin, Chaoyang He, Zihang Zeng, Hulin Wang, Yufen Huang, Christophe Dupuy, Rahul Gupta, Mahdi Soltanolkotabi, Xiang Ren, Salman Avestimehr
  • Multi-Hop Open-Domain Question Answering over Structured andUnstructured Knowledge Yue Feng, Zhen Han, Mingming Sun, Ping Li
  • How to Translate Your Samples and Choose Your Shots? Analyzing Translate-train & Few-shot Cross-lingual Transfer Iman Jundi, Gabriella Lapesa
  • In-BoXBART: Get Instructions into Biomedical Multi-task Learning Mihir Parmar, Swaroop Mishra, Mirali Purohit, Man Luo, M. Hassan Murad, Chitta Baral
  • Self-Supervised Contrastive Learning with Adversarial Perturbations for Defending Word Substitution-based Attacks Zhao Meng, Yihan Dong, Mrinmaya Sachan, Roger Wattenhofer
  • An Item Response Theory Framework for Persuasion Anastassia Kornilova, Vladimir Eidelman, Daniel Argyle
  • LongChecker: Improving scientific claim verification by modeling full-abstract context David Wadden, Kyle Lo, Lucy Lu Wang, Arman Cohan, Iz Beltagy, Hannaneh Hajishirzi
  • SEQZERO: Few-shot Compositional Semantic Parsing with Sequential Prompts and Zero-shot Models Jingfeng Yang, Haoming Jiang, Qingyu Yin, Danqing Zhang, Bing Yin, Diyi Yang
  • Improving Conversational Recommendation Systems’ Quality with Context-Aware Item Meta-Information Bowen Yang, Cong Han, Yu Li, Lei Zuo, Zhou Yu
  • PromptGen: Automatically Generate Prompts using Generative Models Yue Zhang, Hongliang Fei, Dingcheng Li, Ping Li
  • Masked Measurement Prediction: Learning to Jointly Predict Quantities and Units from Textual Context Daniel Spokoyny, Ivan Lee, Zhao Jin, Taylor Berg-Kirkpatrick
  • PubHealthTab: A Public Health Table-based Dataset for Evidence-based Fact Checking Mubashara Akhtar, Oana Cocarascu, Elena Simperl
  • One Size Does Not Fit All: The Case for Personalised Word Complexity Models Sian Gooding, Manuel Tragut
  • Design Challenges for a Multi-Perspective Search Engine Sihao Chen, Siyi Liu, Xander Uyttendaele, Yi Zhang, William W. Bruno, Dan Roth
  • Aligning Generative Language Models with Human Values Ruibo Liu, Ge Zhang, Xinyu Feng, Soroush Vosoughi
  • Opportunities for Human-centered Evaluation of Machine Translation Systems Daniel J. Liebling, Katherine A Heller, Samantha Robertson, Wesley Hanwen Deng
  • Quiz Design Task: Helping Teachers Create Quizzes with Automated Question Generation Philippe Laban, Chien-Sheng Wu, Lidiya Murakhovs’ka, Wenhao Liu, Caiming Xiong
  • POLITICS: Pretraining with Same-story Article Comparison for Ideology Prediction and Stance Detection Yujian Liu, Xinliang Frederick Zhang, David Wegsman, Nicholas Beauchamp, Lu Wang
  • Target-Guided Dialogue Response Generation Using Commonsense and Data Augmentation Prakhar Gupta, Harsh Jhamtani, Jeffrey Bigham
  • CRUSH: Contextually Regularized and User anchored Self-supervised Hate speech Detection Souvic Chakraborty, Parag Dutta, Sumegh Roychowdhury, Animesh Mukherjee