Re-Examining System-Level Correlations of Automatic Summarization Evaluation Metrics
Daniel Deutsch, Rotem Dror, Dan Roth
MOVER: Mask, Over-generate and Rank for Hyperbole Generation
Yunxiang Zhang, Xiaojun Wan
Aligning to Normative Values in Morally Informed Game Environments
Prithviraj Ammanabrolu, Liwei Jiang, Maarten Sap, Hanna Hajishirzi, Yejin Choi
Diagnosing Vision-and-Language Navigation: What Really Matters
Wanrong Zhu, Yuankai Qi, Pradyumna Narayana, Kazoo Sone, Sugato Basu, Xin Eric Wang, Qi Wu, Miguel Eckstein, William Yang Wang
HiURE: Hierarchical Exemplar Contrastive Learning for Unsupervised Relation Extraction
Shuliang Liu, Xuming Hu, Chenwei Zhang, Shu’ang Li, Lijie Wen, Philip S. Yu
Time Waits for No One! Analysis and Challenges of Temporal Misalignment
Kelvin Luu, Daniel Khashabi, Suchin Gururangan, Karishma Mandyam, Noah Smith
Hate Speech and Counter Speech Detection: Conversational Context Does Matter
Xinchen Yu, Eduardo Blanco, Lingzi Hong
Non-Autoregressive Chinese ASR Error Correction with Phonological Training
Zheng Fang, Ruiqing Zhang, Zhongjun He, Hua Wu, Yanan Cao
Annotators with Attitudes: How Annotator Beliefs And Identities Bias Toxic Language Detection
Maarten Sap, Swabha Swayamdipta, Laura Vianna, Xuhui Zhou, Yejin Choi, Noah Smith
Explaining Dialogue Evaluation Metrics using Adversarial Behavioral Analysis
Baber Khalid, SUNGJIN LEE
You Don’t Know My Favorite Color: Preventing Dialogue Representations from Revealing Speakers’ Private Personas
Haoran Li, Yangqiu Song, Lixin Fan
Learning to Win Lottery Tickets in BERT Transfer via Task-agnostic Mask Training
Yuanxin Liu, Fandong Meng, Zheng Lin, Peng Fu, Yanan Cao, Weiping Wang, Jie Zhou
A Holistic Framework for Analyzing the COVID-19 Vaccine Debate
Maria Leonor Pacheco, Tunazzina Islam, Monal Mahajan, Andrey Shor, Ming Yin, Lyle Ungar, Dan Goldwasser
Less is More: Learning to Refine Dialogue History for Personalized Dialogue Generation
Hanxun Zhong, Zhicheng Dou, Yutao Zhu, Hongjin Qian, Ji-Rong Wen
What do Toothbrushes do in the Kitchen? How Transformers Think our World is Structured
Alexander Henlein, Alexander Mehler
Neighbors Are Not Strangers: Improving Non-Autoregressive Translation under Low-Frequency Lexical Constraints
Chun Zeng, Jiangjie Chen, Tianyi Zhuang, Rui Xu, Hao Yang, Qin Ying, shimin tao, Yanghua Xiao
Exploiting Inductive Bias in Transformers for Unsupervised Disentanglement of Syntax and Semantics with VAEs
Ghazi Felhi, Joseph Le Roux, Djamé Seddah
LaMemo: Language Modeling with Look-Ahead Memory
Haozhe Ji, Rongsheng Zhang, Zhenyu Yang, Zhipeng Hu, Minlie Huang
Few-Shot Document-Level Relation Extraction
Nicholas Popovic, Michael Färber
Template-free Prompt Tuning for Few-shot NER
Ruotian Ma, Xin Zhou, Tao Gui, Yiding Tan, Linyang Li, Qi Zhang, Xuanjing Huang
Hyperbolic Relevance Matching for Neural Keyphrase Extraction
Mingyang Song, Yi Feng, Liping Jing
DialSummEval: Revisiting Summarization Evaluation for Dialogues
Mingqi Gao, Xiaojun Wan
CoMPM: Context Modeling with Speaker’s Pre-trained Memory Tracking for Emotion Recognition in Conversation
Joosung Lee, Wooin Lee
CONFIT: Toward Faithful Dialogue Summarization with Linguistically-Informed Contrastive Fine-tuning
Xiangru Tang, Arjun Nair, Borui Wang, Bingyao Wang, Jai Amit Desai, Aaron Wade, Haoran Li, Asli Celikyilmaz, Yashar Mehdad, Dragomir Radev
Shedding New Light on the Language of the Dark Web
Youngjin Jin, Eugene Jang, Yongjae Lee, Seungwon Shin, Jin-Woo Chung
Identifying Implicitly Abusive Remarks about Identity Groups using a Linguistically Informed Approach
Michael Wiegand, Elisabeth Eder, Josef Ruppenhofer
Cross-Lingual Event Detection via Optimized Adversarial Training
Luis Fernando Guzman-Nateras, Minh Van Nguyen, Thien Huu Nguyen
DEMix Layers: Disentangling Domains for Modular Language Modeling
Suchin Gururangan, Mike Lewis, Ari Holtzman, Noah Smith, Luke Zettlemoyer
Nearest Neighbor Knowledge Distillation for Neural Machine Translation
Zhixian Yang, Renliang Sun, Xiaojun Wan
Cryptocoin Bubble Detection: A New Dataset, Task & Hyperbolic Models
Ramit Sawhney, Shivam Agarwal, Vivek Mittal, Paolo Rosso, Vikram Nanda, Sudheer Chava
IDPG: An Instance-Dependent Prompt Generation Method
Zhuofeng Wu, Sinong Wang, Jiatao Gu, Rui Hou, Yuxiao Dong, V.G.Vinod Vydiswaran, Hao Ma
Few-shot Subgoal Planning with Language Models
Lajanugen Logeswaran, Yao Fu, Moontae Lee, Honglak Lee
Automatic Multi-Label Prompting: Simple and Interpretable Few-Shot Classification
Han Wang, Canwen Xu, Julian McAuley
ConfliBERT: A Pre-trained Language Model for Political Conflict and Violence
Yibo Hu, MohammadSaleh Hosseini, Erick Skorupa Parolin, Javier Osorio, Latifur Khan, Patrick Brandt, Vito D’Orazio
Extreme Zero-Shot Learning for Extreme Text Classification
Yuanhao Xiong, Wei-Cheng Chang, Cho-Jui Hsieh, Hsiang-Fu Yu, Inderjit S Dhillon
Overcoming Catastrophic Forgetting During Domain Adaptation of Seq2seq Language Generation
Dingcheng Li, Zheng Chen, Eunah Cho, Jie Hao, Xiaohu Liu, Xing Fan, Chenlei Guo, Yang Liu
CORWA: A Citation-Oriented Related Work Annotation Dataset
Xiangci Li, Biswadip Mandal, Jessica Ouyang
When Does Syntax Mediate Neural Language Model Performance? Evidence from Dropout Probes
Mycal Tucker, Tiwalayo Eisape, Peng Qian, Roger P. Levy, Julie Shah
Maximum Bayes Smatch Ensemble Distillation for AMR Parsing
Young-Suk Lee, Ramon Fernandez Astudillo, Hoang Thanh Lam, Tahira Naseem, Radu Florian, Salim Roukos
ExSum: From Local Explanations to Model Understanding
Yilun Zhou, Marco Tulio Ribeiro, Julie Shah
QuALITY: Question Answering with Long Input Texts, Yes!
Richard Yuanzhe Pang, Alicia Vail Parrish, Nitish Joshi, Nikita Nangia, Jason Phang, Angelica Chen, Vishakh Padmakumar, Johnny L Ma, Jana Thompson, He He, Samuel R. Bowman
Visual Commonsense in Pretrained Unimodal and Multimodal Models
Chenyu Zhang, Benjamin Van Durme, Elias Stengel-Eskin, Zhuowan Li
Original or Translated? A Causal Analysis of the Impact of Translationese on Machine Translation Performance
Jingwei Ni, Zhijing Jin, Markus Freitag, Mrinmaya Sachan, Bernhard Schölkopf
Is “my favorite new movie” my favorite movie? Probing the Understanding of Recursive Noun Phrases
QING LYU, Hua Zheng, Daoxin Li, Li Zhang, Marianna Apidianaki, Chris Callison-Burch
Syn2Vec: Synset Colexification Graphs for Lexical Semantic Similarity
John Harvill, Roxana Girju, Mark A. Hasegawa-Johnson
TreeMix: Compositional Constituency-based Data Augmentation for Natural Language Understanding
Le Zhang, Zichao Yang, Diyi Yang
Dual-Channel Evidence Fusion for Fact Verification over Texts and Tables
Nan Hu, Zirui Wu, Yuxuan Lai, Xiao Liu, Yansong Feng
Semantically Informed Slang Interpretation
Zhewei Sun, Richard Zemel, Yang Xu
Partner Personas Generation for Dialogue Response Generation
Hongyuan Lu, Wai Lam, Hong Cheng, Helen M. Meng
Sketching as a Tool for Understanding and Accelerating Self-attention for Long Sequences
Yifan Chen, Qi Zeng, Dilek Hakkani-Tur, Di Jin, Heng Ji, Yun Yang
On the Effect of Pretraining Corpora on In-context Few-shot Learning by a Large-scale Language Model
Seongjin Shin, Sang-Woo Lee, Hwijeen Ahn, Sungdong Kim, HyoungSeok Kim, Boseop Kim, Kyunghyun Cho, Gichang Lee, Woomyoung Park, Jung-Woo Ha, Nako Sung
KALA: Knowledge-Augmented Language Model Adaptation
Minki Kang, Jinheon Baek, Sung Ju Hwang
DynamicTOC: Persona-based Table of Contents for Consumption of Long Documents
Himanshu Maheshwari, Nethraa Sivakumar, Shelly Jain, Tanvi Karandikar, Vinay Aggarwal, Navita Goyal, Sumit Shekhar
Cross-modal Contrastive Learning for Speech Translation
Rong Ye, Mingxuan Wang, Lei Li
Modeling Multi-Granularity Hierarchical Features for Relation Extraction
Xinnian Liang, Shuangzhi Wu, Mu Li, Zhoujun Li
A Corpus for Understanding and Generating Moral Stories
Jian Guan, Ziqi Liu, Minlie Huang
JointLK: Joint Reasoning with Language Models and Knowledge Graphs for Commonsense Question Answering
Yueqing Sun, Qi Shi, Le Qi, Yu Zhang
Robust (Controlled) Table-to-Text Generation with Structure-Aware Equivariance Learning
Fei Wang, Zhewei Xu, Pedro Szekely, Muhao Chen
A Two-Stream AMR-enhanced Model for Document-level Event Argument Extraction
Runxin Xu, Peiyi Wang, Tianyu Liu, Shuang Zeng, Baobao Chang, Zhifang Sui
An Enhanced Span-based Decomposition Method for Few-Shot Sequence Labeling
Peiyi Wang, Runxin Xu, Tianyu Liu, Qingyu Zhou, Yunbo Cao, Baobao Chang, Zhifang Sui
A Double-Graph Based Framework for Frame Semantic Parsing
Ce Zheng, Xudong Chen, Runxin Xu, Baobao Chang
RAAT: Relation-Augmented Attention Transformer for Relation Modeling in Document-Level Event Extraction
Yuan Liang, Zhuoxuan Jiang, di yin, Bo Ren
SkillSpan: Hard and Soft Skill Extraction from English Job Postings
Mike Zhang, Kristian Nørgaard Jensen, Sif Dam Sonniks, Barbara Plank
Jam or Cream First? Modeling Ambiguity in Neural Machine Translation with SCONES
Felix Stahlberg, Shankar Kumar
DUCK: Rumour Detection on Social Media by Modelling User and Comment Propagation Networks
LIN TIAN, Xiuzhen Zhang, Jey Han Lau
Mitigating Toxic Degeneration with Empathetic Data: Exploring the Relationship Between Toxicity and Empathy
Charles Welch, Allison Lahnala, Béla Neuendorf, Lucie Flek
SSEGCN: Syntactic and Semantic Enhanced Graph Convolutional Network for Aspect-based Sentiment Analysis
Zheng Zhang, Zili Zhou, Yanna Wang
A Balanced Data Approach for Evaluating Cross-Lingual Transfer: Mapping the Linguistic Blood Bank
Dan Malkin, Gabriel Stanovsky
Generalized Quantifiers as a Source of Error in Multilingual NLU Benchmarks
Ruixiang Cui, Daniel Hershcovich, Anders Søgaard
Interactive Symbol Grounding with Complex Referential Expressions
Rimvydas Rubavicius, Alex Lascarides
Domain-Oriented Prefix-Tuning: Towards Efficient and Generalizable Fine-tuning for Zero-Shot Dialogue Summarization
Lulu Zhao, Fujia Zheng, Weihao Zeng, Keqing He, Weiran Xu, Huixing Jiang, Wei Wu, Yanan Wu
Reducing Disambiguation Biases in NMT by Leveraging Explicit Word Sense Information
Niccolò Campolungo, Tommaso Pasini, Denis Emelin, Roberto Navigli
Match made by BERT? Towards Interpretable Paper-Reviewer Assignments in NLP
Terne Sasha Thorn Jakobsen, Anna Rogers
Dynamic Programming in Rank Space: Scaling Structured Inference with Low-Rank HMMs and PCFGs
Songlin Yang, Wei Liu, Kewei Tu
Learning as Conversation: Dialogue Systems Reinforced for Information Acquisition
Pengshan Cai, Hui Wan, Fei Liu, Mo Yu, hong yu, Sachindra Joshi
Lifelong Pretraining: Continually Adapting Language Models to Emerging Corpora
Xisen Jin, Dejiao Zhang, Henghui Zhu, Wei Xiao, Shang-Wen Li, Xiaokai Wei, Andrew Arnold, Xiang Ren
EPiDA: An Easy Plug-in Data Augmentation Framework for High Performance Text Classification
Minyi Zhao, Lu Zhang, Yi Xu, Jiandong Ding, Jihong Guan, Shuigeng Zhou
A Study of the Attention Abnormality in Trojaned BERTs
Weimin Lyu, Songzhu Zheng, Tengfei Ma, Chao Chen
Quantifying Adaptability in Pre-trained Language Models with 500 Tasks
Belinda Z. Li, Jane A. Yu, Madian Khabsa, Luke Zettlemoyer, Alon Halevy, Jacob Andreas
Disentangling Indirect Answers to Yes-No Questions in Real Conversations
Krishna Chaitanya Sanagavarapu, Jathin Pranav Singaraju, Anusha Kakileti, Anirudh Kaza, Aaron Abraham Mathews, Helen Li, Nathan Raul Brito, Eduardo Blanco
Massive-scale Decoding for Text Generation using Lattices
Jiacheng Xu, Siddhartha Jonnalagadda, Greg Durrett
Entity Linking via Explicit Mention-Mention Coreference Modeling
Dhruv Agarwal, Rico Angell, Nicholas Monath, Andrew McCallum
GenIE: Generative Information Extraction
Martin Josifoski, Nicola De Cao, Maxime Peyrard, Fabio Petroni, Robert West
Symbolic Knowledge Distillation: from General Language Models to Commonsense Models
Peter West, Chandra Bhagavatula, Jack Hessel, Jena D. Hwang, Liwei Jiang, Ronan Le Bras, Ximing Lu, Sean Welleck, Yejin Choi
Measure and Improve Robustness in NLP Models: A Survey
Xuezhi Wang, Haohan Wang, Diyi Yang
Using Paraphrases to Study Properties of Contextual Embeddings
Laura Burdick, Jonathan K Kummerfeld, Rada Mihalcea
Disentangling Categorization in Multi-agent Emergent Communication
Washington Garcia, Hamilton Scott Clouse, Kevin R. B. Butler
SURF: Semantic-level Unsupervised Reward Function for Machine Translation
Atijit Anuchitanukul, Julia Ive
Connecting the Dots between Audio and Text without Parallel Data through Visual Knowledge Transfer
Yanpeng Zhao, Jack Hessel, Youngjae Yu, Ximing Lu, Rowan Zellers, Yejin Choi
CHAI: A CHatbot AI for Task-Oriented Dialogue with Offline Reinforcement Learning
Siddharth Verma, Justin Fu, Sherry Yang, Sergey Levine
Multi-Vector Models with Textual Guidance for Fine-Grained Scientific Document Similarity
Sheshera Mysore, Arman Cohan, Tom Hope
Testing the Ability of Language Models to Interpret Figurative Language
Emmy Liu, Chenxuan Cui, Kenneth Zheng, Graham Neubig
What kind of company do words keep? Revisiting the distributional semantics of J.R. Firth & Zellig Harris
Mikael Brunila, Jack LaViolette
Imagination-Augmented Natural Language Understanding
Yujie Lu, Wanrong Zhu, Xin Eric Wang, Miguel Eckstein, William Yang Wang
Linguistic Frameworks Go Toe-to-Toe at Neuro-Symbolic Language Modeling
Jakob Prange, Nathan Schneider, Lingpeng Kong
Joint Extraction of Entities, Relations, and Events via Modeling Inter-Instance and Inter-Label Dependencies
Minh Van Nguyen, Bonan Min, Franck Dernoncourt, Thien Huu Nguyen
Improving Compositional Generalization with Latent Structure and Data Augmentation
Linlu Qiu, Peter Shaw, Panupong Pasupat, Paweł Krzysztof Nowak, Tal Linzen, Fei Sha, Kristina Toutanova
Consolidating Answers in Question Answering Systems
Wenxuan Zhou, Qiang Ning, Heba Elfardy, Kevin Small, Muhao Chen
FNet: Mixing Tokens with Fourier Transforms
James Lee-Thorp, Joshua Ainslie, Ilya Eckstein, Santiago Ontanon
TVShowGuess: Character Comprehension in Stories as Speaker Guessing
Yisi Sang, Xiangyang Mou, Mo Yu, Shunyu Yao, Jing Li, Jeffrey Stanton
ProQA: Structural Prompt-based Pre-training for Unified Question Answering
Wanjun Zhong, Yifan Gao, Ning Ding, Yujia Qin, Zhiyuan Liu, Ming Zhou, Jiahai Wang, Jian Yin, Nan Duan
Generative Cross-Domain Data Augmentation for Aspect and Opinion Co-Extraction
Junjie Li, Jianfei Yu, Rui Xia
DiffCSE: Difference-based Contrastive Learning for Sentence Embeddings
Yung-Sung Chuang, Rumen Dangovski, Hongyin Luo, Yang Zhang, Shiyu Chang, Marin Soljacic, Shang-Wen Li, Scott Yih, Yoon Kim, James R. Glass
Practice Makes a Solver Perfect: Data Augmentation for Math Word Problem Solvers
Vivek Kumar, Rishabh Maheshwary, Vikram Pudi
KCD: Knowledge Walks and Textual Cues Enhanced Political Perspective Detection in News Media
Wenqian Zhang, Shangbin Feng, Zilong Chen, Zhenyu Lei, Jundong Li, Minnan Luo
Emp-RFT: Empathetic Response Generation via Recognizing Feature Transitions between Utterances
Wongyu Kim, Youbin Ahn, Donghyun Kim, Kyong-Ho Lee
Early Rumor Detection Using Neural Hawkes Process with a New Benchmark Dataset
Fengzhu ZENG, Wei Gao
Connecting Loss Difference with Equal Opportunity for Fair Models
Aili Shen, Xudong Han, Trevor Cohn, Timothy Baldwin, Lea Frermann
Unsupervised Stem-based Cross-lingual Part-of-Speech Tagging for Morphologically Rich Low-Resource Languages
Ramy Eskander, Cass Lowry, Sujay Khandagale, Judith Lynn Klavans, Maria Polinsky, Smaranda Muresan
Robust Self-Augmentation for Named Entity Recognition with Meta Reweighting
Linzhi Wu, Pengjun Xie, Jie Zhou, Meishan Zhang, Ma Chunping, Guangwei Xu, Min Zhang
Bilingual Tabular Inference: A Case Study on Indic Languages
Chaitanya Agarwal, Vivek Gupta, Anoop Kunchukuttan, Manish Shrivastava
A Complex KBQA System using Multiple Reasoning Paths
Yu Wang, Vijay Srinivasan, Hongxia Jin
WECHSEL: Effective initialization of subword embeddings for cross-lingual transfer of monolingual language models
Benjamin Minixhofer, Fabian Paischer, Navid Rekabsaz
DocEE: A Large-Scale and Fine-grained Benchmark for Document-level Event Extraction
MeiHan Tong, Bin Xu, Shuai Wang, Meihuan Han, Yixin Cao, Jiangqi Zhu, Siyu Chen, Lei Hou, Juanzi Li
On Transferability of Prompt Tuning for Natural Language Processing
Yusheng Su, Xiaozhi Wang, Yujia Qin, Chi-Min Chan, Yankai Lin, Huadong Wang, Kaiyue Wen, Zhiyuan Liu, Peng Li, Juanzi Li, Lei Hou, Maosong Sun, Jie Zhou
Bi-SimCut: A Simple Strategy for Boosting Neural Machine Translation
Pengzhi Gao, Zhongjun He, Hua Wu, Haifeng Wang
Knowledge Inheritance for Pre-trained Language Models
Yujia Qin, Yankai Lin, Jing Yi, Jiajie Zhang, Xu Han, Zhengyan Zhang, Yusheng Su, Zhiyuan Liu, Peng Li, Maosong Sun, Jie Zhou
TRUE: Re-evaluating Factual Consistency Evaluation
Or Honovich, Roee Aharoni, Jonathan Herzig, Hagai Taitelbaum, Doron Kukliansy, Vered Cohen, Thomas Scialom, Idan Szpektor, Avinatan Hassidim, Yossi Matias
Dynamic Multistep Reasoning based on Video Scene Graph for Video Question Answering
JianGuo Mao, Wenbin Jiang, Xiangdong Wang, Zhifan Feng, Yajuan Lyu, Hong Liu, Yong Zhu
From spoken dialogue to formal summary: An utterance rewriting for dialogue summarization
Yue Fang, Hainan Zhang, Hongshen Chen, Zhuoye Ding, Bo Long, Yanyan Lan, Yanquan Zhou
Residue-Based Natural Language Adversarial Attack Detection
Vyas Raina, Mark Gales
A Computational Acquisition Model for Multimodal Word Categorization
Uri Berger, Gabriel Stanovsky, Omri Abend, Lea Frermann
On the Effectiveness of Sentence Encoding for Intent Detection Meta-Learning
Tingting Ma, Qianhui Wu, Zhiwei Yu, Tiejun Zhao, Chin-Yew Lin
Can Rationalization Improve Robustness?
Howard Chen, Jacqueline He, Karthik R Narasimhan, Danqi Chen
One Reference Is Not Enough: Diverse Distillation with Reference Selection for Non-Autoregressive Translation
Chenze Shao, Xuanfu Wu, Yang Feng
GMN: Generative Multi-modal Network for Practical Document Information Extraction
Haoyu Cao, Jiefeng Ma, Antony Guo, Yiqing Hu, Hao Liu, Deqiang Jiang, Yinsong Liu, Bo Ren
Models in the Loop: Aiding Crowdworkers with Generative Annotation Assistants
Max Bartolo, Tristan Thrush, Sebastian Riedel, Pontus Stenetorp, Robin Jia, Douwe Kiela
Maize: Effective and Efficient Retrieval via Lightweight Late Interaction
Keshav Santhanam, Omar Khattab, Jon Saad-Falcon, Christopher Potts, Matei Zaharia
Multi2WOZ: A Robust Multilingual Dataset and Conversational Pretraining for Task-Oriented Dialog
Chia-Chien Hung, Anne Lauscher, Ivan Vulić, Simone Paolo Ponzetto, Goran Glavaš
FRUIT: Faithfully Reflecting Updated Information in Text
Robert L. Logan IV, Alexandre Tachard Passos, Sameer Singh, Ming-Wei Chang
Learning the Ordering of Coordinate Compounds and Elaborate Expressions in Hmong, Lahu, and Chinese
Chenxuan Cui, Katherine J. Zhang, David R Mortensen
Contrastive Representation Learning for Cross-Document Coreference Resolution of Events and Entities
Benjamin Hsu, Graham Horwood
PROMPT WAYWARDNESS: The Curious Case of Discretized Interpretation of Continuous Prompts
Daniel Khashabi, Xinxi Lyu, Sewon Min, Lianhui Qin, Kyle Richardson, Sameer Singh, Sean Welleck, Hannaneh Hajishirzi, Tushar Khot, Ashish Sabharwal, Yejin Choi
When is BERT Multilingual? Isolating Crucial Ingredients for Cross-lingual Transfer
Ameet Deshpande, Partha Talukdar, Karthik R Narasimhan
Benchmarking Intersectional Biases in NLP
John P. Lalor, Yi Yang, Kendall Smith, Nicole Forsgren, Ahmed Abbasi
Sonnet Generation by Training on Non-poetic Texts with Discourse-level Coherence and Poetic Features
Yufei Tian, Nanyun Peng
Improving In-Context Few-Shot Learning via Self-Supervised Training
Mingda Chen, Jingfei Du, Ramakanth Pasunuru, Todor Mihaylov, Srini Iyer, Veselin Stoyanov, Zornitsa Kozareva
Bidimensional Leaderboards: Generate and Evaluate Language Hand in Hand
Jungo Kasai, Keisuke Sakaguchi, Ronan Le Bras, Lavinia Dunagan, Jacob Daniel Morrison, Alexander Fabbri, Yejin Choi, Noah Smith
ElitePLM: An Empirical Study on General Language Ability Evaluation of Pretrained Language Models
Junyi Li, Tianyi Tang, Zheng Gong, Lixin Yang, Zhuohao Yu, Zhipeng Chen, Jingyuan Wang, Xin Zhao, Ji-Rong Wen
Learning to Transfer Prompts for Text Generation
Junyi Li, Tianyi Tang, Jian-Yun Nie, Ji-Rong Wen, Xin Zhao
DocAMR: Multi-Sentence AMR Representation and Evaluation
Tahira Naseem, Austin Blodgett, Sadhana Kumaravel, Tim O’Gorman, Young-Suk Lee, Jeffrey Flanigan, Ramon Fernandez Astudillo, Radu Florian, Salim Roukos, Nathan Schneider
Lifting the Curse of Multilinguality by Pre-training Modular Transformers
Jonas Pfeiffer, Naman Goyal, Xi Victoria Lin, Xian Li, James Cross, Sebastian Riedel, Mikel Artetxe
Transparent Human Evaluation for Image Captioning
Jungo Kasai, Keisuke Sakaguchi, Lavinia Dunagan, Jacob Daniel Morrison, Ronan Le Bras, Yejin Choi, Noah Smith
TWEETSPIN: Fine-grained Propaganda Detection in Social Media Using Multi-View Representations
Prashanth Vijayaraghavan, Soroush Vosoughi
On the Use of Bert for Automated Essay Scoring: Joint Learning of Multi-Scale Essay Representation
Yongjie Wang, Chuan Wang, Ruobing Li, Hui Lin
Multimodal Dialogue State Tracking
Hung Le, Nancy F. Chen, Steven HOI
VGNMN: Video-grounded Neural Module Networks for Video-Grounded Dialogue Systems
Hung Le, Nancy F. Chen, Steven HOI
CHEF: A Pilot Chinese Dataset for Evidence-Based Fact-Checking
Xuming Hu, Zhijiang Guo, GuanYu Wu, Aiwei Liu, Lijie Wen, Philip S. Yu
Persona-Guided Planning for Controlling the Protagonist’s Persona in Story Generation
Zhexin Zhang, Jiaxin Wen, Jian Guan, Minlie Huang
Towards Efficient NLP: A Standard Evaluation and A Strong Baseline
Xiangyang Liu, Tianxiang Sun, JunLiang He, Jiawen Wu, Lingling Wu, Xinyu Zhang, Hao Jiang, Zhao Cao, Xuanjing Huang, Xipeng Qiu
Clues Before Answers: Generation-Enhanced Multiple-Choice QA
Zixian Huang, Ao Wu, Jiaying Zhou, Yu Gu, Yue Zhao, Gong Cheng
Unsupervised Paraphrasability Prediction for Compound Nominalizations
John Sie Yuen Lee, Ho Hung Lim, Carol Webster
FactGraph: Evaluating Factuality in Summarization with Semantic Graph Representations
Leonardo F. R. Ribeiro, Mengwen Liu, Iryna Gurevych, Markus Dreyer, Mohit Bansal
Neural Language Taskonomy: Which NLP Tasks are the most Predictive of fMRI Brain Activity?
SUBBA REDDY OOTA, Veeral Agarwal, JASHN ARORA, mounika marreddy, Manish Gupta, Bapi Raju Surampudi
Curriculum: A Broad-Coverage Benchmark for Linguistic Phenomena in Natural Language Understanding
Zeming Chen, Qiyue Gao
A Dataset for N-ary Relation Extraction of Drug Combinations
Aryeh Tiktinsky, Vijay Viswanathan, Danna Niezni, Dana Meron Azagury, Yosi Shamay, Hillel Taub-Tabib, Tom Hope, Yoav Goldberg
ITA: Image-Text Alignments for Multi-Modal Named Entity Recognition
Xinyu Wang, Min Gui, Yong Jiang, Zixia Jia, Nguyen Bach, Tao Wang, Zhongqiang Huang, Kewei Tu
Efficient Constituency Tree based Encoding for Natural Language to Bash Translation
Shikhar Bharadwaj, Shirish Shevade
Enhance Incomplete Utterance Restoration by Joint Learning Token Extraction and Text Generation
Shumpei Inoue, Tsungwei Liu, Son Hong Nguyen, Minh-Tien Nguyen
NeuS: Neutral Multi-News Summarization for Mitigating Framing Bias
Nayeon Lee, Yejin Bang, Tiezheng YU, Andrea Madotto, Pascale Fung
MuCGEC: a Multi-Reference Multi-Source Evaluation Dataset for Chinese Grammatical Error Correction
Yue Zhang, Zhenghua Li, Zuyi Bao, Jiacheng Li, Bo Zhang, Chen Li, Fei Huang, Min Zhang
Boosted Dense Retriever
Patrick Lewis, Barlas Oguz, Wenhan Xiong, Fabio Petroni, Scott Yih, Sebastian Riedel
Analyzing Encoded Concepts in Transformer Language Models
Hassan Sajjad, Nadir Durrani, Fahim Dalvi, Firoj Alam, Abdul Rafae Khan, Jia Xu
Should We Rely on Entity Mentions for Relation Extraction? Debiasing Relation Extraction with Counterfactual Analysis
Yiwei Wang, Muhao Chen, Wenxuan Zhou, Yujun Cai, Yuxuan Liang, Dayiheng Liu, Baosong Yang, Juncheng Liu, Bryan Hooi
A Few Thousand Translations Go a Long Way! Leveraging Pre-trained Models for African News Translation
David Ifeoluwa Adelani, Jesujoba Oluwadara Alabi, Angela Fan, Julia Kreutzer, Xiaoyu Shen, Machel Reid, Dana Ruiter, Dietrich Klakow, Peter Nabende, Ernie Chang, Tajuddeen Gwadabe, Freshia Sackey, Bonaventure F. P. Dossou, Chris Chinenye Emezue, Colin Leong, Michael Beukman, Shamsuddeen Hassan Muhammad, Guyo Dub Jarso, Oreen Yousuf, Andre Niyongabo Rubungo, Gilles HACHEME, Eric Peter Wairagala, Muhammad Umair Nasir, Benjamin Ayoade Ajibade, Tunde Oluwaseyi Ajayi, Yvonne Wambui Gitau, Jade Abbott, Mohamed Ahmed, Millicent Ochieng, Anuoluwapo Aremu, Perez Ogayo, Jonathan Mukiibi, Fatoumata Ouoba Kabore, Godson Koffi KALIPE, Derguene Mbaye, Allahsera Auguste Tapo, Victoire Memdjokam Koagne, Edwin Munkoh-Buabeng, Valencia Wagner, Idris Abdulmumin, Ayodele Awokoya
Document-Level Event Argument Extraction by Leveraging Redundant Information and Closed Boundary Loss
Hanzhang Zhou, Kezhi Mao
Features or Spurious Artifacts? Data-centric Baselines for Fair and Robust Hate Speech Detection
Alan Ramponi, Sara Tonelli
Low Resource Style Transfer via Domain Adaptive Meta Learning
Xiangyang Li, Xiang Long, Yu Xia, Sujian Li
Progressive Class Semantic Matching for Semi-supervised Text Classification
Haiming Xu, Lingqiao Liu, Ehsan M Abbasnejad
Domain Confused Contrastive Learning for Unsupervised Domain Adaptation
Quanyu Long, Tianze Luo, Wenya Wang, Sinno Pan
Interpretable Proof Generation via Iterative Backward Reasoning
Hanhao Qu, Yu Cao, Jun Gao, Liang Ding, Ruifeng Xu
PPL-MCTS: Constrained Textual Generation Through Discriminator-Guided MCTS Decoding
Antoine Chaffin, Vincent Claveau, Ewa Kijak
Triggerless Backdoor Attack for NLP Tasks with Clean Labels
Leilei Gan, Jiwei Li, Tianwei Zhang, Xiaoya Li, Yuxian Meng, Fei Wu, Yi Yang, Shangwei Guo, Chun Fan
Are All the Datasets in Benchmark Necessary? A Pilot Study of Dataset Evaluation for Text Classification
Yang Xiao, Jinlan Fu, See-Kiong Ng, Pengfei Liu
Document-Level Relation Extraction with Sentences Importance Estimation and Focusing
Wang Xu, Kehai Chen, Lili Mou, Tiejun Zhao
Improving Entity Disambiguation by Reasoning over a Knowledge Base
Tom Ayoola, Joseph Fisher, Andrea Pierleoni
Learning to Borrow– Relation Representation for Without-Mention Entity-Pairs for Knowledge Graph Completion
Huda Hakami, Mona Hakami, Angrosh Mandya, Danushka Bollegala
Selective Differential Privacy for Language Modeling
Weiyan Shi, Aiqi Cui, Evan Li, Ruoxi Jia, Zhou Yu
Robust Conversational Agents against Imperceptible Toxicity Triggers
Ninareh Mehrabi, Ahmad Beirami, Fred Morstatter, Aram Galstyan
Enhancing Knowledge Selection for Grounded Dialogues via Document Semantic Graphs
Sha Li, Mahdi Namazifar, Di Jin, Mohit Bansal, Heng Ji, Yang Liu, Dilek Hakkani-Tur
MetaICL: Learning to Learn In Context
Sewon Min, Mike Lewis, Luke Zettlemoyer, Hannaneh Hajishirzi
Dynamic Gazetteer Integration in Multilingual Models for Cross-Lingual and Cross-Domain Named Entity Recognition
Besnik Fetahu, Anjie Fang, Oleg Rokhlenko, Shervin Malmasi
Gender Bias in Masked Language Models for Multiple Languages
Masahiro Kaneko, Aizhan Imankulova, Danushka Bollegala, Naoaki Okazaki
Federated Learning with Noisy User Feedback
Rahul Sharma, Anil Ramakrishna, Ansel MacLaughlin, Anna Rumshisky, Jimit Majmudar, Clement Chung, Salman Avestimehr, Rahul Gupta
Don’t sweat the small stuff, classify the rest: Sample Shielding to protect text classifiers against adversarial attacks
Jonathan Rusert, Padmini Srinivasan
Learning to Retrieve Passages without Supervision
Ori Ram, Gal Shachaf, Omer Levy, Jonathan Berant, Amir Globerson
Necessity and Sufficiency for Explaining Text Classifiers: A Case Study in Hate Speech Detection
Esma Balkir, Isar Nejadgholi, Kathleen C. Fraser, Svetlana Kiritchenko
Learning To Retrieve Prompts for In-Context Learning
Ohad Rubin, Jonathan Herzig, Jonathan Berant
Unified Semantic Typing with Meaningful Label Inference
James Y. Huang, Bangzheng Li, Jiashu Xu, Muhao Chen
A Structured Span Selector for Span Prediction Tasks
Tianyu Liu, Yuchen Eleanor Jiang, Ryan D Cotterell, Mrinmaya Sachan
How Gender Debiasing Affects Internal Model Representations, and Why It Matters
Hadas Orgad, Seraphina Goldfarb-Tarrant, Yonatan Belinkov
QAFactEval: Improved QA-Based Factual Consistency Evaluation for Summarization
Alexander Fabbri, Chien-Sheng Wu, Wenhao Liu, Caiming Xiong
Training Mixed-Domain Translation Models via Federated Learning
Peyman Passban, Tanya Roosta, Rahul Gupta, ankit Chadha, Clement Chung
Interactive Query-Assisted Summarization via Deep Reinforcement Learning
Ori Shapira, Ramakanth Pasunuru, Mohit Bansal, Ido Dagan, Yael Amsterdamer
Text Style Transfer via Optimal Transport
Nasim Nouri
AnswerSumm: A Manually-Curated Dataset and Pipeline for Answer Summarization
Alexander Fabbri, Xiaojian Wu, Srini Iyer, Haoran Li, Mona T. Diab
What do tokens know about their characters and how do they know it?
Ayush Kaushal, Kyle Mahowald
WiC = TSV = WSD: On the Equivalence of Three Semantic Tasks
Bradley Hauer, Grzegorz Kondrak
Combating the curse of multilinguality in cross-lingual WSD via the application of sparsified contextualized word representations
Gábor Berend
A Shoulder to Cry on: Towards A Motivational Virtual Assistant for Assuaging Mental Agony
Tulika Saha, Saichethan Miriyala Reddy, Anindya Sundar Das, Sriparna Saha, Pushpak Bhattacharyya
Does Summary Evaluation Survive Translation to Other Languages?
Spencer Braun, Oleg Vasilyev, Neslihan Iskender, John Bohannon
LITE: Intent-based Task Representation Learning Using Weak Supervision
Naoki Otani, Michael Gamon, Sujay Kumar Jauhar, Mei Yang, Sri Raghu Malireddi, Oriana Riva
SAIS: Supervising and Augmenting Intermediate Steps for Document-Level Relation Extraction
Yuxin Xiao, Zecheng Zhang, Yuning Mao, Carl Yang, Jiawei Han
Towards Understanding Large-Scale Discourse Structures in Pre-Trained and Fine-Tuned Language Models
Patrick Huber, Giuseppe Carenini
Sparse Distillation: Speeding Up Text Classification by Using Bigger Student Models
Qinyuan Ye, Madian Khabsa, Mike Lewis, Sinong Wang, Xiang Ren, Aaron Jaech
GPL: Generative Pseudo Labeling for Unsupervised Domain Adaptation of Dense Retrieval
Kexin Wang, Nandan Thakur, Nils Reimers, Iryna Gurevych
Do Prompt-Based Models Really Understand the Meaning of Their Prompts?
Albert Webson, Ellie Pavlick
MINION: a Large-Scale and Diverse Dataset for Multilingual Event Detection
Amir Pouran Ben Veyseh, Minh Van Nguyen, Franck Dernoncourt, Thien Huu Nguyen
End-to-End Chinese Speaker Identification: Formulation, Annotation, and Methods
Dian Yu, Ben Zhou, Dong Yu
Learning to Express in Knowledge-Grounded Conversation
Xueliang Zhao, Tingchen Fu, Chongyang Tao, Wei Wu, Dongyan Zhao, Rui Yan
Modularized Transfer Learning with Multiple Knowledge Graphs for Zero-shot Commonsense Reasoning
Yu Jin Kim, Beong-woo Kwak, Youngwook Kim, Reinald Kim Amplayo, seung-won hwang, Jinyoung Yeo
Evidentiality-guided Generation for Knowledge-Intensive NLP Tasks
Akari Asai, Matt Gardner, Hanna Hajishirzi
On Systematic Style Differences between Unsupervised and Supervised MT and an Application for High-Resource Machine Translation
Kelly Marchisio, Markus Freitag, David Grangier
Generic and Trend-aware Curricula for Relation Extraction in Text Graphs
Nidhi Vakil, Hadi Amiri
Locally Aggregated Feature Attribution on Natural Language Understanding
Sheng Zhang, Jin Wang, Haitao Jiang, Rui Song
Sentence-Level Resampling for Named Entity Recognition
Xiaochen Wang, Yue Wang
Building a Role Specified Open-Domain Dialogue System Leveraging Large-Scale Language Models
Sanghwan Bae, Donghyun Kwak, Sungdong Kim, Donghoon Ham, Soyoung Kang, Sang-Woo Lee, Woomyoung Park
KroneckerBERT: Significant Compression of Pre-trained Language Models Through Kronecker Decomposition and Knowledge Distillation
Marzieh S. Tahaei, Ella Charlaix, Vahid Partovi Nia, Ali Ghodsi, Mehdi Rezagholizadeh
Modeling Exemplification in Long-form Question Answering via Retrieval
Shufan Wang, Fangyuan Xu, Laure Thompson, Eunsol Choi, Mohit Iyyer
Don’t Take It Literally: An Edit-Invariant Sequence Loss for Text Generation
Guangyi Liu, Zichao Yang, Tianhua Tao, Xiaodan Liang, Junwei Bao, Zhen Li, Xiaodong He, Shuguang Cui, Zhiting Hu
Unsupervised Cross-Lingual Transfer of Structured Predictors without Source Data
Kemal Kurniawan, Lea Frermann, Philip Schulz, Trevor Cohn
CS1QA: A Dataset for Code-based Question Answering in an Introductory Programming Course
Changyoon Lee, Yeon Seonwoo, Alice Oh
Event Schema Induction with Double Graph Autoencoders
Xiaomeng Jin, Manling Li, Heng Ji
Multi-Relational Graph Transformer for Automatic Short Answer Grading
Rajat Agarwal, Varun Khurana, Karish Grover, Mukesh Mohania, Vikram Goyal
Even the Simplest Baseline Needs Careful Re-investigation: A Case Study on XML-CNN
Si-An Chen, Jie-Jyun Liu, Tsung-Han Yang, Hsuan-Tien Lin, Chih-Jen Lin
Simple Local Attentions Remain Competitive for Long-Context Tasks
Wenhan Xiong, Barlas Oguz, Anchit Gupta, Xilun Chen, Diana Liskovich, Omer Levy, Scott Yih, Yashar Mehdad
Frustratingly Easy System Combination for Grammatical Error Correction
Muhammad Reza Qorib, Seung-Hoon Na, Hwee Tou Ng
All You May Need for VQA are Image Captions
Soravit Changpinyo, Doron Kukliansy, Idan Szpektor, Xi Chen, Nan Ding, Radu Soricut
MGIMN: Multi-Grained Interactive Matching Network for Few-shot Text Classification
Jianhai Zhang, Mieradilijiang Maimaiti, Gao Xing, Yuanhang Zheng, Ji Zhang
Hero-Gang Neural Model For Named Entity Recognition
Jinpeng Hu, Yaling Shen, Yang Liu, Xiang Wan, Tsung-Hui Chang
Bridging the Gap between Language Models and Cross-Lingual Sequence Labeling
Nuo Chen, Linjun Shou, MING GONG, Jian Pei, Daxin Jiang
DEGREE: A Data-Efficient Generation-Based Event Extraction Model
I-Hung Hsu, Kuan-Hao Huang, Elizabeth Boschee, Scott Miller, Prem Natarajan, Kai-Wei Chang, Nanyun Peng
MultiCite: Modeling realistic citations requires moving beyond the single-sentence single-label setting
Anne Lauscher, Brandon Ko, Bailey Kuehl, Sophie Johnson, Arman Cohan, David Jurgens, Kyle Lo
The Devil is in the Details: On the Pitfalls of Vocabulary Selection in Neural Machine Translation
Tobias Domhan, Eva Hasler, Ke Tran, Sony Trenous, Bill Byrne, Felix Hieber
Intent Detection and Discovery from User Logs via Deep Semi-Supervised Contrastive Clustering
Rajat Kumar, Mayur Patidar, VAIBHAV VARSHNEY, Lovekesh Vig, Gautam Shroff
RSTGen: Imbuing Fine-Grained Interpretable Control into Long-FormText Generators
Rilwan Akanni Adewoyin, Ritabrata Dutta, Yulan He
TIE: Topological Information Enhanced Structural Reading Comprehension on Web Pages
Zihan Zhao, Lu Chen, Ruisheng Cao, Hongshen Xu, Xingyu Chen, Kai Yu
Non-Autoregressive Machine Translation: It’s Not as Fast as it Seems
Jindřich Helcl, Barry Haddow, Alexandra Birch
Proposition-Level Clustering for Multi-Document Summarization
Ori Ernst, Avi Caciularu, Ori Shapira, Ramakanth Pasunuru, Mohit Bansal, Jacob Goldberger, Ido Dagan
A Study of Syntactic Multi-Modality in Non-Autoregressive Machine Translation
Kexun Zhang, Rui Wang, Xu Tan, Junliang Guo, Yi Ren, Tao Qin, Tie-Yan Liu
ValCAT: Generating Variable-Length Contextualized Adversarial Transformations using Encoder-Decoder
Chuyun Deng, Mingxuan Liu, Yue Qin, Jia Zhang, Hai-Xin Duan, Donghong Sun
Representation Learning for Conversational Data using Discourse Mutual Information Maximization
Bishal Santra, Sumegh Roychowdhury, Aishik Mandal, Vasu Gurram, Atharva Naik, Manish Gupta, Pawan Goyal
MuPAD: A Chinese Multi-Domain Predicate-Argument Dataset
Yahui Liu, Haoping Yang, Chen Gong, Qingrong Xia, Zhenghua Li, Min Zhang
Measuring Fairness with Biased Rulers: A Comparative Study on Bias Metrics for Pre-trained Language Models
Pieter Delobelle, Ewoenam Kwaku Tokpo, Toon Calders, Bettina Berendt
Improving Constituent Representation with Hypertree Neural Networks
Hao Zhou, Gongshen Liu, Kewei Tu
Improving Multi-Document Summarization through Referenced Flexible Extraction with Credit-Awareness
Yun-Zhu Song, Yi-Syuan Chen, Hong-Han Shuai
Implicit n-grams Induced by Recurrence
Xiaobing Sun, Wei Lu
MoEBERT: from BERT to Mixture-of-Experts via Importance-Guided Adaptation
Simiao Zuo, Qingru Zhang, Chen Liang, Pengcheng He, Tuo Zhao, Weizhu Chen
Aspect Is Not You Need: No-aspect Differential Sentiment Framework for Aspect-based Sentiment Analysis
Jiahao Cao, Rui Liu, Huailiang Peng, Lei Jiang, Xu Bai
Disentangled Learning of Stance and Aspect Topics for Vaccine Attitude Detection in Social Media
Lixing Zhu, Gabriele Pergola, Zheng Fang, Robert Procter, Yulan He
BlonDe: An Automatic Evaluation Metric for Document-level Machine Translation
Yuchen Eleanor Jiang, Tianyu Liu, Shuming Ma, Dongdong Zhang, Rico Sennrich, Ryan Cotterell, Mrinmaya Sachan
Beyond Emotion: A Multi-Modal Dataset for Human Desire Understanding
Ao Jia, Yu He, Yazhou Zhang, Sagar Uprety, Dawei Song, Christina Lioma
Graph and Attention Based Fact Verification and Heterogeneous COVID-19 Claims Dataset
Miguel Arana-Catania, Elena Kochkina, Arkaitz Zubiaga, Maria Liakata, Robert Procter, Yulan He
Many Hands Make Light Work: Using Essay Traits to Automatically Score Essays
Rahul Kumar, Sandeep Mathias, Sriparna Saha, Pushpak Bhattacharyya
Forecasting COVID-19 Caseloads Using Unsupervised Embedding Clusters of Social Media Posts
Felix Drinkall, Stefan Zohren, Janet B. Pierrehumbert
Go Back in Time: Generating Flashbacks in Stories with Event Plots and Temporal Prompts
Rujun Han, Hong Chen, Yufei Tian, Nanyun Peng
Label Anchored Contrastive Learning for Language Understanding
Zhenyu Zhang, Yuming Zhao, Meng Chen, Xiaodong He
AcTune: Uncertainty-Aware Active Self-Training for Active Fine-Tuning of Pretrained Language Models
Yue Yu, Lingkai Kong, Jieyu Zhang, Rongzhi Zhang, Chao Zhang
Quality-Aware Decoding for Neural Machine Translation
Patrick Fernandes, António Farinhas, Ricardo Rei, José G. C. de Souza, Perez Ogayo, Graham Neubig, Andre Martins
Learning to Selectively Learn for Weakly Supervised Paraphrase Generation with Model-based Reinforcement Learning
Haiyan Yin, Dingcheng Li, Ping Li
On the Economics of Multilingual Few-shot Learning: Modeling the Cost-Performance Trade-offs of Machine Translated and Manual Data
Kabir Ahuja, Monojit Choudhury, Sandipan Dandapat
Hatemoji: A Test Suite and Adversarially-Generated Dataset for Benchmarking and Detecting Emoji-based Hate
Hannah Rose Kirk, Bertram Vidgen, Paul Rottger, Tristan Thrush, Scott A. Hale
Efficient Hierarchical Domain Adaptation for Pretrained Language Models
Alexandra Chronopoulou, Matthew E Peters, Jesse Dodge
Commonsense and Named Entity Aware Knowledge Grounded Dialogue Generation
Deeksha varshney, Akshara Prabhakar, Asif Ekbal
Quantifying Synthesis and Fusion and their Impact on Machine Translation
Arturo Oncevay, Duygu Ataman, Niels van Berkel, Barry Haddow, Alexandra Birch, Johannes Bjerva
Theory-Grounded Measurement of U.S. Social Stereotypes in English Language Models
Yang Trista Cao, Anna Sotnikova, Hal Daumé III, Rachel Rudinger, Linda Zou
Context-Aware Abbreviation Expansion Using Large Language Models
Shanqing Cai, Subhashini Venugopalan, Katrin Tomanek, Ajit Narayanan, Meredith Ringel Morris, Michael Brenner
MultiSpanQA: A Dataset for Multi-Span Question Answering
Haonan Li, Martin Tomko, Maria Vasardani, Timothy Baldwin
DISAPERE: A Dataset for Discourse Structure in Peer Review Discussions
Neha Nayak Kennard, Tim O’Gorman, Rajarshi Das, Akshay Sharma, Chhandak Bagchi, Matthew Clinton, Pranay Kumar Yelugam, Hamed Zamani, Andrew McCallum
Cross-Domain Detection of GPT-2-Generated Technical Text
Juan Diego Rodríguez, Todd Hay, David Gros, Zain Shamsi, Ravi Srinivasan
Towards a Progression-Aware Autonomous Dialogue Agent
Abraham Sanders, Tomek Strzalkowski, Mei Si, Albert Chang
Unsupervised Slot Schema Induction for Task-oriented Dialog
Dian Yu, Mingqiu Wang, Yuan Cao, Izhak Shafran, Laurent El Shafey, Hagen Soltau
Database Search Results Disambiguation for Task-Oriented Dialog Systems
Kun Qian, Satwik Kottur, Ahmad Beirami, Shahin Shayandeh, Paul A. Crook, Alborz Geramifard, Zhou Yu, Chinnadhurai Sankar
Probing via Prompting and Pruning
Jiaoda Li, Mrinmaya Sachan, Ryan D Cotterell
CoSe-Co: Text Conditioned Generative CommonSense Contextualizer
Rachit Bansal, Milan Aggarwal, Sumit Bhatia, Jivat Neet Kaur, Balaji Krishnamurthy
DREAM: Improving Situational QA by First Elaborating the Situation
Yuling Gu, Bhavana Dalvi, Peter Clark
Masked Part-Of-Speech Model: Does modeling long context help unsupervised POS-tagging?
Xiang Zhou, Shiyue Zhang, Mohit Bansal
Inducing and Using Alignments for Transition-based AMR Parsing
Andrew Drozdov, Jiawei Zhou, Radu Florian, Andrew McCallum, Tahira Naseem, Yoon Kim, Ramon Fernandez Astudillo
EmpHi: Generating Empathetic Responses with Human-like Intents
MAO YAN CHEN, Siheng Li, Yujiu Yang
ScAN: Suicide Attempt and Ideation Events Dataset
Bhanu Pratap Singh Rawat, Samuel Kovaly, Hong Yu, Wilfred Pigeon
FactPEGASUS: Factuality-Aware Pre-training and Fine-tuning for Abstractive Summarization
David Wan, Mohit Bansal
DocTime: A Document-level Temporal Dependency Graph Parser
Puneet Mathur, Vlad I Morariu, Verena Kaynig-Fittkau, Jiuxiang Gu, Franck Dernoncourt, Quan Hung Tran, Ani Nenkova, Dinesh Manocha, Rajiv Jain
When a sentence does not introduce a discourse entity, Transformer-based models still often refer to it
Sebastian Schuster, Tal Linzen
KAT: A Knowledge Augmented Transformer for Vision-and-Language
Liangke Gui, Borui Wang, Qiuyuan Huang, Alexander G Hauptmann, Yonatan Bisk, Jianfeng Gao
Provably Confidential Language Modelling
Xuandong Zhao, Lei Li, Yu-Xiang Wang
OmniTab: Pretraining with Natural and Synthetic Data for Few-shot Table-based Question Answering
Zhengbao Jiang, Yi Mao, Pengcheng He, Graham Neubig, Weizhu Chen
CoSIm: Commonsense Reasoning for Counterfactual Scene Imagination
Hyounghun Kim, Abhay Zala, Mohit Bansal
CompactIE: Compact Facts in Open Information Extraction
Farima Fatahi Bayat, Nikita Bhutani, H. Jagadish
WALNUT: A Benchmark on Semi-weakly Supervised Learning for Natural Language Understanding
Guoqing Zheng, Giannis Karamanolakis, Kai Shu, Ahmed Hassan Awadallah
Textless Speech-to-Speech Translation on Real Data
Ann Lee, Hongyu Gong, Paul-Ambroise Duquenne, Holger Schwenk, Peng-Jen Chen, Changhan Wang, Sravya Popuri, Juan Pino, Jiatao Gu, Wei-Ning Hsu
GRAM: Fast Fine-tuning of Pre-trained Language Models for Content-based Collaborative Filtering
Yoonseok Yang, Kyu Seok Kim, Minsam Kim, Juneyoung Park
Teaching BERT to Wait: Balancing Accuracy and Latency for Streaming Disfluency Detection
Angelica Chen, Vicky Zayats, Daniel David Walker, Dirk Padfield
Explaining Toxic Text via Knowledge Enhanced Text Generation
Rohit Sridhar, Diyi Yang
NeuroLogic A*esque Decoding: Constrained Text Generation with Lookahead Heuristics
Ximing Lu, Sean Welleck, Peter West, Liwei Jiang, Jungo Kasai, Daniel Khashabi, Ronan Le Bras, Lianhui Qin, Youngjae Yu, Rowan Zellers, Noah Smith, Yejin Choi
Learning Dialogue Representations from Consecutive Utterances
Zhihan Zhou, Dejiao Zhang, Wei Xiao, Nicholas Dingwall, Xiaofei Ma, Andrew Arnold, Bing Xiang
Long-term Control for Dialogue Generation: Methods and Evaluation
Ramya Ramakrishnan, Hashan Buddhika Narangodage, Mauro Schilman, Kilian Q Weinberger, Ryan McDonald
On the Use of External Data for Spoken Named Entity Recognition
Ankita Pasad, Felix Wu, Suwon Shon, Karen Livescu, Kyu Han
Fuse It More Deeply! A Variational Transformer with Layer-Wise Latent Variable Inference for Text Generation
Jinyi Hu, Xiaoyuan Yi, Wenhao Li, Maosong Sun, Xing Xie
Meta Learning for Natural Language Processing: A Survey
Hung-yi Lee, Shang-Wen Li, Thang Vu
Reframing Human-AI Collaboration for Generating Free-Text Explanations
Sarah Wiegreffe, Jack Hessel, Swabha Swayamdipta, Mark Riedl, Yejin Choi
Non-Autoregressive Neural Machine Translation with Consistency Regularization Optimized Variational Framework
Minghao Zhu, Junli Wang, Chungang Yan
Cross-document Misinformation Detection based on Event Graph Reasoning
Xueqing Wu, Kung-Hsiang Huang, Yi Fung, Heng Ji
Fine-tuning Pre-trained Language Models for Few-shot Intent Detection: Supervised Pre-training and Isotropization
Haode Zhang, Haowen Liang, Yuwei Zhang, Li-Ming Zhan, Xiao-Ming Wu, Xiaolei Lu, Albert Y.S. Lam
On the Robustness of Reading Comprehension Models to Entity Renaming
Jun Yan, Yang Xiao, Sagnik Mukherjee, Bill Yuchen Lin, Robin Jia, Xiang Ren
Towards Robust and Semantically Organised Latent Representations for Unsupervised Text Transfer
Sharan Narasimhan, Suvodip Dey, Maunendra Sankar Desarkar
On Synthetic Data for Back Translation
Jiahao Xu, Yubin Ruan, Wei Bi, Guoping Huang, Shuming Shi, Lihui Chen, Lemao Liu
Diversifying Neural Dialogue Generation via Negative Distillation
Yiwei Li, Shaoxiong Feng, Bin Sun, Kan Li
Ask Me Anything in Your Native Language
Nikita Sorokin, Dmitry Abulkhanov, Irina Piontkovskaya, Valentin Malykh
Understand before Answer: Improve Temporal Reading Comprehension via Precise Question Understanding
Hao Huang, Xiubo Geng, Guodong Long, Daxin Jiang
Empathic Machines: Using Intermediate Features as Levers to Emulate Emotions in Text-To-Speech Systems
Saiteja Kosgi, Sarath Sivaprasad, Niranjan Pedanekar, Anil Kumar Nelakanti, Vineet Gandhi
TSTR: Too Short to Represent, Summarize with Details! Intro-Guided Extended Summary Generation
Sajad Sotudeh, Nazli Goharian
Seed-Guided Topic Discovery with Out-of-Vocabulary Seeds
Yu Zhang, Yu Meng, Xuan Wang, Sheng Wang, Jiawei Han
GlobEnc: Quantifying Global Token Attribution by Incorporating the Whole Encoder Layer in Transformers
Ali Modarressi, Mohsen Fayyaz, Yadollah Yaghoobzadeh, Mohammad Taher Pilehvar
Cooperative Self-training of Machine Reading Comprehension
Hongyin Luo, Shang-Wen Li, Mingye Gao, Seunghak Yu, James R. Glass
Political Ideology and Polarization: A Multi-dimensional Approach
Barea Sinno, Bernardo Oviedo, Katherine Atwell, Malihe Alikhani, Junyi Jessy Li
CERES: Pretraining of Graph-Conditioned Transformer for Semi-Structured Session Data
Rui Feng, Chen Luo, Qingyu Yin, Bing Yin, Tuo Zhao, Chao Zhang
Knowledge-Grounded Dialogue Generation with a Unified Knowledge Representation
Yu Li, Baolin Peng, yelong shen, Yi Mao, Lars Liden, Zhou Yu, Jianfeng Gao
NewsEdits: A Dataset of News Article Revision Histories and a Novel Document-Level Reasoning Challenge
Alexander Spangher, Xiang Ren, Jonathan May, Nanyun Peng
Batch-Softmax Contrastive Loss for Pairwise Sentence Scoring Tasks
Anton Chernyavskiy, Dmitry Ilvovsky, Pavel Kalinin, Preslav Nakov
Semantic Diversity in Dialogue with Natural Language Inference
Katherine Stasaski, Marti Hearst
Learning Natural Language Generation from Scratch with Truncated Reinforcement Learning
Alice Martin, Guillaume Quispe, Charles Ollion, Sylvain Le Corff, Florian Strub, Olivier Pietquin
Social Norms Guide Reference Resolution
Mitchell Abrams, Matthias Scheutz
Putting the Con in Context: Identifying Deceptive Actors in the Game of Mafia
Samee Omotayo Ibraheem, Gaoyue Zhou, John DeNero
SwahBERT: Language Model of Swahili
Gati L Martin, Medard Medard Mswahili, Young-Seob Jeong, Jiyoung Woo
Main Conference - Short Papers
Embarrassingly Simple Performance Prediction for Abductive Natural Language Inference
Emīls Kadiķis, Vaibhav Srivastav, Roman Klinger
MCSE: Multimodal Contrastive Learning of Sentence Embeddings
Miaoran Zhang, Marius Mosbach, David Ifeoluwa Adelani, Michael A. Hedderich, Dietrich Klakow
Investigating Crowdsourcing Protocols for Evaluating the Factual Consistency of Summaries
Xiangru Tang, Alexander Fabbri, Haoran Li, Ziming Mao, Griffin Thomas Adams, Borui Wang, Asli Celikyilmaz, Yashar Mehdad, Dragomir Radev
Consistency Training with Virtual Adversarial Discrete Perturbation
Jungsoo Park, Gyuwan Kim, Jaewoo Kang
Conceptualizing Treatment Leakage in Text-based Causal Inference
Adel Daoud, Connor Thomas Jerzak, Richard Johansson
Label Definitions Improve Semantic Role Labeling
Li Zhang, Ishan Jindal, Yunyao Li
Contrastive Learning for Prompt-based Few-shot Language Learners
Yiren Jian, Chongyang Gao, Soroush Vosoughi
Embedding Hallucination for Few-shot Language Fine-tuning
Yiren Jian, Chongyang Gao, Soroush Vosoughi
Few-Shot Semantic Parsing with Language Models Trained On Code
Richard Shin, Benjamin Van Durme
Modeling Explicit Task Interactions in Document-Level Joint Entity and Relation Extraction
Liyan Xu, Jinho D. Choi
On the Origin of Hallucinations in Conversational Models: Is it the Datasets or the Models?
Nouha Dziri, Sivan Milton, Mo Yu, Osmar Zaiane, Siva Reddy
Meet Your Favorite Character: Open-domain Chatbot Mimicking Fictional Characters with only a Few Utterances
Seungju Han, Beomsu Kim, Jin Yong Yoo, Seokjun Seo, Sangbum Kim, Enkhbayar Erdenee, Buru Chang
Models In a Spelling Bee: Language Models Implicitly Learn the Character Composition of Tokens
Itay Itzhak, Omer Levy
Exact Paired Permutation Testing Algorithms for NLP Systems
Ran Zmigrod, Tim Vieira, Ryan D Cotterell
Mining Clues from Incomplete Utterance: A Query-enhanced Network for Incomplete Utterance Rewriting
Shuzheng Si, Shuang Zeng, Baobao Chang
Partial-input baselines show that NLI models can ignore context, but they don’t.
Neha Srikanth, Rachel Rudinger
How Do Construct-Driven vs. Construct-Agnostic Counterfactuals Affect the Robustness of Social Computing Models?
Indira Sen, Mattia Samory, Claudia Wagner, Isabelle Augenstein
Learning to Generate Examples for Semantic Processing Tasks
Danilo Croce, Simone Filice, Giuseppe Castellucci, Roberto Basili
Does Pre-training Induce Systematic Inference? How Masked Language Models Acquire Commonsense Knowledge
Ian Porada, Alessandro Sordoni, Jackie CK Cheung
Learning Cross-Lingual IR from an English Retriever
Yulong Li, Martin Franz, Md Arafat Sultan, Bhavani Iyer, Young-Suk Lee, Avirup Sil
A Follower-aware Speaker Model For Vision-and-Language Navigation
Zi-Yi Dou, Nanyun Peng
Uninformative Input Features and Counterfactual Invariance: Two Perspectives on Spurious Correlations in Natural Language
Jacob Eisenstein
Causal Distillation for Language Models
Zhengxuan Wu, Atticus Geiger, Joshua Rozner, Elisa Kreiss, Hanson Lu, Thomas Icard, Christopher Potts, Noah Goodman
Grapheme-to-Phoneme Conversion for Thai using Neural Regression Models
Tomohiro Yamasaki
A Data Cartography based MixUp for Pre-trained Language Models
Seo Yeon Park, Cornelia Caragea
Improving negation detection with negation-focused pre-training
Thinh Hung Truong, Timothy Baldwin, Trevor Cohn, Karin Verspoor
AISFG: Abundant Information Slot Filling Generator
Yang Yan, Junda Ye, Zhongbao Zhang, Liwen Wang
Collective Self-Labeling for Passage Retrieval
Jihyuk Kim, Minsoo Kim, seung-won hwang
Leaner and Faster: Two-Stage Model Compression for Lightweight Text-Image Retrieval
Siyu Ren, Kenny Q. Zhu
Generative Biomedical Entity Linking via Knowledge Base-Guided Pre-training and Synonyms-Aware Fine-tuning
Hongyi Yuan, Zheng Yuan, Sheng Yu
Towards Debiasing Translation Artifacts
KOEL DUTTA CHOWDHURY, Rricha Jalota, Cristina España-Bonet, Josef van Genabith
Is Neural Topic Modelling Better than Clustering? An Empirical Study on Clustering with Contextual Embeddings for Topics
Zihan Zhang, Meng Fang, Ling Chen, Mohammad Reza Namazi Rad
Does it really generalize well on unseen data? Systematic Evaluation of Relational Triple Extraction Methods
Juhyuk Lee, Min-Joong Lee, June Yong Yang, Eunho Yang
Quantifying Language Variation Acoustically with Few Resources
Martijn Bartelds, Martijn Wieling
ChapterBreak: A Challenge Dataset for Long-Range Language Models
Simeng Sun, Katherine Thai, Mohit Iyyer
How Conservative are Language Models? Adapting to the Introduction of Gender-Neutral Pronouns
Stephanie Brandl, Ruixiang Cui, Anders Søgaard
Exposing the Limits of Video-Text Models through Contrast Sets
Jae Sung Park, Sheng Shen, Ali Farhadi, Trevor Darrell, Yejin Choi, Anna Rohrbach
Improving Neural Models for Radiology Report Retrieval with Lexicon-based Automated Annotation
Luyao Shi, Tanveer Syeda-mahmood, Tyler Baldwin
UserIdentifier: Implicit User Representations for Simple and Effective Personalized Sentiment Analysis
Fatemehsadat Mireshghallah, Vaishnavi Shrivastava, Milad Shokouhi, Taylor Berg-Kirkpatrick, Robert Sim, Dimitrios Dimitriadis
Recognition of They/Them as Singular Personal Pronouns in Coreference Resolution
Connor Baumler, Rachel Rudinger
Tricks for Training Sparse Translation Models
Dheeru Dua, Shruti Bhosale, Vedanuj Goswami, James Cross, Mike Lewis, Angela Fan
Global Entity Disambiguation with BERT
Ikuya Yamada, Koki Washio, Hiroyuki Shindo, Yuji Matsumoto
Privacy-Preserving Text Classification on BERT Embeddings with Homomorphic Encryption
Garam Lee, Jai Hyun Park, Minsoo Kim, seung-won hwang, Jung Hee Cheon
Incorporating Centering Theory into Entity Coreference Resolution
Haixia Chai, Michael Strube
Modal Dependency Parsing via Language Model Priming
Jiarui Yao, Nianwen Xue, Bonan Min
The USMLE® Step 2 Clinical Skills Patient Note Corpus
Victoria Yaneva, Janet Mee, Le An Ha, Polina Harik, Michael Jodoin, Alex J Mechaber
Question-Evidence Similarity Learning for Long-Context Question Answering
Avi Caciularu, Ido Dagan, Jacob Goldberger, Arman Cohan
Using Natural Sentence Prompts for Understanding Biases in Language Models
Sarah Alnegheimish, Alicia Guo, Yi Sun
Data Augmentation with Dual Training for Offensive Span Detection
Nasim Nouri
Exploring the Role of Task Transferability in Large-Scale Multi-Task Learning
Vishakh Padmakumar, Leonard Lausen, Miguel Ballesteros, Sheng Zha, He He, George Karypis
Paragraph-based Transformer Pretraining for Multi-Sentence Inference
Luca Di Liello, Siddhant Garg, Luca Soldaini, Alessandro Moschitti
Cheat Codes to Quantify Missing Source Information in Neural Machine Translation
Proyag Pal, Kenneth Heafield
A Weakly Supervised Approach to Evaluating Single-Document Summarization via Negative Sampling
Forrest Sheng Bao, Ge Luo, Hebi Li, Cen Chen, Yinfei Yang, Youbiao He, Minghui Qiu
On the Diversity and Limits of Human Explanations
Chenhao Tan
Word Tour: One-dimensional Word Embeddings via the Traveling Salesman Problem
Ryoma Sato
Reference-free Summarization Evaluation via Semantic Correlation and Compression Ratio
Yizhu Liu, Qi Jia, Kenny Q. Zhu
Extending Multi-Text Sentence Fusion Resources via Pyramid Annotations
Daniela Brook Weiss, Paul Roit, Ori Ernst, Ido Dagan
Combining Humor and Sarcasm for Improving Political Parody Detection
Xiao Ao, Danae Sanchez Villegas, Daniel Preotiuc-Pietro, Nikolaos Aletras
BAD-X: Bilingual Adapters Improve Zero-Shot Cross-Lingual Transfer
Marinela Parovic, Goran Glavaš, Ivan Vulić, Anna Korhonen
Same Neurons, Different Languages: Probing Morphosyntax in Multilingual Pre-trained Models
Karolina Stanczak, Edoardo Ponti, Lucas Torroba Hennigen, Ryan D Cotterell, Isabelle Augenstein
SKILL: Structured Knowledge Infusion for Large Language Models
Fedor Moiseev, Zhe Dong, Enrique Alfonseca, Martin Jaggi
Twitter-COMMs: Detecting Climate, COVID, and Military Multimodal Misinformation
Giscard Biamby, Grace Luo, Trevor Darrell, Anna Rohrbach
Relation-Specific Attentions over Entity Mentions for Enhanced Document-Level Relation Extraction
Jiaxin Yu, Deqing Yang, Shuyu Tian
Pretrained Models for Multilingual Federated Learning
Orion Weller, Marc Marone, Vladimir Braverman, Dawn Lawrie, Benjamin Van Durme
Sort by Structure: Language Model Ranking as Dependency Probing
Max Müller-Eberstein, Rob van der Goot, Barbara Plank
Yes, No or IDK: The Challenge of Unanswerable Yes/No Questions
Elior Sulem, Jamaal Hay, Dan Roth
Socially Aware Bias Measurements for Hindi Language Representations
Vijit Malik, Sunipa Dev, Akihiro Nishi, Nanyun Peng, Kai-Wei Chang
On Curriculum Learning for Commonsense Reasoning
Adyasha Maharana, Mohit Bansal
Abstraction not Memory: BERT and the English Article System
Harish Tayyar Madabushi, Dagmar Divjak, Petar Milin
Generating Repetitions with Appropriate Repeated Words
Toshiki Kawamoto, Hidetaka Kamigaito, Kotaro Funakoshi, Manabu Okumura
PARADISE: Exploiting Parallel Data for Multilingual Sequence-to-Sequence Pretraining
Machel Reid, Mikel Artetxe
Easy Adaptation to Mitigate Gender Bias in Multilingual Text Classification
Xiaolei Huang
Analyzing Modality Robustness in Multimodal Sentiment Analysis
Devamanyu Hazarika, Yingting Li, Bo Cheng, Shuai Zhao, Roger Zimmermann, Soujanya Poria
EmRel: Joint Representation of Entities and Embedded Relations for Multi-triple Extraction
Benfeng Xu, Quan Wang, Yajuan Lyu, Yabing Shi, Yong Zhu, Jie Gao, Zhendong Mao
Building Multilingual Machine Translation Systems That Serve Arbitrary XY Translations
Akiko Eriguchi, Shufang Xie, Tao Qin, Hany Hassan
A Word is Worth A Thousand Dollars: Adversarial Attack on Tweets Fools Stock Prediction
Yong Xie, Dakuo Wang, Pin-Yu Chen, Jinjun Xiong, Sijia Liu, Oluwasanmi O Koyejo
A Robustly Optimized BMRC for Aspect Sentiment Triplet Extraction
Shu Liu, Kaiwen Li, Zuhe Li
SUBS: Subtree Substitution for Compositional Semantic Parsing
Jingfeng Yang, Le Zhang, Diyi Yang
LEA: Meta Knowledge-Driven Self-Attentive Document Embedding for Few-Shot Text Classification
Seungki Hong, Tae Young Jang
ErAConD: Error Annotated Conversational Dialog Dataset for Grammatical Error Correction
Xun Yuan, Derek Pham, Sam Davidson, Zhou Yu
Language Model Augmented Monotonic Attention for Simultaneous Translation
Sathish Reddy Indurthi, Mohd Abbas Zaidi, Beomseok Lee, Nikhil Kumar Lakumarapu, Sangha Kim
Special Theme Papers
On the Machine Learning of Ethical Judgments from Natural Language
Zeerak Talat, Hagen Blix, Josef Valvoda, Maya Indira Ganesh, Ryan D Cotterell, Adina Williams
Machine-in-the-Loop Rewriting for Creative Image Captioning
Vishakh Padmakumar, He He
Explaining Why: How Instructions and User Interfaces Impact Annotator Rationales When Labeling Text Data
Jamar L. Sullivan, Will Brackenbury, Andrew McNutt, Kevin Bryson, Kwam Byll, Yuxin Chen, Michael Littman, Chenhao Tan, Blase Ur
Automatic Correction of Human Translations
Jessy Lin, Geza Kovacs, Aditya Shastry, Joern Wuebker, John DeNero
An Exploration of Post-Editing Effectiveness in Text Summarization
Vivian Lai, Alison Smith-Renner, Ke Zhang, Ruijia Cheng, Wenjuan Zhang, Joel R. Tetreault, Alejandro Jaimes
Mapping the Design Space of Human-AI Interaction in Text Summarization
Ruijia Cheng, Alison Smith-Renner, Ke Zhang, Joel R. Tetreault, Alejandro Jaimes
User-driven research of medical Note Generation software
Tom Knoll, Francesco Moramarco, Alex Papadopoulos Korfiatis, Rachel Young, Claudia Ruffini, Mark Perera, Christian Perstl, Ehud Reiter, Anya Belz, Aleksandar Savkov
The Why and The How: A Survey on Natural Language Interaction in Visualization
Henrik Voigt, Ozge Alacam, Monique Meuschke, Kai Lawonn, Sina Zarrieß
Deconstructing NLG Evaluation: Evaluation Practices, Assumptions, and Their Implications
Kaitlyn Zhou, Su Lin Blodgett, Adam Trischler, Hal Daumé III, Kaheer Suleman, Alexandra Olteanu
Towards Process-Oriented, Modular, and Versatile Question Generation that Meets Educational Needs
Xu Wang, Simin Fan, Jessica Houghton, Lu Wang
Do Deep Neural Nets Display Human-like Attention in Short Answer Scoring?
Zijie Zeng, XINYU LI, Dragan Gasevic, Guanliang Chen
Two Contrasting Data Annotation Paradigms for Subjective NLP Tasks
Paul Rottger, Bertie Vidgen, Dirk Hovy, Janet B. Pierrehumbert
What Makes a Good and Useful Summary? Incorporating Users in Automatic Summarization Research
Maartje Ter Hoeve, Julia Kiseleva, Maarten de Rijke
Findings
Cross-Domain Classification of Moral Values
Enrico Liscio, Alin Eugen Dondera, Andrei Geadau, Catholijn M Jonker, Pradeep Kumar Murukannaiah
ID10M: Idiom Identification in 10 Languages
Simone Tedeschi, Federico Martelli, Roberto Navigli
Query2Particles: Knowledge Graph Reasoning with Particle Embeddings
Jiaxin Bai, Zihao Wang, Hongming Zhang, Yangqiu Song
Explore More Guidance: A Task-aware Instruction Network for Sign Language Translation Enhanced with Data Augmentation
Yong Cao, Wei Li, Xianzhi Li, Min Chen, Guangyong Chen, Long Hu, Zhengdao Li, Kai Hwang
Negative Sample is Negative in Its Own Way: Tailoring Negative Sentences for Image-Text Retrieval
Zhihao Fan, zhongyu wei, Zejun Li, Siyuan Wang, Xuanjing Huang, Jianqing Fan
Label Refinement via Contrastive Learning for Distantly-Supervised Named Entity Recognition
Huaiyuan Ying, Shengxuan Luo, Tiantian Dang, Sheng Yu
EA$^2$E: Improving Consistency with Event Awareness for Document-Level Argument Extraction
Qi Zeng, Qiusi Zhan, Heng Ji
Learning from Bootstrapping and Stepwise Reinforcement Reward: A Semi-Supervised Framework for Text Style Transfer
Zhengyuan Liu, Nancy F. Chen
Bridging the Gap between Training and Inference: Multi-Candidate Optimization for Diverse Neural Machine Translation
Huan Lin, Baosong Yang, Liang Yao, Dayiheng Liu, Haibo Zhang, jun xie, Min Zhang, Jinsong Su
AdapterBias: Parameter-efficient Token-dependent Representation Shift for Adapters in NLP Tasks
Chin-Lun Fu, Zih-Ching Chen, Yun-Ru Lee, Hung-yi Lee
TANet: Thread-Aware Pretraining for Abstractive Conversational Summarization
Ze Yang, Christian WANG, Zhoujin Tian, Wei Wu, Zhoujun Li
KETOD: Knowledge-Enriched Task-Oriented Dialogue
Zhiyu Chen, Bing Liu, Seungwhan Moon, Chinnadhurai Sankar, Paul A. Crook, William Yang Wang
Zero-Shot Event Detection Based on Ordered Contrastive Learning and Prompt-Based Prediction
Senhui Zhang, Tao Ji, Wendi Ji, Xiaoling Wang
DialoKG: Knowledge-Structure Aware Task-Oriented Dialogue Generation
Md Rashad Al Hasan Rony, Ricardo Usbeck, Jens Lehmann
Detect Rumors in Microblog Posts for Low-Resource Domains via Adversarial Contrastive Learning
Hongzhan Lin, Jing Ma, Liangliang Chen, Zhiwei Yang, Mingfei Cheng, Guang Chen
Weakly Supervised Text-to-SQL Parsing through Question Decomposition
Tomer Wolfson, Daniel Deutch, Jonathan Berant
MTG: A Benchmark Suite for Multilingual Text Generation
Yiran Chen, Zhenqiao Song, Xianze Wu, Danqing Wang, Jingjing Xu, Jiaze Chen, Hao Zhou, Lei Li
TaCL: Improving BERT Pre-training with Token-aware Contrastive Learning
Yixuan Su, Fangyu Liu, Zaiqiao Meng, Tian Lan, Lei Shu, Ehsan Shareghi, Nigel Collier
AMRize, then Parse! Enhancing AMR Parsing with PseudoAMR Data
Liang Chen, Peiyi Wang, Runxin Xu, Tianyu Liu, Zhifang Sui, Baobao Chang
Latent Group Dropout for Multilingual and Multidomain Machine Translation
Minh-Quang PHAM, François Yvon, Josep Crego
RCL: Relation Contrastive Learning for Zero-Shot Relation Extraction
Shusen Wang, Bosen Zhang, Yajing Xu, Yanan Wu, Bo Xiao
Textual Entailment for Event Argument Extraction: Zero- and Few-Shot with Multi-Source Learning
Oscar Sainz, Itziar Gonzalez-Dios, Oier Lopez de Lacalle, Bonan Min, Eneko Agirre
Learning Discriminative Representations for Open Relation Extraction with Instance Ranking and Label Calibration
Shusen Wang, Bin Duan, Yanan Wu, Yajing Xu
The Case for a Single Model that can Both Generate Continuations and Fill-in-the-Blank
Daphne Ippolito, Liam Dugan, Emily Reif, Ann Yuan, Andy Coenen, Chris Callison-Burch
CCQA: A New Web-Scale Question Answering Dataset for Model Pre-Training
Patrick Huber, Armen Aghajanyan, Barlas Oguz, Dmytro Okhonko, Scott Yih, Sonal Gupta, Xilun Chen
Unsupervised Domain Adaptation for Question Generation with DomainData Selection and Self-training
Peide Zhu, Claudia Hauff
Am I Me or You? State-of-the-Art Dialogue Models Cannot Maintain an Identity
Kurt Shuster, Jack Urbanek, Arthur Szlam, Jason E Weston
Context-Aware Language Modeling for Goal-Oriented Dialogue Systems
Charlie Victor Snell, Mengjiao Yang, Justin Fu, Yi Su, Sergey Levine
Jointly Learning Guidance Induction and Faithful Summary Generation via Conditional Variational Autoencoders
Wang Xu, Tiejun Zhao
Continual Machine Reading Comprehension via Uncertainty-aware Fixed Memory and Adversarial Domain Adaptation
Zhijing Wu, Hua Xu, Jingliang Fang
Denoising Neural Network for News Recommendation with Positive and Negative Implicit Feedback
Yunfan Hu, Zhaopeng Qiu, Xian Wu
Analytical Reasoning of Text
Wanjun Zhong, Siyuan Wang, Duyu Tang, Zenan Xu, Daya Guo, Yining Chen, Jiahai Wang, Jian Yin, Ming Zhou, Nan Duan
Weakly Supervised Text Classification using Supervision Signals from a Language Model
Ziqian Zeng, Weimin Ni, Tianqing Fang, Xiang Li, Xinran Zhao, Yangqiu Song
CLMLF:A Contrastive Learning and Multi-Layer Fusion Method for Multimodal Sentiment Detection
Zhen Li, Bing Xu, Conghui Zhu, Tiejun Zhao
LiST: Lite Prompted Self-training Makes Efficient Few-shot Learners
Yaqing Wang, Subhabrata Mukherjee, Xiaodong Liu, Jing Gao, Ahmed Hassan Awadallah, Jianfeng Gao
A Dog Is Passing Over The Jet? A Text-Generation Dataset for Korean Commonsense Reasoning and Evaluation
Jaehyung Seo, Seounghoon Lee, Chanjun Park, Yoonna Jang, Hyeonseok Moon, Sugyeong Eo, Seonmin Koo, Heuiseok Lim
A Label-Aware Autoregressive Framework for Cross-Domain NER
Jinpeng Hu, He Zhao, Dan dan Guo, Xiang Wan, Tsung-Hui Chang
D2GCLF: Document-to-Graph Classifier for Legal Document Classification
Qiqi Wang, Kaiqi Zhao, Robert Amor, Benjamin Liu, Ruofan Wang
Specializing Pre-trained Language Models for Better Relational Reasoning via Network Pruning
Siyu Ren, Kenny Q. Zhu
On the Limitations of Dataset Balancing: The Lost Battle Against Spurious Correlations
Roy Schwartz, Gabriel Stanovsky
Multi-stage Distillation Framework for Cross-Lingual Semantic Similarity Matching
Kunbo Ding, Weijie Liu, Yuejian Fang, Zhe Zhao, Qi Ju, Xuefeng Yang, Rong Tian, Zhu Tao, Haoyan Liu, Han Guo, Xingyu Bai, Weiquan Mao, Yudong Li, Weigang Guo, Taiqiang Wu, Ningyuan Sun
BORT: Back and Denoising Reconstruction for End-to-End Task-Oriented Dialog
Haipeng Sun, Junwei Bao, Youzheng Wu, Xiaodong He
CL-ReLKT: Cross-lingual Language Knowledge Transfer for Multilingual Retrieval Question Answering
Peerat Limkonchotiwat, Wuttikorn Ponwitayarat, Can Udomcharoenchaikit, Ekapol Chuangsuwanich, Sarana Nutanong
Towards Job-Transition-Tag Graph for a Better Job Title Representation Learning
Jun ZHU, CELINE HUDELOT
Semantic-Preserving Abstractive Text Summarization with Siamese Generative Adversarial Net
Xin Sheng, Linli Xu, Yinlong Xu, Deqiang Jiang, Bo Ren
Balancing Multi-Domain Corpora Learning for Open-Domain Response Generation
Yujie Xing, Jinglun Cai, Nils Barlaug, Peng Liu, Jon Atle Gulla
Learning Structural Information for Syntax-Controlled Paraphrase Generation
Erguang Yang, Chenglin Bai, Deyi Xiong, Yujie Zhang, Yao Meng, Jinan Xu, Yufeng Chen
Capturing Conversational Interaction for Question Answering via Global History Reasoning
Jin Qian, Bowei Zou, Mengxing Dong, Xiao Li, AiTi Aw, Yu Hong
Learning to Execute Actions or Ask Clarification Questions
Zhengxiang Shi, Yue Feng, Aldo Lipani
Por Qué Não Utiliser Alla Språk? Mixed Training with Gradient Optimization in Few-Shot Cross-Lingual Transfer
Haoran Xu, Kenton Murray
Beyond Distributional Hypothesis: Let Language Models Learn Meaning-Text Correspondence
M.J Jang, Frank Martin Mtumbuka, Thomas Lukasiewicz
Challenges in Generalization in Open Domain Question Answering
Linqing Liu, Patrick Lewis, Sebastian Riedel, Pontus Stenetorp
NLU++: A Multi-Label, Slot-Rich, Generalisable Dataset for Natural Language Understanding in Task-Oriented Dialogue
Inigo Casanueva, Ivan Vulić, Georgios P. Spithourakis, Paweł Budzianowski
Uncertainty-Aware Cross-Lingual Transfer with Pseudo Partial Labels
Shuo Lei, Xuchao Zhang, Jianfeng He, Fanglan Chen, Chang-Tien Lu
What kinds of errors do reference resolution models make and what can we learn from them?
Jorge Sánchez, Mauricio Mazuecos, Hernán Maina, Luciana Benotti
Great Power, Great Responsibility: Recommendations for Reducing Energy for Training Language Models
Joseph McDonald, Baolin Li, Nathan C. Frey, Devesh Tiwari, Vijay Gadepally, Siddharth Samsi
Event Detection for Suicide Understanding
Luis Fernando Guzman-Nateras, Viet Dac Lai, Amir Pouran Ben Veyseh, Franck Dernoncourt, Thien Huu Nguyen
BehancePR: A Punctuation Restoration Dataset for Livestreaming Video Transcript
Viet Dac Lai, Amir Pouran Ben Veyseh, Franck Dernoncourt, Thien Huu Nguyen
XLTime: A Cross-Lingual Knowledge Transfer Framework for Temporal Expression Extraction
Yuwei Cao, William Groves, Tanay Kumar Saha, Joel R. Tetreault, Alex Jaimes, Hao Peng, Philip S. Yu
Make The Most of Prior Data: A Solution for Interactive Text Summarization with Preference Feedback
Duy-Hung Nguyen, Nguyen Viet Dung Nghiem, Bao-Sinh Nguyen, Tien Dung Le, Minh-Tien Nguyen, Shahab Sabahi, Hung Le
A Timestep aware Sentence Embedding and Acme Coverage for Brief but Informative Title Generation
Quanbin Wang, XieXiong Lin, Feng Wang
METGEN: A Module-based Entailment Tree Generation Framework for Answer Explanation
Ruixin Hong, Hongming Zhang, Xintong Yu, Changshui Zhang
Delving Deep into Regularity: A Simple but Effective Method for Chinese Named Entity Recognition
Yingjie Gu, Xiaoye Qu, Zhefeng Wang, Yi ZHENG, Baoxing Huai, Nicholas Jing Yuan
Cross-Lingual Cross-Modal Consolidation for Effective Multilingual Video Corpus Moment Retrieval
Jiaheng Liu, Tan Yu, Hanyu Peng, Mingming Sun, Ping Li
SeaD: End-to-end Text-to-SQL Generation with Schema-aware Denoising
Kuan Xu, Yongbo Wang, Yongliang Wang, Zihao Wang, Zujie Wen, Yang Dong
HUE: Pretrained Model and Dataset for Understanding Hanja Documents of Ancient Korea
Haneul Yoo, Jiho Jin, Juhee Son, JinYeong Bak, Kyunghyun Cho, Alice Oh
Learn from Relation Information: Towards Prototype Representation Rectification for Few-Shot Relation Extraction
Yang Liu, Jinpeng Hu, Xiang Wan, Tsung-Hui Chang
Exploiting Numerical-Contextual Knowledge to Improve Numerical Reasoning in Question Answering
Jeonghwan Kim, Junmo Kang, Kyung-min Kim, Giwon Hong, Sung-Hyon Myaeng
Exploring the Universal Vulnerability of Prompt-based Learning Paradigm
Lei Xu, Yangyi Chen, Ganqu Cui, Hongcheng Gao, Zhiyuan Liu
Crake: Causal-Enhanced Table-Filler for Question Answering over Large Scale Knowledge Base
Minhao Zhang, Ruoyu Zhang, Yanzeng Li, Lei Zou
Minimally-Supervised Relation Induction from Pre-trained Language Model
Lu Sun, Yongliang Shen, Weiming Lu
When do Contrastive Word Alignments Improve Many-to-many Neural Machine Translation?
Zhuoyuan Mao, Chenhui Chu, Raj Dabre, Haiyue Song, Zhen Wan, Sadao Kurohashi
Detecting Narrative Elements in Informational Text
Effi Levi, Guy Mor, Tamir Sheafer, Shaul Shenhav
Analyzing the Intensity of Complaints on Social Media
MING FANG, Shi Zong, Jing Li, Xinyu Dai, Shujian Huang, Jiajun Chen
$Great~Truths~are ~Always ~Simple:$ A Rather Simple Knowledge Encoder for Enhancing the Commonsense Reasoning Capacity of Pre-Trained Models
Jinhao Jiang, Kun Zhou, Ji-Rong Wen, Xin Zhao
Identifying and Mitigating Spurious Correlations for Improving Robustness in NLP Models
Tianlu Wang, Rohit Sridhar, Diyi Yang, Xuezhi Wang
Zero-shot Entity Linking with Less Data
G P Shrivatsa Bhargav, Dinesh Khandelwal, Saswati Dana, Dinesh Garg, Pavan Kapanipathi, Salim Roukos, Alexander Gray, L Venkata Subramaniam
A Dual-Channel Framework for Sarcasm Recognition by Detecting Sentiment Conflict
Yiyi Liu, Yequan Wang, Aixin Sun, Xuying Meng, Jing Li, Jiafeng Guo
EVI: Multilingual Spoken Dialogue Tasks and Dataset for Knowledge-Based Enrolment, Verification, and Identification
Georgios P. Spithourakis, Ivan Vulić, Michał Lis, Inigo Casanueva, Paweł Budzianowski
Pruning Adatperfusion with Lottery Ticket Hypothesis
Jiarun Wu, Qingliang Chen, Zeguan Xiao, Yuliang Gu, Mengsi Sun
The Role of Context in Detecting Previously Fact-Checked Claims
Shaden Shaar, Firoj Alam, Giovanni Da San Martino, Preslav Nakov
Good Visual Guidance Make A Better Extractor: Hierarchical Visual Prefix for Multimodal Entity and Relation Extraction
Xiang Chen, Ningyu Zhang, Lei Li, Yunzhi Yao, Shumin Deng, Chuanqi Tan, Fei Huang, Luo Si, Huajun Chen
Dependency Position Encoding for Relation Extraction
Qiushi Guo, Xin Wang, Dehong Gao
DISARM: Detecting the Victims Targeted by Harmful Memes
Shivam Sharma, Md Shad Akhtar, Preslav Nakov, Tanmoy Chakraborty
Hierarchical Transformers Are More Efficient Language Models
Piotr Nawrot, Szymon Tworkowski, Michał Tyrolski, Lukasz Kaiser, Yuhuai Wu, Christian Szegedy, Henryk Michalewski
White-box Testing of NLP models with Mask Neuron Coverage
Arshdeep Sekhon, Yangfeng Ji, Matthew Dwyer, Yanjun Qi
UniK-QA: Unified Representations of Structured and Unstructured Knowledge for Open-Domain Question Answering
Barlas Oguz, Xilun Chen, Vladimir Karpukhin, Stan Peshterliev, Dmytro Okhonko, Michael Sejr Schlichtkrull, Sonal Gupta, Yashar Mehdad, Scott Yih
Domain-matched Pre-training Tasks for Dense Retrieval
Barlas Oguz, Kushal Lakhotia, Anchit Gupta, Patrick Lewis, Vladimir Karpukhin, Aleksandra Piktus, Xilun Chen, Sebastian Riedel, Scott Yih, Sonal Gupta, Yashar Mehdad
Efficient Few-Shot Fine-Tuning for Opinion Summarization
Arthur Brazinskas, Ramesh Nallapati, Mohit Bansal, Markus Dreyer
Temporal Attention for Language Models
Guy D. Rosin, Kira Radinsky
MixQG: Neural Question Generation with Mixed Answer Types
Lidiya Murakhovs’ka, Chien-Sheng Wu, Philippe Laban, Tong Niu, Wenhao Liu, Caiming Xiong
BitextEdit: Automatic Bitext Editing for Improved Low-Resource Machine Translation
Eleftheria Briakou, Sida Wang, Luke Zettlemoyer, Marjan Ghazvininejad
Exploring Neural Models for Query-Focused Summarization
Jesse Vig, Alexander Fabbri, Wojciech Maciej Kryscinski, Chien-Sheng Wu, Wenhao Liu
Pathway2Text: Dataset and Method for Biomedical Pathway Description Generation
Junwei Yang, Zequn Liu, Ming Zhang, Sheng Wang
All Information is Valuable: Question Matching over Full Information Transmission Network
Le Qi, Yu Zhang, Qingyu Yin, Guidong Zheng, wen junjie, Jinlong Li, Ting Liu
Improving Few-Shot Image Classification Using Machine- and User-Generated Natural Language Descriptions
Kosuke Nishida, Kyosuke Nishida, Shuichi Nishioka
Learn To Remember: Transformer with Recurrent Memory for Document-level Machine Translation
Yukun Feng, Feng Li, Ziang Song, Boyuan Zheng, Philipp Koehn
Unbiased Math Word Problems Benchmark for Mitigating Solving Bias
ZhiCheng Yang, Jinghui Qin, Jiaqi Chen, Xiaodan Liang
RAIL-KD: RAndom Intermediate Layer Mapping for Knowledge Distillation
Md Akmal Haidar, NITHIN ANCHURI, Mehdi Rezagholizadeh, Abbas Ghaddar, Philippe Langlais, Pascal Poupart
Empowering parameter-efficient transfer learning by recognizing the kernel structure in self-attention
Yifan Chen, Devamanyu Hazarika, Mahdi Namazifar, Yang Liu, Di Jin, Dilek Hakkani-Tur
Low-resource Entity Set Expansion: A Comprehensive Study on User-generated Text
Yutong Shao, Nikita Bhutani, Sajjadur Rahman, Estevam Hruschka
ALLSH: Active Learning Guided by Local Sensitivity and Hardness
Shujian Zhang, Chengyue Gong, Xingchao Liu, Pengcheng He, Weizhu Chen, Mingyuan Zhou
BanglaBERT: Language Model Pretraining and Benchmarks for Low-Resource Language Understanding Evaluation in Bangla
Abhik Bhattacharjee, Tahmid Hasan, Kazi Samin Mubasshir, Md Saiful Islam, Wasi Uddin Ahmad, Anindya Iqbal, M. Sohel Rahman, Rifat Shahriyar
To Answer or Not To Answer? Improving Machine Reading Comprehension Model with Span-based Contrastive Learning
Yunjie Ji, Liangyu Chen, Chenxiao Dou, Baochang Ma, Xiangang Li
Syntax Controlled Knowledge Graph-to-Text Generation with Order and Semantic Consistency
Jin Liu, Chongfeng Fan, Fengyu Zhou, Huijuan Xu
A Survey on Stance Detection for Mis- and Disinformation Identification
Momchil Hardalov, Arnav Arora, Preslav Nakov, Isabelle Augenstein
FAtNet: Cost-Effective Approach Towards Mitigating the Linguistic Bias in Speaker Verification Systems
Divya V Sharma, Arun Balaji Buduru
Retrieval-Augmented Multilingual Keyphrase Generation with Retriever-Generator Iterative Training
Yifan Gao, Qingyu Yin, zheng li, Rui Meng, Tong Zhao, Bing Yin, Irwin King, Michael Lyu
End-to-end Spoken Conversational Question Answering: Task, Dataset and Model
Chenyu You, Nuo Chen, Fenglin Liu, Shen Ge, Xian Wu, Yuexian Zou
Towards Computationally Feasible Deep Active Learning
Akim Tsvigun, Artem Shelmanov, Gleb Kuzmin, Leonid Sanochkin, Daniil Larionov, Gleb Gennadjevich Gusev, Manvel Avetisian, Leonid Zhukov
DecBERT: Enhancing the Language Understanding of BERT with Causal Attention Masks
Ziyang Luo, Yadong Xi, Jing Ma, Zhiwei Yang, Xiaoxi Mao, Changjie Fan, Rongsheng Zhang
Seeing the wood for the trees: a contrastive regularization method for the low-resource Knowledge Base Question Answering
Junping Liu, Shijie Mei, Xinrong Hu, Xun Yao, JACK Yang, Yi Guo
CODE-MVP: Learning to Represent Source Code from Multiple Views with Contrastive Pre-Training
Xin Wang, Yasheng Wang, Yao Wan, Jiawei Wang, Pingyi Zhou, Li Li, Hao Wu, Jin Liu
A Self-supervised Joint Training Framework for Document Reranking
Xiaozhi Zhu, Tianyong Hao, Sijie Cheng, Fu Lee Wang, Hai Liu
‘Diversity and Uncertainty in Moderation’’ are the Key to Data Selection for Multilingual Few-shot Transfer
Shanu Kumar, Sandipan Dandapat, Monojit Choudhury
Probing the Role of Positional Information in Vision-Language Models
Philipp J. Rösch, Jindřich Libovický
Masked Summarization to Generate Factually Inconsistent Summaries for Improved Factual Consistency Checking
Hwanhee Lee, Kang Min Yoo, Joonsuk Park, Hwaran Lee, Kyomin Jung
Restoring Hebrew Diacritics Without a Dictionary
Elazar Gershuni, Yuval Pinter
MWP-BERT: Numeracy-Augmented Pre-training for Math Word Problem Solving
Zhenwen Liang, Jipeng Zhang, Lei Wang, Wei QIN, Yunshi Lan, Jie Shao, Xiangliang Zhang
QLEVR: A Diagnostic Dataset for Quantificational Language and Elementary Visual Reasoning
Zechen Li, Anders Søgaard
MM-Claims: A Dataset for Multimodal Claim Detection in Social Media
Gullal Singh Cheema, Sherzod Hakimov, Abdul Sittar, Eric Müller-Budack, Christian Otto, Ralph Ewerth
SHARP: Search-Based Adversarial Attack for Structured Prediction
Liwen Zhang, Zixia Jia, Wenjuan Han, Zilong Zheng, Kewei Tu
Self-Training with Differentiable Teacher
Simiao Zuo, Yue Yu, Chen Liang, Haoming Jiang, Siawpeng Er, Chao Zhang, Tuo Zhao, Hongyuan Zha
An Information-Theoretic Approach and Dataset for Probing Gender Stereotypes in Multilingual Masked Language Models
Victor Steinborn, Philipp Dufter, Haris Jabbar, Hinrich Schütze
Improving Contextual Representation with Gloss Regularized Pre-training
Yu Lin, Zhecheng An, Peihao Wu, Zejun MA
Learning Rich Representation of Keyphrases from Text
Mayank Kulkarni, Debanjan Mahata, Ravneet Singh Arora, Rajarshi Bhowmik
Efficient Learning of Multiple NLP Tasks via Collective Weight Factorization on BERT
Christos Charalampos Papadopoulos, Yannis Panagakis, Manolis Koubarakis, Mihalis Nicolaou
The Limits of Word Level Differential Privacy
Justus Mattern, Benjamin Weggenmann, Florian Kerschbaum
Attention Fusion: a light yet efficient late fusion mechanism for task adaptation in NLU
Jin Cao, Chandana Satya Prakash, Wael Hamza
Empathetic Persuasion: Reinforcing Empathy and Persuasiveness in Dialogue Systems
Azlaan Mustafa Samad, Kshitij Mishra, Mauajama Firdaus, Asif Ekbal
Measuring and Improving Compositional Generalization in Text-to-SQL via Component Alignment
Yujian Gan, Xinyun Chen, Qiuping Huang, Matthew Purver
Learning to Embed Multi-Modal Contexts for Situated Conversational Agents
Yunseon Choi, Oh Joon Kwon, Haeju Lee, Kee-Eung Kim, Jinhyeon Kim, Ran Han, Yoonhyung Kim, Youngjune Lee, Minho Park, Kangwook Lee, Haebin Shin
MultiNER: A Multilingual, Multi-Genre and Fine-Grained Dataset for Named Entity Recognition
Simone Tedeschi, Roberto Navigli
Permutation Invariant Strategy Using Transformer Encoders for Table Understanding
Sarthak Dash, Sugato Bagchi, Nandana Mihindukulasooriya, Alfio Gliozzo
A Generative Language Model for Few-shot Aspect-Based Sentiment Analysis
Ehsan Hosseini-Asl, Wenhao Liu, Caiming Xiong
LM-CORE: Language Models with Contextually Relevant External Knowledge
Jivat Neet Kaur, Sumit Bhatia, Milan Aggarwal, Rachit Bansal, Balaji Krishnamurthy
Challenging America: Modeling language in longer time scales
Jakub Pokrywka, Filip Graliński, Krzysztof Jassem, Karol Kaczmarek, Krzysztof Jan Jurkiewicz, Piotr Wierzchon
LongT5: Efficient Text-To-Text Transformer for Long Sequences
Mandy Guo, Joshua Ainslie, David Uthus, Santiago Ontanon, Jianmo Ni, Yun-Hsuan Sung, Yinfei Yang
A Versatile Adaptive Curriculum Learning Framework for Task-oriented Dialogue Policy Learning
Yang Yang Zhao, Hua Qin, Wang Zhenyu, Changxi Zhu, Shihan Wang
Data Augmentation for Low-Resource Dialogue Summarization
Joshua Maynez, Yongtai Liu, Shashi Narayan, Gonçalo Simões
Entity Cloze By Date: Understanding what LMs know about unseen entities
Yasumasa Onoe, Michael JQ Zhang, Eunsol Choi, Greg Durrett
LMTurk: Few-Shot Learners as Crowdsourcing Workers in a Language-Model-as-a-Service Framework
Mengjie Zhao, Fei Mi, Yasheng Wang, Minglei Li, Xin Jiang, Qun Liu, Hinrich Schuetze
Opponent Modeling in Negotiation Dialogues by Related Data Adaptation
Kushal Chawla, Gale Lucas, Jonathan May, Jonathan Gratch
Language Models for Code-switch Detection of te reo Māori and English in a Low-resource Setting
Jesin James, Vithya Yogarajan, Isabella Shields, Catherine Watson, Peter Keegan, Keoni Mahelona, Peter-Lucas Jones
CoCoA-MT: A Dataset and Benchmark for Contrastive Controlled MT with Application to Formality
Maria Nadejde, Anna Currey, Benjamin Hsu, Xing Niu, Georgiana Dinu, Marcello Federico
StATIK: Structure and Text for Inductive Knowledge Graph Completion
Elan Sopher Markowitz, Keshav Balasubramanian, Mehrnoosh Mirtaheri, Murali Annavaram, Aram Galstyan, Greg Ver Steeg
Instilling Type Knowledge in Language Models via Multi-Task QA
Shuyang Li, Mukund Sridhar, Chandana Satya Prakash, Jin Cao, Wael Hamza, Julian McAuley
Penn-Helsinki Parsed Corpus of Early Modern English: First Parsing Results and Analysis
Seth Kulick, Neville Ryant, Beatrice Santorini
Anti-Overestimation Dialogue Policy Learning for Task-Completion Dialogue System
Chang Tian, Wenpeng Yin, Marie-Francine Moens
On Measuring Social Biases in Prompt-Based Learning
Afra Feyza Akyürek, Sejin Paik, Muhammed Yusuf Kocyigit, Seda Akbiyik, Şerife Leman Runyun, Derry Wijaya
Modeling Ideological Salience and Framing in Polarized Online Groups with Graph Neural Networks and Structured Sparsity
Valentin Hofmann, Xiaowen Dong, Janet B. Pierrehumbert, Hinrich Schuetze
Improving the Faithfulness of Abstractive Summarization via Entity Coverage Control
Haopeng Zhang, Semih Yavuz, Wojciech Maciej Kryscinski, Kazuma Hashimoto, Yingbo Zhou
Fine-grained Image Captioning with CLIP Reward
Jaemin Cho, Seunghyun Yoon, Ajinkya Kale, Franck Dernoncourt, Trung Bui, Mohit Bansal
Harmless Transfer Learning for Item Embeddings
Chengyue Gong, xiaocong du, Dhruv Choudhary, Bhargav Bhushanam, qiang liu, Arun Kejariwal
A Question-Answer Driven Approach to Reveal Affirmative Interpretations from Verbal Negations
Md Mosharaf Hossain, Luke Holman, Anusha Kakileti, Tiffany Iris Kao, Nathan Raul Brito, Aaron Abraham Mathews, Eduardo Blanco
Video-based Multimodal Intent Discovery
Adyasha Maharana, Quan Hung Tran, Seunghyun Yoon, Franck Dernoncourt, Trung Bui, Walter W Chang, Mohit Bansal
Entailment Tree Explanations via Iterative Retrieval-Generation Reasoner
Danilo Neves Ribeiro, Shen Wang, Xiaofei Ma, Henghui Zhu, Rui Dong, Xinchi Chen, Peng Xu, zhiheng huang, Andrew Arnold, Dan Roth
Improving Few-Shot Relation Classification by Prototypical Representation Learning with Definition Text
Li Zhenzhen, Yuyang Zhang, Jian-Yun Nie, Dongsheng Li
Literature-Augmented Clinical Outcome Prediction
Aakanksha Naik, Sravanthi Parasa, Sergey Feldman, Lucy Lu Wang, Tom Hope
DOCmT5: Document-Level Pre-training of Multilingual Language Models
Chia-Hsuan Lee, Aditya Siddhant, Viresh Ratnakar, Melvin Johnson
Few-Shot Self-Rationalization with Natural Language Prompts
Ana Marasovic, Iz Beltagy, Doug Downey, Matthew E Peters
From Cognitive to Computational Modeling: Text-based Risky Decision-Making Guided by Fuzzy Trace Theory
Jaron Mar, Jiamou Liu
TEAM: A multitask learning based Taxonomy Expansion approach for Attach and Merge
Bornali Phukon, Anasua Mitra, Ranbir Singh Sanasam, Priyankoo Sarmah
Learning to repair: Repairing model output errors after deployment using a dynamic memory of feedback
Niket Tandon, Aman Madaan, Peter Clark, Yiming Yang
PCEE-BERT: Accelerating BERT Inference via Patient and Confident Early Exiting
Zhen Zhang, Wei Zhu, Jinfan Zhang, Peng Wang, Rize Jin, Tae-Sun Chung
Hierarchical Relation-Guided Type-Sentence Alignment for Long-Tail Relation Extraction with Distant Supervision
Yang Li, Guodong Long, Tao Shen, Jing Jiang
Exploring the Value of Multi-View Learning for Session-Aware Query Representation
Diego Ortiz, Jose G Moreno, Gilles Hubert, Karen Pinel-Sauvagnat, Lynda Tamine Tamine
A Framework to Generate High-quality Datapoints for Multiple Novel Intent Detection
Ankan Mullick, Sukannya Purkayastha, Pawan Goyal, Niloy Ganguly
Zero-shot Cross-lingual Conversational Semantic Role Labeling
Han Wu, Haochen Tan, Kun Xu, Shuqi LIU, Lianwei Wu, Linqi Song
PerKGQA: Question Answering over Personalized Knowledge Graphs
Ritam Dutt, Kasturi Bhattacharjee, Rashmi Gangadharaiah, Dan Roth, Carolyn Rose
FreeTransfer-X: Safe and Annotation-Free Cross-Lingual Transfer for Different Networks
Yinpeng Guo, Liangyou Li, Xin Jiang, Qun Liu
Lacuna Reconstruction: Self-supervised Pre-training for Low-Resource Historical Document Transcription
Nikolai Vogler, Jonathan Parkes Allen, Matthew Thomas Miller, Taylor Berg-Kirkpatrick
SemAttack: Natural Textual Attacks via Different Semantic Spaces
Boxin Wang, Chejian Xu, Xiangyu Liu, Yu Cheng, Bo Li
Multi-Hop Open-Domain Question Answering over Structured andUnstructured Knowledge
Yue Feng, Zhen Han, Mingming Sun, Ping Li
How to Translate Your Samples and Choose Your Shots? Analyzing Translate-train & Few-shot Cross-lingual Transfer
Iman Jundi, Gabriella Lapesa
In-BoXBART: Get Instructions into Biomedical Multi-task Learning
Mihir Parmar, Swaroop Mishra, Mirali Purohit, Man Luo, M. Hassan Murad, Chitta Baral
Self-Supervised Contrastive Learning with Adversarial Perturbations for Defending Word Substitution-based Attacks
Zhao Meng, Yihan Dong, Mrinmaya Sachan, Roger Wattenhofer
An Item Response Theory Framework for Persuasion
Anastassia Kornilova, Vladimir Eidelman, Daniel Argyle
LongChecker: Improving scientific claim verification by modeling full-abstract context
David Wadden, Kyle Lo, Lucy Lu Wang, Arman Cohan, Iz Beltagy, Hannaneh Hajishirzi
SEQZERO: Few-shot Compositional Semantic Parsing with Sequential Prompts and Zero-shot Models
Jingfeng Yang, Haoming Jiang, Qingyu Yin, Danqing Zhang, Bing Yin, Diyi Yang
Improving Conversational Recommendation Systems’ Quality with Context-Aware Item Meta-Information
Bowen Yang, Cong Han, Yu Li, Lei Zuo, Zhou Yu
PromptGen: Automatically Generate Prompts using Generative Models
Yue Zhang, Hongliang Fei, Dingcheng Li, Ping Li
Masked Measurement Prediction: Learning to Jointly Predict Quantities and Units from Textual Context
Daniel Spokoyny, Ivan Lee, Zhao Jin, Taylor Berg-Kirkpatrick
PubHealthTab: A Public Health Table-based Dataset for Evidence-based Fact Checking
Mubashara Akhtar, Oana Cocarascu, Elena Simperl
One Size Does Not Fit All: The Case for Personalised Word Complexity Models
Sian Gooding, Manuel Tragut
Design Challenges for a Multi-Perspective Search Engine
Sihao Chen, Siyi Liu, Xander Uyttendaele, Yi Zhang, William W. Bruno, Dan Roth
Aligning Generative Language Models with Human Values
Ruibo Liu, Ge Zhang, Xinyu Feng, Soroush Vosoughi
Opportunities for Human-centered Evaluation of Machine Translation Systems
Daniel J. Liebling, Katherine A Heller, Samantha Robertson, Wesley Hanwen Deng
POLITICS: Pretraining with Same-story Article Comparison for Ideology Prediction and Stance Detection
Yujian Liu, Xinliang Frederick Zhang, David Wegsman, Nicholas Beauchamp, Lu Wang
Target-Guided Dialogue Response Generation Using Commonsense and Data Augmentation
Prakhar Gupta, Harsh Jhamtani, Jeffrey Bigham
CRUSH: Contextually Regularized and User anchored Self-supervised Hate speech Detection
Souvic Chakraborty, Parag Dutta, Sumegh Roychowdhury, Animesh Mukherjee
Industry Track
Scalable and Robust Self-Learning for Skill Routing in Large-Scale Conversational AI Systems
Mohammad Kachuee, Jinseok Nam, Sarthak Ahuja, Jin-Myung Won, SUNGJIN LEE
CREATER: CTR-driven Advertising Text Generation with Controlled Pre-Training and Contrastive Fine-Tuning
Penghui Wei, Xuanhua Yang, ShaoGuo Liu, Liang Wang, Bo Zheng
Augmenting Poetry Composition with Verse by Verse
David Uthus, Maria Voitovich, R.J. Mical
AB/BA analysis: A framework for estimating keyword spotting recall improvement while maintaining audio privacy
Raphael Petegrosso, VasistaKrishna Baderdinnni, Thibaud Senechal, Benjamin Bullough
Temporal Generalization for Spoken Language Understanding
Judith Gaspers, Anoop Kumar, Greg Ver Steeg, Aram Galstyan
An End-to-End Dialogue Summarization System for Sales Calls
Abedelkadir Asi, Song Wang, Roy Eisenstadt, Dean Geckt, Yarin Kuper, Yi Mao, Royi Ronen
Controlled Data Generation via Insertion Operations for NLU
Manoj Kumar, Yuval Merhav, Haidar Khan, Rahul Gupta, Anna Rumshisky, Wael Hamza
Easy and Efficient Transformer: Scalable Inference Solution For Large NLP Model
Li GongZheng LGZ, Yadong Xi, Jingzhen Ding, Duan Wang, Ziyang Luo, Rongsheng Zhang, Bai Liu, Changjie Fan, Xiaoxi Mao, Zeng Zhao
Self-supervised Product Title Rewrite for Product Listing Ads
Xue Zhao, Dayiheng Liu, Junwei Ding, Liang Yao, Mahone Yan, Huibo wang, Wenqing Yao
Efficient Semi-supervised Consistency Training for Natural Language Understanding
George Leung, Joshua Tan
Distantly Supervised Aspect Clustering And Naming For E-Commerce Reviews
Prateek Sircar, Aniket Chakrabarti, DEEPAK GUPTA, Anirban Majumder
Local-to-global learning for iterative training of production SLU models on new features
Yulia Grishina, Daniil Sorokin
CULG: Commercial Universal Language Generation
Haonan Li, yameng huang, Yeyun Gong, Jian Jiao, Ruofei Zhang, Timothy Baldwin, Nan Duan
Constraining word alignments with posterior regularization for label transfer
Thomas Gueudre, Kevin Martin Jose
Explaining the Effectiveness of Multi-Task Learning for Efficient Knowledge Extraction from Spine MRI Reports
Arijit Sehanobish, McCullen Sandora, Nabila Abraham, Jayashri Pawar, Danielle Torres, Anasuya Das, Murray Becker, Richard Herzog, Benjamin Odry, Ron Vianu
FPI: Failure Point Isolation in Large-scale Conversational Assistants
Rinat Khaziev, Usman Shahid, Tobias Roeding, Rakesh Chada, Emir Kapanci, Pradeep Natarajan
Asynchronous Convergence in Multi-Task Learning via Knowledge Distillation from Converged Tasks
Weiyi Lu, Sunny Rajagopalan, Priyanka Nigam, Jaspreet Singh, Xiaodi Sun, Yi Xu, Belinda Zeng, Trishul Chilimbi
Augmenting Training Data for Massive Semantic Matching Models in Low-Traffic E-commerce Stores
Ashutosh Joshi, Shankar Vishwanath, Choon Hui Teo, Vaclav Petricek, Vishy Vishwanathan, Rahul Bhagat, Jonathan May
Retrieval Based Response Letter Generation For a Customer Care Setting
Biplob Biswas, Renhao Cui, Rajiv Ramnath
Medical Coding with Biomedical Transformer Ensembles and Zero/Few-shot Learning
Angelo Ziletti, Alan Akbik, Christoph Berns, Thomas Herold, Marion Legler, Martina Viell
Knowledge extraction from aeronautical messages (NOTAMs) with self-supervised language models for aircraft pilots
Alexandre Arnold, Fares Ernez, Catherine Kobus, Marion-Cécile Martin
Intent Discovery for Enterprise Virtual Assistants: Applications of Utterance Embedding and Clustering to Intent Mining
Minhua Chen, Badrinath Jayakumar, Michael Johnston, S. Eman Mahmoodi, Daniel Pressel
ReFinED: An Efficient Zero-shot-capable Approach to End-to-End Entity Linking
Tom Ayoola, Shubhi Tyagi, Joseph Fisher, Christos Christodoulopoulos, Andrea Pierleoni
Lightweight Transformers for Conversational AI
Daniel Pressel, Wenshuo Liu, Michael Johnston, Minhua Chen
NER-MQMRC: Formulating Named Entity Recognition as Multi Question Machine Reading Comprehension
Anubhav Shrimal, Avi Jain, Kartik Mehta, Promod Yenigalla
What Do Users Care About? Detecting Actionable Insights from User Feedback
Kasturi Bhattacharjee, Rashmi Gangadharaiah, Kathleen McKeown, Dan Roth
CTM - A Model for Large-Scale Multi-View Tweet Topic Classification
Vivek Kulkarni, Kenny Leung, Aria Haghighi
Developing a Production System for Purpose of Call Detection in Business Phone Conversations
Elena Khasanova, Pooja Hiranandani, Shayna Gardiner, Cheng Chen, Simon Corston-Oliver, Xue-Yong Fu
Adversarial Text Normalization
Joanna Bitton, Maya Pavlova, Ivan Evtimov
Fast Bilingual Grapheme-To-Phoneme Conversion
Hwa-Yeon Kim, Jong-Hwan Kim, Jae-Min Kim
Knowledge Extraction From Texts Based on Wikidata
Anastasia Shimorina, Johannes Heinecke, Frédéric Herledan
AIT-QA: Question Answering Dataset over Complex Tables in the Airline Industry
Yannis Katsis, Saneem Ahmed Chemmengath, vishwajeet kumar, Samarth Bharadwaj, MUSTAFA CANIM, Michael Glass, Alfio Gliozzo, Feifei Pan, Jaydeep Sen, Karthik Sankaranarayanan, Soumen Chakrabarti
Parameter-efficient Continual Learning Framework in Industrial Real-time Text Classification System
Tao Zhu, Zhe Zhao, Weijie Liu, Jiachi Liu, Yiren Chen, Weiquan Mao, Haoyan Liu, Kunbo Ding, Yudong Li, Xuefeng Yang, Kimmo Yan
Self-Aware Feedback-Based Self-Learning in Large-Scale Conversational AI
Pragaash Ponnusamy, Clint Solomon Mathialagan, Gustavo Aguilar, Chengyuan Ma, Chenlei Guo
Fast and Light-Weight Answer Text Retrieval in Dialogue Systems
Hui Wan, Siva Sankalp Patel, J William Murdock, Saloni Potdar, Sachindra Joshi
BLINK with Elasticsearch for Efficient Entity Linking in Business Conversations
Md Tahmid Rahman Laskar, Cheng Chen, Aliaksandr Martsinovich, Jonathan Johnston, Xue-Yong Fu, Shashi Bhushan Tn, Simon Corston-Oliver
Q2R: A Query-to-Resolution System for Natural-Language Queries
Shiau Hong Lim, Laura Wynter
Identifying Corporate Credit Risk Sentiments from Financial News
Noujoud Ahbali, Xinyuan Liu, Albert Aristotle Nanda, Jamie Stark, Ashit Talukder, Rupinder Paul Khandpur