Main Conference Schedule
On this page, you can choose the sessions (and individual papers/posters) of your choice and generate a PDF of your customized schedule. For the best experience, use a non-mobile device with a resolution of at least 1920x1080 and a full-screen browser. For help, simply type “?” while on the page or click on the “Help” button.
The overall schedule structure is final, but the assignment of papers to sessions and order of papers within sessions might still be modified to accommodate the final mode of presentation (virtual or in-person) chosen by the authors.
Regarding Virtual Poster Q&A Sessions: To foster discussion, virtual poster Q&A sessions will be organized into groups that bring together posters on similar themes.
All times are Pacific Daylight Time (GMT-7). Icons: = Session Chair; = Link to PDF on ACL Anthology; = Paper Award.
Jump to: Monday, July 11 Tuesday, July 12 Wednesday, July 13
- Click on the ”+” button or the title of a session to toggle it. Click the “Expand All Sessions ↓” button to expand all sessions in one go. Click again to collapse them.
- To expand parallel sessions simultaneously, Hold Shift and click on any of them.
- Hover over the time for any session to see its day and date as a tooltip.
- Click on a paper or poster to toggle its selection. You can select more than one paper for a time slot.
- Click the “Download PDF” button at the bottom to download your customized PDF.
time | location | info |
---|
: Saadia Gabriel | |
Learning to Transfer Prompts for Text Generation. Junyi Li, Tianyi Tang, Jian-Yun Nie, Ji-Rong Wen, Xin Zhao | |
Long-term Control for Dialogue Generation: Methods and Evaluation. Ramya Ramakrishnan, Hashan Buddhika Narangodage, Mauro Schilman, Kilian Q Weinberger, Ryan McDonald | |
PPL-MCTS: Constrained Textual Generation Through Discriminator-Guided MCTS Decoding. Antoine Chaffin, Vincent Claveau, Ewa Kijak | |
RSTGen: Imbuing Fine-Grained Interpretable Control into Long-FormText Generators. Rilwan Akanni Adewoyin, Ritabrata Dutta, Yulan He | |
Robust (Controlled) Table-to-Text Generation with Structure-Aware Equivariance Learning. Fei Wang, Zhewei Xu, Pedro Szekely, Muhao Chen | |
TRUE: Re-evaluating Factual Consistency Evaluation. Or Honovich, Roee Aharoni, Jonathan Herzig, Hagai Taitelbaum, Doron Kukliansy, Vered Cohen, Thomas Scialom, Idan Szpektor, Avinatan Hassidim, Yossi Matias |
: Jackie Cheung | |
AnswerSumm: A Manually-Curated Dataset and Pipeline for Answer Summarization. Alexander Fabbri, Xiaojian Wu, Srini Iyer, Haoran Li, Mona T. Diab | |
QAFactEval: Improved QA-Based Factual Consistency Evaluation for Summarization. Alexander Fabbri, Chien-Sheng Wu, Wenhao Liu, Caiming Xiong | |
Re-Examining System-Level Correlations of Automatic Summarization Evaluation Metrics. Daniel Deutsch, Rotem Dror, Dan Roth | |
Massive-scale Decoding for Text Generation using Lattices. Jiacheng Xu, Siddhartha Jonnalagadda, Greg Durrett | |
FactGraph: Evaluating Factuality in Summarization with Semantic Graph Representations. Leonardo F. R. Ribeiro, Mengwen Liu, Iryna Gurevych, Markus Dreyer, Mohit Bansal | |
CONFIT: Toward Faithful Dialogue Summarization with Linguistically-Informed Contrastive Fine-tuning. Xiangru Tang, Arjun Nair, Borui Wang, Bingyao Wang, Jai Amit Desai, Aaron Wade, Haoran Li, Asli Celikyilmaz, Yashar Mehdad, Dragomir Radev |
: Alan Ritter | |
An Enhanced Span-based Decomposition Method for Few-Shot Sequence Labeling. Peiyi Wang, Runxin Xu, Tianyu Liu, Qingyu Zhou, Yunbo Cao, Baobao Chang, Zhifang Sui | |
Modeling Multi-Granularity Hierarchical Features for Relation Extraction. Xinnian Liang, Shuangzhi Wu, Mu Li, Zhoujun Li | |
Contrastive Representation Learning for Cross-Document Coreference Resolution of Events and Entities. Benjamin Hsu, Graham Horwood | |
Cross-Lingual Event Detection via Optimized Adversarial Training. Luis Fernando Guzman-Nateras, Minh Van Nguyen, Thien Huu Nguyen | |
Learning to Borrow– Relation Representation for Without-Mention Entity-Pairs for Knowledge Graph Completion. Huda Hakami, Mona Hakami, Angrosh Mandya, Danushka Bollegala | |
[TACL] Ultra-fine Entity Typing with Indirect Supervision from Natural Language Inference. Bangzheng Li, Wenpeng Yin, Muhao Chen |
: Roy Schwarz | |
KroneckerBERT: Significant Compression of Pre-trained Language Models Through Kronecker Decomposition and Knowledge Distillation. Marzieh S. Tahaei, Ella Charlaix, Vahid Partovi Nia, Ali Ghodsi, Mehdi Rezagholizadeh | |
Meta Learning for Natural Language Processing: A Survey. Hung-yi Lee, Shang-Wen Li, Thang Vu | |
On Transferability of Prompt Tuning for Natural Language Processing. Yusheng Su, Xiaozhi Wang, Yujia Qin, Chi-Min Chan, Yankai Lin, Huadong Wang, Kaiyue Wen, Zhiyuan Liu, Peng Li, Juanzi Li, Lei Hou, Maosong Sun, Jie Zhou | |
Sparse Distillation: Speeding Up Text Classification by Using Bigger Student Models. Qinyuan Ye, Madian Khabsa, Mike Lewis, Sinong Wang, Xiang Ren, Aaron Jaech | |
Sketching as a Tool for Understanding and Accelerating Self-attention for Long Sequences. Yifan Chen, Qi Zeng, Dilek Hakkani-Tur, Di Jin, Heng Ji, Yun Yang | |
FNet: Mixing Tokens with Fourier Transforms. James Lee-Thorp, Joshua Ainslie, Ilya Eckstein, Santiago Ontanon |
: Rashmi Gangadharaiah | |
LUNA: Learning Slot-Turn Alignment for Dialogue State Tracking. Yifan Wang, Jing Zhao, Junwei Bao, Chaoqun Duan, Youzheng Wu, Xiaodong He | |
Stylized Knowledge-Grounded Dialogue Generation via Disentangled Template Rewriting. Qingfeng Sun, Can Xu, Huang Hu, Yujing Wang, Jian Miao, Xiubo Geng, Yining Chen, Fei Xu, Daxin Jiang | |
Enhance Incomplete Utterance Restoration by Joint Learning Token Extraction and Text Generation. Shumpei Inoue, Tsungwei Liu, Son Hong Nguyen, Minh-Tien Nguyen | |
Multi2WOZ: A Robust Multilingual Dataset and Conversational Pretraining for Task-Oriented Dialog. Chia-Chien Hung, Anne Lauscher, Ivan Vulić, Simone Paolo Ponzetto, Goran Glavaš | |
[TACL] Reducing conversational agents' overconfidence through linguistic calibration. Mielke, Arthur Szlam, Emily Dinan, Y-Lan Boureau | |
Intent Detection and Discovery from User Logs via Deep Semi-Supervised Contrastive Clustering. Rajat Kumar, Mayur Patidar, VAIBHAV VARSHNEY, Lovekesh Vig, Gautam Shroff |
Computational Social Science and Cultural Analytics |
Political Ideology and Polarization: A Multi-dimensional Approach. Barea Sinno, Bernardo Oviedo, Katherine Atwell, Malihe Alikhani, Junyi Jessy Li |
Counterfactually Augmented Data and Unintended Bias: The Case of Sexism and Hate Speech Detection. Indira Sen, Mattia Samory, Claudia Wagner, Isabelle Augenstein |
Combining Humor and Sarcasm for Improving Political Parody Detection. Xiao Ao, Danae Sanchez Villegas, Daniel Preotiuc-Pietro, Nikolaos Aletras |
Ethics, Bias, and Fairness |
Measuring Fairness with Biased Rulers: A Comparative Study on Bias Metrics for Pre-trained Language Models. Pieter Delobelle, Ewoenam Kwaku Tokpo, Toon Calders, Bettina Berendt |
Recognition of They/Them as Singular Personal Pronouns in Coreference Resolution. Connor Baumler, Rachel Rudinger |
Using Natural Sentence Prompts for Understanding Biases in Language Models. Sarah Alnegheimish, Alicia Guo, Yi Sun |
Human-Centered NLP |
Towards Process-Oriented, Modular, and Versatile Question Generation that Meets Educational Needs. Xu Wang, Simin Fan, Jessica Houghton, Lu Wang |
Machine-in-the-Loop Rewriting for Creative Image Captioning. Vishakh Padmakumar, He He |
What Makes a Good and Useful Summary? Incorporating Users in Automatic Summarization Research. Maartje Ter Hoeve, Julia Kiseleva, Maarten de Rijke |
Information Retrieval and Text Mining |
Seed-Guided Topic Discovery with Out-of-Vocabulary Seeds. Yu Zhang, Yu Meng, Xuan Wang, Sheng Wang, Jiawei Han |
Learning Cross-Lingual IR from an English Retriever. Yulong Li, Martin Franz, Md Arafat Sultan, Bhavani Iyer, Young-Suk Lee, Avirup Sil |
Collective Relevance Labeling for Passage Retrieval. Jihyuk Kim, Minsoo Kim, seung-won hwang |
Improving Neural Models for Radiology Report Retrieval with Lexicon-based Automated Annotation. Luyao Shi, Tanveer Syeda-mahmood, Tyler Baldwin |
Interpretability and Analysis of Models for NLP |
Residue-Based Natural Language Adversarial Attack Detection. Vyas Raina, Mark Gales |
Locally Aggregated Feature Attribution on Natural Language Model Understanding. Sheng Zhang, Jin Wang, Haitao Jiang, Rui Song |
Simple Local Attentions Remain Competitive for Long-Context Tasks. Wenhan Xiong, Barlas Oguz, Anchit Gupta, Xilun Chen, Diana Liskovich, Omer Levy, Scott Yih, Yashar Mehdad |
Reframing Human-AI Collaboration for Generating Free-Text Explanations. Sarah Wiegreffe, Jack Hessel, Swabha Swayamdipta, Mark Riedl, Yejin Choi |
Informativeness and Invariance: Two Perspectives on Spurious Correlations in Natural Language. Jacob Eisenstein |
On the Diversity and Limits of Human Explanations. Chenhao Tan |
Same Neurons, Different Languages: Probing Morphosyntax in Multilingual Pre-trained Models. Karolina Stanczak, Edoardo Ponti, Lucas Torroba Hennigen, Ryan Cotterell, Isabelle Augenstein |
[TACL] Explanation-Based Human Debugging of NLP Models: A Survey. Piyawat Lertvittayakumjorn, Francesca Toni |
Language Grounding to Vision, Robotics and Beyond |
Exposing the Limits of Video-Text Models through Contrast Sets. Jae Sung Park, Sheng Shen, Ali Farhadi, Trevor Darrell, Yejin Choi, Anna Rohrbach |
FOAM: A Follower-aware Speaker Model For Vision-and-Language Navigation. Zi-Yi Dou, Nanyun Peng |
MCSE: Multimodal Contrastive Learning of Sentence Embeddings. Miaoran Zhang, Marius Mosbach, David Ifeoluwa Adelani, Michael A. Hedderich, Dietrich Klakow |
Machine Translation |
Quality-Aware Decoding for Neural Machine Translation. Patrick Fernandes, António Farinhas, Ricardo Rei, José G. C. de Souza, Perez Ogayo, Graham Neubig, Andre Martins |
Cheat Codes to Quantify Missing Source Information in Neural Machine Translation. Proyag Pal, Kenneth Heafield |
Language Model Augmented Monotonic Attention for Simultaneous Translation. Sathish Reddy Indurthi, Mohd Abbas Zaidi, Beomseok Lee, Nikhil Kumar Lakumarapu, Sangha Kim |
Building Multilingual Machine Translation Systems That Serve Arbitrary XY Translations. Akiko Eriguchi, Shufang Xie, Tao Qin, Hany Hassan |
Training Mixed-Domain Translation Models via Federated Learning. Peyman Passban, Tanya Roosta, Rahul Gupta, Ankit Chadha, Clement Chung |
Multilinguality |
Towards Debiasing Translation Artifacts. KOEL DUTTA CHOWDHURY, Rricha Jalota, Cristina España-Bonet, Josef van Genabith |
Pretrained Models for Multilingual Federated Learning. Orion Weller, Marc Marone, Vladimir Braverman, Dawn Lawrie, Benjamin Van Durme |
[CL] Investigating Language Relationships in Multilingual Sentence Encoders through the Lens of Linguistic Typology. Rochelle Choenni, Ekaterina Shutova |
NLP Applications |
DynamicTOC: Persona-based Table of Contents for Consumption of Long Documents. Himanshu Maheshwari, Nethraa Sivakumar, Shelly Jain, Tanvi Karandikar, Vinay Aggarwal, Navita Goyal, Sumit Shekhar |
TWEETSPIN: Fine-grained Propaganda Detection in Social Media Using Multi-View Representations. Prashanth Vijayaraghavan, Soroush Vosoughi |
A Shoulder to Cry on: Towards A Motivational Virtual Assistant for Assuaging Mental Agony. Tulika Saha, Saichethan Miriyala Reddy, Anindya Sundar Das, Sriparna Saha, Pushpak Bhattacharyya |
Cross-document Misinformation Detection based on Event Graph Reasoning. Xueqing Wu, Kung-Hsiang Huang, Yi Fung, Heng Ji |
A Word is Worth A Thousand Dollars: Adversarial Attack on Tweets Fools Stock Prediction. Yong Xie, Dakuo Wang, Pin-Yu Chen, Jinjun Xiong, Sijia Liu, Oluwasanmi O Koyejo |
: Sebastian Schuster | |
ElitePLM: An Empirical Study on General Language Ability Evaluation of Pretrained Language Models. Junyi Li, Tianyi Tang, Zheng Gong, Lixin Yang, Zhuohao Yu, Zhipeng Chen, Jingyuan Wang, Xin Zhao, Ji-Rong Wen | |
When Does Syntax Mediate Neural Language Model Performance? Evidence from Dropout Probes. Mycal Tucker, Tiwalayo Eisape, Peng Qian, Roger P. Levy, Julie Shah | |
ExSum: From Local Explanations to Model Understanding. Yilun Zhou, Marco Tulio Ribeiro, Julie Shah | |
Is "My Favorite New Movie" My Favorite Movie? Probing the Understanding of Recursive Noun Phrases. Qing Lyu, Hua Zheng, Daoxin Li, Li Zhang, Marianna Apidianaki, Chris Callison-Burch | |
Lifelong Pretraining: Continually Adapting Language Models to Emerging Corpora. Xisen Jin, Dejiao Zhang, Henghui Zhu, Wei Xiao, Shang-Wen Li, Xiaokai Wei, Andrew Arnold, Xiang Ren | |
Even the Simplest Baseline Needs Careful Re-investigation: A Case Study on XML-CNN. Si-An Chen, JIE-JYUN LIU, Tsung-Han Yang, Hsuan-Tien Lin, Chih-Jen Lin |
: Jacob Eisenstein | |
Testing the Ability of Language Models to Interpret Figurative Language. Emmy Liu, Chenxuan Cui, Kenneth Zheng, Graham Neubig | |
Compositional Task-Oriented Parsing as Abstractive Question Answering. Wenting Zhao, Konstantine Arkoudas, Weiqi Sun, Claire Cardie | |
Linguistic Frameworks Go Toe-to-Toe at Neuro-Symbolic Language Modeling. Jakob Prange, Nathan Schneider, Lingpeng Kong | |
Improving Compositional Generalization with Latent Structure and Data Augmentation. Linlu Qiu, Peter Shaw, Panupong Pasupat, Pawel Krzysztof Nowak, Tal Linzen, Fei Sha, Kristina Toutanova | |
DiffCSE: Difference-based Contrastive Learning for Sentence Embeddings. Yung-Sung Chuang, Rumen Dangovski, Hongyin Luo, Yang Zhang, Shiyu Chang, Marin Soljacic, Shang-Wen Li, Scott Yih, Yoon Kim, James R. Glass | |
Bilingual Tabular Inference: A Case Study on Indic Languages. Chaitanya Agarwal, Vivek Gupta, Anoop Kunchukuttan, Manish Shrivastava |
: Waleed Ammar | |
Bidimensional Leaderboards: Generate and Evaluate Language Hand in Hand. Jungo Kasai, Keisuke Sakaguchi, Ronan Le Bras, Lavinia Dunagan, Jacob Daniel Morrison, Alexander Fabbri, Yejin Choi, Noah Smith | |
CORWA: A Citation-Oriented Related Work Annotation Dataset. Xiangci Li, Biswadip Mandal, Jessica Ouyang | |
Shedding New Light on the Language of the Dark Web. Youngjin Jin, Eugene Jang, Yongjae Lee, Seungwon Shin, Jin-Woo Chung | |
A Few Thousand Translations Go a Long Way! Leveraging Pre-trained Models for African News Translation. David Ifeoluwa Adelani, Jesujoba Oluwadara Alabi, Angela Fan, Julia Kreutzer, Xiaoyu Shen, Machel Reid, Dana Ruiter, Dietrich Klakow, Peter Nabende, Ernie Chang, Tajuddeen Gwadabe, Freshia Sackey, Bonaventure F. P. Dossou, Chris Chinenye Emezue, Colin Leong, Michael Beukman, Shamsuddeen Hassan Muhammad, Guyo Dub Jarso, Oreen Yousuf, Andre Niyongabo Rubungo, Gilles HACHEME, Eric Peter Wairagala, Muhammad Umair Nasir, Benjamin Ayoade Ajibade, Tunde Oluwaseyi Ajayi, Yvonne Wambui Gitau, Jade Abbott, Mohamed Ahmed, Millicent Ochieng, Anuoluwapo Aremu, Perez Ogayo, Jonathan Mukiibi, Fatoumata Ouoba Kabore, Godson Koffi KALIPE, Derguene Mbaye, Allahsera Auguste Tapo, Victoire Memdjokam Koagne, Edwin Munkoh-Buabeng, Valencia Wagner, Idris Abdulmumin, Ayodele Awokoya, Happy Buzaaba, Blessing Sibanda, Andiswa Bukula, Sam Manthalu | |
Does Summary Evaluation Survive Translation to Other Languages?. Spencer Braun, Oleg Vasilyev, Neslihan Iskender, John Bohannon | |
DISAPERE: A Dataset for Discourse Structure in Peer Review Discussions. Neha Nayak Kennard, Tim O'Gorman, Rajarshi Das, Akshay Sharma, Chhandak Bagchi, Matthew Clinton, Pranay Kumar Yelugam, Hamed Zamani, Andrew McCallum |
: Jonathan May | |
Original or Translated? A Causal Analysis of the Impact of Translationese on Machine Translation Performance. Jingwei Ni, Zhijing Jin, Markus Freitag, Mrinmaya Sachan, Bernhard Schölkopf | |
On Systematic Style Differences between Unsupervised and Supervised MT and an Application for High-Resource Machine Translation. Kelly Marchisio, Markus Freitag, David Grangier | |
The Devil is in the Details: On the Pitfalls of Vocabulary Selection in Neural Machine Translation. Tobias Domhan, Eva Hasler, Ke Tran, Sony Trenous, Bill Byrne, Felix Hieber | |
BlonDe: An Automatic Evaluation Metric for Document-level Machine Translation. Yuchen Eleanor Jiang, Tianyu Liu, Shuming Ma, Dongdong Zhang, Jian Yang, Haoyang Huang, Rico Sennrich, Ryan Cotterell, Mrinmaya Sachan, Ming Zhou | |
Non-Autoregressive Machine Translation: It's Not as Fast as it Seems. Jindřich Helcl, Barry Haddow, Alexandra Birch | |
[TACL] High Quality Rather than High Model Probability: Minimum Bayes Risk Decoding with Neural Metrics. Markus Freitag, David Grangier, Qijun Tan, Bowen Liang |
: Preslav Nakov | |
ScAN: Suicide Attempt and Ideation Events Dataset. Bhanu Pratap Singh Rawat, Samuel Kovaly, Hong Yu, Wilfred Pigeon | |
DUCK: Rumour Detection on Social Media by Modelling User and Comment Propagation Networks. LIN TIAN, Xiuzhen Zhang, Jey Han Lau | |
Early Rumor Detection Using Neural Hawkes Process with a New Benchmark Dataset. Fengzhu ZENG, Wei Gao | |
Frustratingly Easy System Combination for Grammatical Error Correction. Muhammad Reza Qorib, Seung-Hoon Na, Hwee Tou Ng | |
On the Use of Bert for Automated Essay Scoring: Joint Learning of Multi-Scale Essay Representation. Yongjie Wang, Chuan Wang, Ruobing Li, Hui Lin | |
KCD: Knowledge Walks and Textual Cues Enhanced Political Perspective Detection in News Media. Wenqian Zhang, Shangbin Feng, Zilong Chen, Zhenyu Lei, Jundong Li, Minnan Luo |
Dialogue and Interactive Systems |
Towards a Progression-Aware Autonomous Dialogue Agent. Abraham Sanders, Tomek Strzalkowski, Mei Si, Albert Chang, Deepanshu Dey, Jonas Braasch, Dakuo Wang |
Building a Role Specified Open-Domain Dialogue System Leveraging Large-Scale Language Models. Sanghwan Bae, Donghyun Kwak, Sungdong Kim, Donghoon Ham, Soyoung Kang, Sang-Woo Lee, Woomyoung Park |
Emp-RFT: Empathetic Response Generation via Recognizing Feature Transitions between Utterances. Wongyu Kim, Youbin Ahn, Donghyun Kim, Kyong-Ho Lee |
Database Search Results Disambiguation for Task-Oriented Dialog Systems. Kun Qian, Satwik Kottur, Ahmad Beirami, Shahin Shayandeh, Paul A. Crook, Alborz Geramifard, Zhou Yu, Chinnadhurai Sankar |
Learning Dialogue Representations from Consecutive Utterances. Zhihan Zhou, Dejiao Zhang, Wei Xiao, Nicholas Dingwall, Xiaofei Ma, Andrew Arnold, Bing Xiang |
Knowledge-Grounded Dialogue Generation with a Unified Knowledge Representation. Yu Li, Baolin Peng, yelong shen, Yi Mao, Lars Liden, Zhou Yu, Jianfeng Gao |
On the Origin of Hallucinations in Conversational Models: Is it the Datasets or the Models?. Nouha Dziri, Sivan Milton, Mo Yu, Osmar Zaiane, Siva Reddy |
Show, Don't Tell: Demonstrations Outperform Descriptions for Schema-Guided Task-Oriented Dialogue. Raghav Gupta, Harrison Lee, Jeffrey Zhao, Yuan Cao, Abhinav Rastogi, Yonghui Wu |
Generating Repetitions with Appropriate Repeated Words. Toshiki Kawamoto, Hidetaka Kamigaito, Kotaro Funakoshi, Manabu Okumura |
ErAConD: Error Annotated Conversational Dialog Dataset for Grammatical Error Correction. Xun Yuan, Derek Pham, Sam Davidson, Zhou Yu |
Meet Your Favorite Character: Open-domain Chatbot Mimicking Fictional Characters with only a Few Utterances. Seungju Han, Beomsu Kim, Jin Yong Yoo, Seokjun Seo, Sangbum Kim, Enkhbayar Erdenee, Buru Chang |
Discourse and Pragmatics |
Incorporating Centering Theory into Neural Coreference Resolution. Haixia Chai, Michael Strube |
Efficient Methods in NLP |
LEA: Meta Knowledge-Driven Self-Attentive Document Embedding for Few-Shot Text Classification. Seungki Hong, Tae Young Jang |
Information Extraction |
Event Schema Induction with Double Graph Autoencoders. Xiaomeng Jin, Manling Li, Heng Ji |
Unified Semantic Typing with Meaningful Label Inference. James Y. Huang, Bangzheng Li, Jiashu Xu, Muhao Chen |
Crossroads, Buildings and Neighborhoods: A Dataset for Fine-grained Location Recognition. Pei Chen, Haotian Xu, Cheng Zhang, Ruihong Huang |
CompactIE: Compact Facts in Open Information Extraction. Farima Fatahi Bayat, Nikita Bhutani, H. Jagadish |
Modeling Task Interactions in Document-Level Joint Entity and Relation Extraction. Liyan Xu, Jinho D. Choi |
Language Generation |
Go Back in Time: Generating Flashbacks in Stories with Event Temporal Prompts. Rujun Han, Hong Chen, Yufei Tian, Nanyun Peng |
Cross-Domain Detection of GPT-2-Generated Technical Text. Juan Diego Rodriguez, Todd Hay, David Gros, Zain Shamsi, Ravi Srinivasan |
Learning to Selectively Learn for Weakly Supervised Paraphrase Generation with Model-based Reinforcement Learning. Haiyan Yin, Dingcheng Li, Ping Li |
AmbiPun: Generating Humorous Puns with Ambiguous Context. Anirudh Mittal, Yufei Tian, Nanyun Peng |
Machine Learning for NLP: Classification and Structured Prediction Models |
Inducing and Using Alignments for Transition-based AMR Parsing. Andrew Drozdov, Jiawei Zhou, Radu Florian, Andrew McCallum, Tahira Naseem, Yoon Kim, Ramon Fernandez Astudillo |
Consistency Training with Virtual Adversarial Discrete Perturbation. Jungsoo Park, Gyuwan Kim, Jaewoo Kang |
Contrastive Learning for Prompt-based Few-shot Language Learners. Yiren Jian, Chongyang Gao, Soroush Vosoughi |
Embedding Hallucination for Few-shot Language Fine-tuning. Yiren Jian, Chongyang Gao, Soroush Vosoughi |
Exploring the Role of Task Transferability in Large-Scale Multi-Task Learning. Vishakh Padmakumar, Leonard Lausen, Miguel Ballesteros, Sheng Zha, He He, George Karypis |
Machine Learning for NLP: Language Modeling and Sequence to Sequence Models |
Efficient Hierarchical Domain Adaptation for Pretrained Language Models. Alexandra Chronopoulou, Matthew E Peters, Jesse Dodge |
Learning Natural Language Generation with Truncated Reinforcement Learning. Alice Martin, Guillaume Quispe, Charles Ollion, Sylvain Le Corff, Florian Strub, Olivier Pietquin |
On the Effect of Pretraining Corpora on In-context Learning by a Large-scale Language Model. Seongjin Shin, Sang-Woo Lee, Hwijeen Ahn, Sungdong Kim, HyoungSeok Kim, Boseop Kim, Kyunghyun Cho, Gichang Lee, Woomyoung Park, Jung-Woo Ha, Nako Sung |
Learning to Generate Examples for Semantic Processing Tasks. Danilo Croce, Simone Filice, Giuseppe Castellucci, Roberto Basili |
PARADISE: Exploiting Parallel Data for Multilingual Sequence-to-Sequence Pretraining. Machel Reid, Mikel Artetxe |
On Curriculum Learning for Commonsense Reasoning. Adyasha Maharana, Mohit Bansal |
Summarization |
Improving Multi-Document Summarization through Referenced Flexible Extraction with Credit-Awareness. Yun-Zhu Song, Yi-Syuan Chen, Hong-Han Shuai |
TSTR: Too Short to Represent, Summarize with Details! Intro-Guided Extended Summary Generation. Sajad Sotudeh, Nazli Goharian |
SueNes: A Weakly Supervised Approach to Evaluating Single-Document Summarization via Negative Sampling. Forrest Sheng Bao, Ge Luo, Hebi Li, Minghui Qiu, Yinfei Yang, Youbiao He, Cen Chen |
Investigating Crowdsourcing Protocols for Evaluating the Factual Consistency of Summaries. Xiangru Tang, Alexander Fabbri, Haoran Li, Ziming Mao, Griffin Thomas Adams, Borui Wang, Asli Celikyilmaz, Yashar Mehdad, Dragomir Radev |
Syntax: Tagging, Chunking, and Parsing |
Sort by Structure: Language Model Ranking as Dependency Probing. Max Müller-Eberstein, Rob van der Goot, Barbara Plank |
[CL] The Impact of Edge Displacement Vaserstein Distance on UD Parsing Performance. Mark Anderson, Carlos Gómez-Rodríguez |
User-Driven Research of Medical Note Generation Software. Tom Knoll, Francesco Moramarco, Alex Papadopoulos Korfiatis, Rachel Young, Claudia Ruffini, Mark Perera, Christian Perstl, Ehud Reiter, Anya Belz, Aleksandar Savkov | |
Automatic Correction of Human Translations. Jessy Lin, Geza Kovacs, Aditya Shastry, Joern Wuebker, John DeNero | |
NeuroLogic A*esque Decoding: Constrained Text Generation with Lookahead Heuristics. Ximing Lu, Sean Welleck, Peter West, Liwei Jiang, Jungo Kasai, Daniel Khashabi, Ronan Le Bras, Lianhui Qin, Youngjae Yu, Rowan Zellers, Noah Smith, Yejin Choi |
: Nanyun Peng | |
Low Resource Style Transfer via Domain Adaptive Meta Learning. Xiangyang Li, Xiang Long, Yu Xia, Sujian Li | |
Don't Take It Literally: An Edit-Invariant Sequence Loss for Text Generation. Guangyi Liu, Zichao Yang, Tianhua Tao, Xiaodan Liang, Junwei Bao, Zhen Li, Xiaodong He, Shuguang Cui, Zhiting Hu | |
Towards Robust and Semantically Organised Latent Representations for Unsupervised Text Style Transfer. Sharan Narasimhan, Suvodip Dey, Maunendra Sankar Desarkar | |
MOVER: Mask, Over-generate and Rank for Hyperbole Generation. Yunxiang Zhang, Xiaojun Wan |
: Eduardo Blanco | |
[TACL] Fact Checking with Insufficient Evidence. Pepa Atanasova, Jakob Grue Simonsen, Christina Lioma, Isabelle Augenstein | |
A Double-Graph Based Framework for Frame Semantic Parsing. Ce Zheng, Xudong Chen, Runxin Xu, Baobao Chang | |
Identifying Implicitly Abusive Remarks about Identity Groups using a Linguistically Informed Approach. Michael Wiegand, Elisabeth Eder, Josef Ruppenhofer | |
Disentangled Learning of Stance and Aspect Topics for Vaccine Attitude Detection in Social Media. Lixing Zhu, Zheng Fang, Gabriele Pergola, Robert Procter, Yulan He |
: Yogarshi Vyas | |
MuCGEC: a Multi-Reference Multi-Source Evaluation Dataset for Chinese Grammatical Error Correction. Yue Zhang, Zhenghua Li, Zuyi Bao, Jiacheng Li, Bo Zhang, Chen Li, Fei Huang, Min Zhang | |
A Corpus for Understanding and Generating Moral Stories. Jian Guan, Ziqi Liu, Minlie Huang | |
End-to-End Chinese Speaker Identification. Dian Yu, Ben Zhou, Dong Yu | |
WALNUT: A Benchmark on Semi-weakly Supervised Learning for Natural Language Understanding. Guoqing Zheng, Giannis Karamanolakis, Kai Shu, Ahmed Hassan Awadallah |
: Emma Strubell | |
Learning to Win Lottery Tickets in BERT Transfer via Task-agnostic Mask Training. Yuanxin Liu, Fandong Meng, Zheng Lin, Peng Fu, Yanan Cao, Weiping Wang, Jie Zhou | |
Knowledge Inheritance for Pre-trained Language Models. Yujia Qin, Yankai Lin, Jing Yi, Jiajie Zhang, Xu Han, Zhengyan Zhang, Yusheng Su, Zhiyuan Liu, Peng Li, Maosong Sun, Jie Zhou | |
Towards Efficient NLP: A Standard Evaluation and A Strong Baseline. Xiangyang Liu, Tianxiang Sun, JunLiang He, Jiawen Wu, Lingling Wu, Xinyu Zhang, Hao Jiang, Zhao Cao, Xuanjing Huang, Xipeng Qiu | |
Adaptable Adapters. Nafise Sadat Moosavi, Quentin Delfosse, Kristian Kersting, Iryna Gurevych |
: Danqi Chen | |
CHEF: A Pilot Chinese Dataset for Evidence-Based Fact-Checking. Xuming Hu, Zhijiang Guo, GuanYu Wu, Aiwei Liu, Lijie Wen, Philip S. Yu | |
Progressive Class Semantic Matching for Semi-supervised Text Classification. Haiming Xu, Lingqiao Liu, Ehsan M Abbasnejad | |
Unsupervised Paraphrasability Prediction for Compound Nominalizations. John Sie Yuen Lee, Ho Hung Lim, Carol Webster | |
Dual-Channel Evidence Fusion for Fact Verification over Texts and Tables. Nan Hu, Zirui Wu, Yuxuan Lai, Xiao Liu, Yansong Feng |
: Kasturi Bhattacharjee | |
CREATER: CTR-driven Advertising Text Generation with Controlled Pre-Training and Contrastive Fine-Tuning. Penghui Wei, Xuanhua Yang, ShaoGuo Liu, Liang Wang, Bo Zheng | |
Augmenting Poetry Composition with Verse by Verse. David Uthus, Maria Voitovich, R.J. Mical | |
FPI: Failure Point Isolation in Large-scale Conversational Assistants. Rinat Khaziev, Usman Shahid, Tobias Roeding, Rakesh Chada, Emir Kapanci, Pradeep Natarajan | |
ReFinED: An Efficient Zero-shot-capable Approach to End-to-End Entity Linking. Tom Ayoola, Shubhi Tyagi, Joseph Fisher, Christos Christodoulopoulos, Andrea Pierleoni |
Dialogue and Interactive Systems |
[SRW] Explicit Use of Topicality in Dialogue Response Generation. Takumi Yoshikoshi, Hayato Atarashi, Takashi Kodama, Sadao Kurohashi |
[SRW] Automating Human Evaluation of Dialogue Systems. Sujan Reddy A |
Generating Repetitions with Appropriate Repeated Words. Toshiki Kawamoto, Hidetaka Kamigaito, Kotaro Funakoshi, Manabu Okumura |
EmpHi: Generating Empathetic Responses with Human-like Intents. MAO YAN CHEN, Siheng Li, Yujiu Yang |
Towards a Progression-Aware Autonomous Dialogue Agent. Abraham Sanders, Tomek Strzalkowski, Mei Si, Albert Chang, Deepanshu Dey, Jonas Braasch, Dakuo Wang |
Representation Learning for Conversational Data using Discourse Mutual Information Maximization. Bishal Santra, Sumegh Roychowdhury, Aishik Mandal, Vasu Gurram, Atharva Naik, Manish Gupta, Pawan Goyal |
D2U: Distance-to-Uniform Learning for Out-of-Scope Detection. Eyup Halit Yilmaz, Cagri Toraman |
Building a Role Specified Open-Domain Dialogue System Leveraging Large-Scale Language Models. Sanghwan Bae, Donghyun Kwak, Sungdong Kim, Donghoon Ham, Soyoung Kang, Sang-Woo Lee, Woomyoung Park |
Revisit Overconfidence for OOD Detection: Reassigned Contrastive Learning with Adaptive Class-dependent Threshold. Yanan Wu, Keqing He, Yuanmeng Yan, QiXiang Gao, Zhiyuan Zeng, Fujia Zheng, Lulu Zhao, Huixing Jiang, Wei Wu, Weiran Xu |
AISFG: Abundant Information Slot Filling Generator. Yang Yan, Junda Ye, Zhongbao Zhang, Liwen Wang |
Mining Clues from Incomplete Utterance: A Query-enhanced Network for Incomplete Utterance Rewriting. Shuzheng Si, Shuang Zeng, Baobao Chang |
Meet Your Favorite Character: Open-domain Chatbot Mimicking Fictional Characters with only a Few Utterances. Seungju Han, Beomsu Kim, Jin Yong Yoo, Seokjun Seo, Sangbum Kim, Enkhbayar Erdenee, Buru Chang |
[Findings] Learning to Execute Actions or Ask Clarification Questions. Zhengxiang Shi, Yue Feng, Aldo Lipani |
[Findings] BORT: Back and Denoising Reconstruction for End-to-End Task-Oriented Dialog. Haipeng Sun, Junwei Bao, Youzheng Wu, Xiaodong He |
[Findings] Am I Me or You? State-of-the-Art Dialogue Models Cannot Maintain an Identity. Kurt Shuster, Jack Urbanek, Arthur Szlam, Jason E Weston |
[Findings] DialoKG: Knowledge-Structure Aware Task-Oriented Dialogue Generation. Md Rashad Al Hasan Rony, Ricardo Usbeck, Jens Lehmann |
[Findings] Zero-shot Cross-lingual Conversational Semantic Role Labeling. Han Wu, Haochen Tan, Kun Xu, Shuqi LIU, Lianwei Wu, Linqi Song |
[Findings] A Framework to Generate High-Quality Datapoints for Multiple Novel Intent Detection. Ankan Mullick, Sukannya Purkayastha, Pawan Goyal, Niloy Ganguly |
[Findings] Anti-Overestimation Dialogue Policy Learning for Task-Completion Dialogue System. Chang Tian, Wenpeng Yin, Marie-Francine Moens |
[Findings] Prompt Augmented Generative Replay via Supervised Contrastive Learning for Lifelong Intent Detection. VAIBHAV VARSHNEY, Mayur Patidar, Rajat Kumar, Lovekesh Vig, Gautam Shroff |
[Findings] NLU++: A Multi-Label, Slot-Rich, Generalisable Dataset for Natural Language Understanding in Task-Oriented Dialogue. Inigo Casanueva, Ivan Vulić, Georgios P. Spithourakis, Paweł Budzianowski |
ErAConD: Error Annotated Conversational Dialog Dataset for Grammatical Error Correction. Xun Yuan, Derek Pham, Sam Davidson, Zhou Yu |
[Findings] Improving Conversational Recommendation Systems’ Quality with Context-Aware Item Meta-Information. Bowen Yang, Cong Han, Yu Li, Lei Zuo, Zhou Yu |
Efficient Methods in NLP |
LEA: Meta Knowledge-Driven Self-Attentive Document Embedding for Few-Shot Text Classification. Seungki Hong, Tae Young Jang |
Information Extraction |
Relation-Specific Attentions over Entity Mentions for Enhanced Document-Level Relation Extraction. Jiaxin Yu, Deqing Yang, Shuyu Tian |
Hero-Gang Neural Model For Named Entity Recognition. Jinpeng Hu, Yaling Shen, Yang Liu, Xiang Wan, Tsung-Hui Chang |
Modal Dependency Parsing via Language Model Priming. Jiarui Yao, Nianwen Xue, Bonan Min |
Document-Level Event Argument Extraction by Leveraging Redundant Information and Closed Boundary Loss. Hanzhang Zhou, Kezhi Mao |
Global Entity Disambiguation with BERT. Ikuya Yamada, Koki Washio, Hiroyuki Shindo, Yuji Matsumoto |
Does it Really Generalize Well on Unseen Data? Systematic Evaluation of Relational Triple Extraction Methods. Juhyuk Lee, Min-Joong Lee, June Yong Yang, Eunho Yang |
Generative Biomedical Entity Linking via Knowledge Base-Guided Pre-training and Synonyms-Aware Fine-tuning. Hongyi Yuan, Zheng Yuan, Sheng Yu |
RAAT: Relation-Augmented Attention Transformer for Relation Modeling in Document-Level Event Extraction. Yuan Liang, Zhuoxuan Jiang, di yin, Bo Ren |
[Findings] Improving Few-Shot Relation Classification by Prototypical Representation Learning with Definition Text. Li Zhenzhen, Yuyang Zhang, Jian-Yun Nie, Dongsheng Li |
[Findings] Dependency Position Encoding for Relation Extraction. Qiushi Guo, Xin Wang, Dehong Gao |
[Findings] XLTime: A Cross-Lingual Knowledge Transfer Framework for Temporal Expression Extraction. Yuwei Cao, William Groves, Tanay Kumar Saha, Joel R. Tetreault, Alex Jaimes, Hao Peng, Philip S. Yu |
[Findings] A Label-Aware Autoregressive Framework for Cross-Domain NER. Jinpeng Hu, He Zhao, Dan dan Guo, Xiang Wan, Tsung-Hui Chang |
[Findings] Learning Discriminative Representations for Open Relation Extraction with Instance Ranking and Label Calibration. Shusen Wang, Bin Duan, Yanan Wu, Yajing Xu |
[Findings] RCL: Relation Contrastive Learning for Zero-Shot Relation Extraction. Shusen Wang, Bosen Zhang, Yajing Xu, Yanan Wu, Bo Xiao |
[Findings] Zero-Shot Event Detection Based on Ordered Contrastive Learning and Prompt-Based Prediction. Senhui Zhang, Tao Ji, Wendi Ji, Xiaoling Wang |
[Findings] Minimally-Supervised Relation Induction from Pre-trained Language Model. Lu Sun, Yongliang Shen, Weiming Lu |
[Findings] Learn from Relation Information: Towards Prototype Representation Rectification for Few-Shot Relation Extraction. Yang Liu, Jinpeng Hu, Xiang Wan, Tsung-Hui Chang |
[Findings] Delving Deep into Regularity: A Simple but Effective Method for Chinese Named Entity Recognition. Yingjie Gu, Xiaoye Qu, Zhefeng Wang, Yi ZHENG, Baoxing Huai, Nicholas Jing Yuan |
Event Schema Induction with Double Graph Autoencoders. Xiaomeng Jin, Manling Li, Heng Ji |
Information Retrieval and Text Mining |
SKILL: Structured Knowledge Infusion for Large Language Models. Fedor Moiseev, Zhe Dong, Enrique Alfonseca, Martin Jaggi |
Collective Relevance Labeling for Passage Retrieval. Jihyuk Kim, Minsoo Kim, seung-won hwang |
[Findings] Domain-matched Pre-training Tasks for Dense Retrieval. Barlas Oguz, Kushal Lakhotia, Anchit Gupta, Patrick Lewis, Vladimir Karpukhin, Aleksandra Piktus, Xilun Chen, Sebastian Riedel, Scott Yih, Sonal Gupta, Yashar Mehdad |
[Findings] CL-ReLKT: Cross-lingual Language Knowledge Transfer for Multilingual Retrieval Question Answering. Peerat Limkonchotiwat, Wuttikorn Ponwitayarat, Can Udomcharoenchaikit, Ekapol Chuangsuwanich, Sarana Nutanong |
[Findings] Weakly Supervised Text Classification using Supervision Signals from a Language Model. Ziqian Zeng, Weimin Ni, Tianqing Fang, Xiang Li, Xinran Zhao, Yangqiu Song |
Learning Cross-Lingual IR from an English Retriever. Yulong Li, Martin Franz, Md Arafat Sultan, Bhavani Iyer, Young-Suk Lee, Avirup Sil |
Interpretability and Analysis of Models for NLP |
[SRW] Probe-Less Probing of BERT's Layer-Wise Linguistic Knowledge with Masked Word Prediction. Tatsuya Aoyama, Nathan Schneider |
Same Neurons, Different Languages: Probing Morphosyntax in Multilingual Pre-trained Models. Karolina Stanczak, Edoardo Ponti, Lucas Torroba Hennigen, Ryan Cotterell, Isabelle Augenstein |
How Conservative are Language Models? Adapting to the Introduction of Gender-Neutral Pronouns. Stephanie Brandl, Ruixiang Cui, Anders Søgaard |
Residue-Based Natural Language Adversarial Attack Detection. Vyas Raina, Mark Gales |
Models In a Spelling Bee: Language Models Implicitly Learn the Character Composition of Tokens. Itay Itzhak, Omer Levy |
[Findings] Phrase-level Textual Adversarial Attack with Label Preservation. Yibin Lei, Yu Cao, Dianqi Li, Tianyi Zhou, Meng Fang, Mykola Pechenizkiy |
[Findings] Specializing Pre-trained Language Models for Better Relational Reasoning via Network Pruning. Siyu Ren, Kenny Q. Zhu |
Language Generation |
AmbiPun: Generating Humorous Puns with Ambiguous Context. Anirudh Mittal, Yufei Tian, Nanyun Peng |
Learning to Selectively Learn for Weakly Supervised Paraphrase Generation with Model-based Reinforcement Learning. Haiyan Yin, Dingcheng Li, Ping Li |
Linguistic Theories, Cognitive Modeling and Psycholinguistics |
Abstraction not Memory: BERT and the English Article System. Harish Tayyar Madabushi, Dagmar Divjak, Petar Milin |
Machine Learning for NLP: Classification and Structured Prediction Models |
Inducing and Using Alignments for Transition-based AMR Parsing. Andrew Drozdov, Jiawei Zhou, Radu Florian, Andrew McCallum, Tahira Naseem, Yoon Kim, Ramon Fernandez Astudillo |
Label Anchored Contrastive Learning for Language Understanding. Zhenyu Zhang, Yuming Zhao, Meng Chen, Xiaodong He |
Improving Constituent Representation with Hypertree Neural Networks. Hao Zhou, Gongshen Liu, Kewei Tu |
Generic and Trend-aware Curriculum Learning for Relation Extraction. Nidhi Vakil, Hadi Amiri |
On the Effectiveness of Sentence Encoding for Intent Detection Meta-Learning. Tingting Ma, Qianhui Wu, Zhiwei Yu, Tiejun Zhao, Chin-Yew Lin |
A Data Cartography based MixUp for Pre-trained Language Models. Seo Yeon Park, Cornelia Caragea |
Embedding Hallucination for Few-shot Language Fine-tuning. Yiren Jian, Chongyang Gao, Soroush Vosoughi |
Contrastive Learning for Prompt-based Few-shot Language Learners. Yiren Jian, Chongyang Gao, Soroush Vosoughi |
Consistency Training with Virtual Adversarial Discrete Perturbation. Jungsoo Park, Gyuwan Kim, Jaewoo Kang |
Embarrassingly Simple Performance Prediction for Abductive Natural Language Inference. Emīls Kadiķis, Vaibhav Srivastav, Roman Klinger |
Exploring the Role of Task Transferability in Large-Scale Multi-Task Learning. Vishakh Padmakumar, Leonard Lausen, Miguel Ballesteros, Sheng Zha, He He, George Karypis |
Machine Learning for NLP: Language Modeling and Sequence to Sequence Models |
[SRW] Regularized Training of Nearest Neighbor Language Models. Jean-Francois Ton, Walter Talbott, Shuangfei Zhai, Joshua M. Susskind |
On Curriculum Learning for Commonsense Reasoning. Adyasha Maharana, Mohit Bansal |
Efficient Hierarchical Domain Adaptation for Pretrained Language Models. Alexandra Chronopoulou, Matthew E Peters, Jesse Dodge |
Improving In-Context Few-Shot Learning via Self-Supervised Training. Mingda Chen, Jingfei Du, Ramakanth Pasunuru, Todor Mihaylov, Srini Iyer, Veselin Stoyanov, Zornitsa Kozareva |
Learning to Generate Examples for Semantic Processing Tasks. Danilo Croce, Simone Filice, Giuseppe Castellucci, Roberto Basili |
On the Effect of Pretraining Corpora on In-context Learning by a Large-scale Language Model. Seongjin Shin, Sang-Woo Lee, Hwijeen Ahn, Sungdong Kim, HyoungSeok Kim, Boseop Kim, Kyunghyun Cho, Gichang Lee, Woomyoung Park, Jung-Woo Ha, Nako Sung |
[Findings] A Generative Language Model for Few-shot Aspect-Based Sentiment Analysis. Ehsan Hosseini-Asl, Wenhao Liu, Caiming Xiong |
[Findings] RGL: A Simple yet Effective Relation Graph Augmented Prompt-based Tuning Approach for Few-Shot Learning. Yaqing Wang, Xin Tian, Haoyi Xiong, Yueyang Li, Zeyu Chen, Sheng Guo, Dejing Dou |
[Findings] Speeding Up Entmax. Maxat Tezekbayev, Vassilina Nikoulina, Matthias Gallé, Zhenisbek Assylbekov |
[Findings] MixQG: Neural Question Generation with Mixed Answer Types. Lidiya Murakhovs'ka, Chien-Sheng Wu, Philippe Laban, Tong Niu, Wenhao Liu, Caiming Xiong |
[Findings] SeaD: End-to-end Text-to-SQL Generation with Schema-aware Denoising. Kuan Xu, Yongbo Wang, Yongliang Wang, Zihao Wang, Zujie Wen, Yang Dong |
[Findings] DecBERT: Enhancing the Language Understanding of BERT with Causal Attention Masks. Ziyang Luo, Yadong Xi, Jing Ma, Zhiwei Yang, Xiaoxi Mao, Changjie Fan, Rongsheng Zhang |
Learning Natural Language Generation with Truncated Reinforcement Learning. Alice Martin, Guillaume Quispe, Charles Ollion, Sylvain Le Corff, Florian Strub, Olivier Pietquin |
PARADISE: Exploiting Parallel Data for Multilingual Sequence-to-Sequence Pretraining. Machel Reid, Mikel Artetxe |
Machine Translation |
[SRW] Analysing the Correlation between Lexical Ambiguity and Translation Quality in a Multimodal Setting using WordNet. Ali Hatami, Paul Buitelaar, Mihael Arcan |
Language Model Augmented Monotonic Attention for Simultaneous Translation. Sathish Reddy Indurthi, Mohd Abbas Zaidi, Beomseok Lee, Nikhil Kumar Lakumarapu, Sangha Kim |
On Synthetic Data for Back Translation. Jiahao Xu, Yubin Ruan, Wei Bi, Guoping Huang, Shuming Shi, Lihui Chen, Lemao Liu |
Non-Autoregressive Neural Machine Translation with Consistency Regularization Optimized Variational Framework. Minghao Zhu, Junli Wang, Chungang Yan |
Cheat Codes to Quantify Missing Source Information in Neural Machine Translation. Proyag Pal, Kenneth Heafield |
Training Mixed-Domain Translation Models via Federated Learning. Peyman Passban, Tanya Roosta, Rahul Gupta, Ankit Chadha, Clement Chung |
Bi-SimCut: A Simple Strategy for Boosting Neural Machine Translation. Pengzhi Gao, Zhongjun He, Hua Wu, Haifeng Wang |
[Findings] Latent Group Dropout for Multilingual and Multidomain Machine Translation. Minh-Quang PHAM, François Yvon, Josep Crego |
[Findings] Bridging the Gap between Training and Inference: Multi-Candidate Optimization for Diverse Neural Machine Translation. Huan Lin, Baosong Yang, Liang Yao, Dayiheng Liu, Haibo Zhang, jun xie, Min Zhang, Jinsong Su |
Semantics: Sentence-level Semantics and Textual Inference |
Batch-Softmax Contrastive Loss for Pairwise Sentence Scoring Tasks. Anton Chernyavskiy, Dmitry Ilvovsky, Pavel Kalinin, Preslav Nakov |
Syntax: Tagging, Chunking, and Parsing |
Sort by Structure: Language Model Ranking as Dependency Probing. Max Müller-Eberstein, Rob van der Goot, Barbara Plank |
: Siva Reddy | |
Measure and Improve Robustness in NLP Models: A Survey. Xuezhi Wang, Haohan Wang, Diyi Yang | |
Using Paraphrases to Study Properties of Contextual Embeddings. Laura Burdick, Jonathan K Kummerfeld, Rada Mihalcea | |
Can Rationalization Improve Robustness?. Howard Chen, Jacqueline He, Karthik R Narasimhan, Danqi Chen | |
Necessity and Sufficiency for Explaining Text Classifiers: A Case Study in Hate Speech Detection. Esma Balkir, Isar Nejadgholi, Kathleen C. Fraser, Svetlana Kiritchenko | |
What do tokens know about their characters and how do they know it?. Ayush Kaushal, Kyle Mahowald | |
Do Prompt-Based Models Really Understand the Meaning of Their Prompts?. Albert Webson, Ellie Pavlick |
: Greg Durrett | |
NeuS: Neutral Multi-News Summarization for Mitigating Framing Bias. Nayeon Lee, Yejin Bang, Tiezheng YU, Andrea Madotto, Pascale Fung | |
Joint Learning-based Heterogeneous Graph Attention Network for Timeline Summarization. Jingyi You, Dongyuan Li, Hidetaka Kamigaito, Kotaro Funakoshi, Manabu Okumura | |
Interactive Query-Assisted Summarization via Deep Reinforcement Learning. Ori Shapira, Ramakanth Pasunuru, Mohit Bansal, Ido Dagan, Yael Amsterdamer | |
FactPEGASUS: Factuality-Aware Pre-training and Fine-tuning for Abstractive Summarization. David Wan, Mohit Bansal | |
Proposition-Level Clustering for Multi-Document Summarization. Ori Ernst, Avi Caciularu, Ori Shapira, Ramakanth Pasunuru, Mohit Bansal, Jacob Goldberger, Ido Dagan | |
[TACL] A Multi-Level Optimization Framework for End-to-End Text Augmentation. Sai Ashish Somayajula, Linfeng Song, Pengtao Xie |
: Pedro Rodriguez | |
CERES: Pretraining of Graph-Conditioned Transformer for Semi-Structured Session Data. Rui Feng, Chen Luo, Qingyu Yin, Bing Yin, Tuo Zhao, Chao Zhang | |
GPL: Generative Pseudo Labeling for Unsupervised Domain Adaptation of Dense Retrieval. Kexin Wang, Nandan Thakur, Nils Reimers, Iryna Gurevych | |
Boosted Dense Retriever. Patrick Lewis, Barlas Oguz, Wenhan Xiong, Fabio Petroni, Scott Yih, Sebastian Riedel | |
A Dataset for N-ary Relation Extraction of Drug Combinations. Aryeh Tiktinsky, Vijay Viswanathan, Danna Niezni, Dana Meron Azagury, Yosi Shamay, Hillel Taub-Tabib, Tom Hope, Yoav Goldberg | |
ColBERTv2: Effective and Efficient Retrieval via Lightweight Late Interaction. Keshav Santhanam, Omar Khattab, Jon Saad-Falcon, Christopher Potts, Matei Zaharia | |
Multi-Vector Models with Textual Guidance for Fine-Grained Scientific Document Similarity. Sheshera Mysore, Arman Cohan, Tom Hope |
: Parisa Kordjamshidi | |
KAT: A Knowledge Augmented Transformer for Vision-and-Language. Liangke Gui, Borui Wang, Qiuyuan Huang, Alexander G Hauptmann, Yonatan Bisk, Jianfeng Gao | |
Do Trajectories Encode Verb Meaning?. Dylan Ebert, Chen Sun, Ellie Pavlick | |
Diagnosing Vision-and-Language Navigation: What Really Matters. Wanrong Zhu, Yuankai Qi, Pradyumna Narayana, Kazoo Sone, Sugato Basu, Xin Eric Wang, Qi Wu, Miguel Eckstein, William Yang Wang | |
Connecting the Dots between Audio and Text without Parallel Data through Visual Knowledge Transfer. Yanpeng Zhao, Jack Hessel, Youngjae Yu, Ximing Lu, Rowan Zellers, Yejin Choi | |
Guiding Visual Question Generation. Nihir Vedd, Zixu Wang, Marek Rei, yishu miao, Lucia Specia | |
Interactive Symbol Grounding with Complex Referential Expressions. Rimvydas Rubavicius, Alex Lascarides |
: Joao Sedoc | |
Learning to Express in Knowledge-Grounded Conversation. Xueliang Zhao, Tingchen Fu, Chongyang Tao, Wei Wu, Dongyan Zhao, Rui Yan | |
Partner Personas Generation for Dialogue Response Generation. Hongyuan Lu, Wai Lam, Hong Cheng, Helen M. Meng | |
Robust Conversational Agents against Imperceptible Toxicity Triggers. Ninareh Mehrabi, Ahmad Beirami, Fred Morstatter, Aram Galstyan | |
VGNMN: Video-grounded Neural Module Networks for Video-Grounded Dialogue Systems. Hung Le, Nancy F. Chen, Steven HOI | |
Multimodal Dialogue State Tracking. Hung Le, Nancy F. Chen, Steven HOI | |
Commonsense and Named Entity Aware Knowledge Grounded Dialogue Generation. Deeksha varshney, Akshara Prabhakar, Asif Ekbal |
Language Resources and Evaluation |
SkillSpan: Hard and Soft Skill Extraction from English Job Postings. Mike Zhang, Kristian Nørgaard Jensen, Sif Dam Sonniks, Barbara Plank |
Transparent Human Evaluation for Image Captioning. Jungo Kasai, Keisuke Sakaguchi, Lavinia Dunagan, Jacob Daniel Morrison, Ronan Le Bras, Yejin Choi, Noah Smith |
TVShowGuess: Character Comprehension in Stories as Speaker Guessing. Yisi Sang, Xiangyang Mou, Mo Yu, Shunyu Yao, Jing Li, Jeffrey Stanton |
CS1QA: A Dataset for Assisting Code-based Question Answering in an Introductory Programming Course. Changyoon Lee, Yeon Seonwoo, Alice Oh |
Semantic Diversity in Dialogue with Natural Language Inference. Katherine Stasaski, Marti Hearst |
Extending Multi-Text Sentence Fusion Resources via Pyramid Annotations. Daniela Brook Weiss, Paul Roit, Ori Ernst, Ido Dagan |
ChapterBreak: A Challenge Dataset for Long-Range Language Models. Simeng Sun, Katherine Thai, Mohit Iyyer |
The USMLE® Step 2 Clinical Skills Patient Note Corpus. Victoria Yaneva, Janet Mee, Le An Ha, Polina Harik, Michael Jodoin, Alex J Mechaber |
[TACL] Czech Grammar Error Correction with a Large and Diverse Corpus. Jakub Náplava, Milan Straka, Jana Straková, Alexandr Rosen |
Linguistic Theories, Cognitive Modeling and Psycholinguistics |
A Computational Acquisition Model for Multimodal Word Categorization. Uri Berger, Gabriel Stanovsky, Omri Abend, Lea Frermann |
[TACL] He Thinks He Knows Better than the Doctors: BERT for Event Factuality Fails on Pragmatics. Nanjiang Jiang, Marie-Catherine de Marneffe |
Question Answering |
ProQA: Structural Prompt-based Pre-training for Unified Question Answering. Wanjun Zhong, Yifan Gao, Ning Ding, Yujia Qin, Zhiyuan Liu, Ming Zhou, Jiahai Wang, Jian Yin, Nan Duan |
DREAM: Improving Situational QA by First Elaborating the Situation. Yuling Gu, Bhavana Dalvi, Peter Clark |
A New Concept of Knowledge based Question Answering (KBQA) System for Multi-hop Reasoning. Yu Wang, Vijay Srinivasan, Hongxia Jin |
Yes, No or IDK: The Challenge of Unanswerable Yes/No Questions. Elior Sulem, Jamaal Hay, Dan Roth |
Long Context Question Answering via Supervised Contrastive Learning. Avi Caciularu, Ido Dagan, Jacob Goldberger, Arman Cohan |
[TACL] Narrative Question Answering with Cutting-Edge Open-Domain QA Techniques: A Comprehensive Study. Xiangyang Mou, Chenghao Yang, Mo Yu, Bingsheng Yao, Xiaoxiao Guo, Saloni Potdar, Hui Su |
Semantics: Sentence-level Semantics and Textual Inference |
DocAMR: Multi-Sentence AMR Representation and Evaluation. Tahira Naseem, Austin Blodgett, Sadhana Kumaravel, Tim O'Gorman, Young-Suk Lee, Jeffrey Flanigan, Ramon Fernandez Astudillo, Radu Florian, Salim Roukos, Nathan Schneider |
CoSe-Co: Text Conditioned Generative CommonSense Contextualizer. Rachit Bansal, Milan Aggarwal, Sumit Bhatia, Jivat Neet Kaur, Balaji Krishnamurthy |
Batch-Softmax Contrastive Loss for Pairwise Sentence Scoring Tasks. Anton Chernyavskiy, Dmitry Ilvovsky, Pavel Kalinin, Preslav Nakov |
Label Definitions Improve Semantic Role Labeling. Li Zhang, Ishan Jindal, Yunyao Li |
Few-Shot Semantic Parsing with Language Models Trained on Code. Richard Shin, Benjamin Van Durme |
Partial-input baselines show that NLI models can ignore context, but they don't.. Neha Srikanth, Rachel Rudinger |
Improving negation detection with negation-focused pre-training. Thinh Hung Truong, Timothy Baldwin, Trevor Cohn, Karin Verspoor |
Paragraph-based Transformer Pre-training for Multi-Sentence Inference. Luca Di Liello, Siddhant Garg, Luca Soldaini, Alessandro Moschitti |
SUBS: Subtree Substitution for Compositional Semantic Parsing. Jingfeng Yang, Le Zhang, Diyi Yang |
Sentiment Analysis and Stylistic Analysis |
Multi-Domain Targeted Sentiment Analysis. Orith Toledo-Ronen, Matan Orbach, Yoav Katz, Noam Slonim |
UserIdentifier: Implicit User Representations for Simple and Effective Personalized Sentiment Analysis. Fatemehsadat Mireshghallah, Vaishnavi Shrivastava, Milad Shokouhi, Taylor Berg-Kirkpatrick, Robert Sim, Dimitrios Dimitriadis |
Data Augmentation with Dual Training for Offensive Span Detection. Nasim Nouri |
Analyzing Modality Robustness in Multimodal Sentiment Analysis. Devamanyu Hazarika, Yingting Li, Bo Cheng, Shuai Zhao, Roger Zimmermann, Soujanya Poria |
Speech |
Quantifying Language Variation Acoustically with Few Resources. Martijn Bartelds, Martijn Wieling |
: Kai-Wei Chang | |
What Factors Should Paper-Reviewer Assignments Rely On? Community Perspectives on Issues and Ideals in Conference Peer-Review. Terne Sasha Thorn Jakobsen, Anna Rogers | |
Theory-Grounded Measurement of U.S. Social Stereotypes in English Language Models. Yang Trista Cao, Anna Sotnikova, Hal Daumé III, Rachel Rudinger, Linda Zou | |
Benchmarking Intersectional Biases in NLP. John P. Lalor, Yi Yang, Kendall Smith, Nicole Forsgren, Ahmed Abbasi | |
Annotators with Attitudes: How Annotator Beliefs And Identities Bias Toxic Language Detection. Maarten Sap, Swabha Swayamdipta, Laura Vianna, Xuhui Zhou, Yejin Choi, Noah Smith | |
Features or Spurious Artifacts? Data-centric Baselines for Fair and Robust Hate Speech Detection. Alan Ramponi, Sara Tonelli | |
Gender Bias in Masked Language Models for Multiple Languages. Masahiro Kaneko, Aizhan Imankulova, Danushka Bollegala, Naoaki Okazaki |
: Narges Tabari | |
Putting the Con in Context: Identifying Deceptive Actors in the Game of Mafia. Samee Omotayo Ibraheem, Gaoyue Zhou, John DeNero | |
CoMPM: Context Modeling with Speaker's Pre-trained Memory Tracking for Emotion Recognition in Conversation. Joosung Lee, Wooin Lee | |
COGMEN: COntextualized GNN based Multimodal Emotion recognitioN. Abhinav Joshi, Ashwani Bhat, Ayush Jain, Atin Vikram Singh, Ashutosh Modi | |
Domain Confused Contrastive Learning for Unsupervised Domain Adaptation. Quanyu Long, Tianze Luo, Wenya Wang, Sinno Pan | |
Text Style Transfer via Optimal Transport. Nasim Nouri | |
SSEGCN: Syntactic and Semantic Enhanced Graph Convolutional Network for Aspect-based Sentiment Analysis. Zheng Zhang, Zili Zhou, Yanna Wang |
: Luca Soldaini | |
DEGREE: A Data-Efficient Generation-Based Event Extraction Model. I-Hung Hsu, Kuan-Hao Huang, Elizabeth Boschee, Scott Miller, Prem Natarajan, Kai-Wei Chang, Nanyun Peng | |
[TACL] Text-based NP Enrichment. Yanai Elazar, Victoria Basmov, Yoav Goldberg | |
Few-Shot Document-Level Relation Extraction. Nicholas Popovic, Michael Färber | |
Joint Extraction of Entities, Relations, and Events via Modeling Inter-Instance and Inter-Label Dependencies. Minh Van Nguyen, Bonan Min, Franck Dernoncourt, Thien Huu Nguyen | |
Hyperbolic Relevance Matching for Neural Keyphrase Extraction. Mingyang Song, Yi Feng, Liping Jing | |
A Two-Stream AMR-enhanced Model for Document-level Event Argument Extraction. Runxin Xu, Peiyi Wang, Tianyu Liu, Shuang Zeng, Baobao Chang, Zhifang Sui |
: Marti Hearst | |
Mapping the Design Space of Human-AI Interaction in Text Summarization. Ruijia Cheng, Alison Smith-Renner, Ke Zhang, Joel R. Tetreault, Alejandro Jaimes | |
Deconstructing NLG Evaluation: Evaluation Practices, Assumptions, and Their Implications. Kaitlyn Zhou, Su Lin Blodgett, Adam Trischler, Hal Daumé III, Kaheer Suleman, Alexandra Olteanu | |
User-Centric Gender Rewriting. Bashar Alhafni, Nizar Habash, Houda Bouamor | |
Explaining Why: How Instructions and User Interfaces Impact Annotator Rationales When Labeling Text Data. Jamar L. Sullivan Jr., Will Brackenbury, Andrew McNutt, Kevin Bryson, Kwam Byll, Yuxin Chen, Michael Littman, Chenhao Tan, Blase Ur | |
An Exploration of Post-Editing Effectiveness in Text Summarization. Vivian Lai, Alison Smith-Renner, Ke Zhang, Ruijia Cheng, Wenjuan Zhang, Joel R. Tetreault, Alejandro Jaimes | |
The Why and The How: A Survey on Natural Language Interaction in Visualization. Henrik Voigt, Ozge Alacam, Monique Meuschke, Kai Lawonn, Sina Zarrieß |
: Jill Burstein | |
Cryptocurrency Bubble Detection: A New Stock Market Dataset, Financial Task & Hyperbolic Models. Ramit Sawhney, Shivam Agarwal, Vivek Mittal, Paolo Rosso, Vikram Nanda, Sudheer Chava | |
Many Hands Make Light Work: Using Essay Traits to Automatically Score Essays. Rahul Kumar, Sandeep Mathias, Sriparna Saha, Pushpak Bhattacharyya | |
Aligning to Social Norms and Values in Interactive Narratives. Prithviraj Ammanabrolu, Liwei Jiang, Maarten Sap, Hannaneh Hajishirzi, Yejin Choi | |
LITE: Intent-based Task Representation Learning Using Weak Supervision. Naoki Otani, Michael Gamon, Sujay Kumar Jauhar, Mei Yang, Sri Raghu Malireddi, Oriana Riva | |
Forecasting COVID-19 Caseloads Using Unsupervised Embedding Clusters of Social Media Posts. Felix Drinkall, Stefan Zohren, Janet B. Pierrehumbert | |
Context-Aware Abbreviation Expansion Using Large Language Models. Shanqing Cai, Subhashini Venugopalan, Katrin Tomanek, Ajit Narayanan, Meredith Ringel Morris, Michael Brenner |
: Sandesh Swamy | |
Self-supervised Product Title Rewrite for Product Listing Ads. Xue Zhao, Dayiheng Liu, Junwei Ding, Liang Yao, Mahone Yan, Huibo wang, Wenqing Yao | |
Local-to-global learning for iterative training of production SLU models on new features. Yulia Grishina, Daniil Sorokin | |
Medical Coding with Biomedical Transformer Ensembles and Zero/Few-shot Learning. Angelo Ziletti, Alan Akbik, Christoph Berns, Thomas Herold, Marion Legler, Martina Viell | |
CTM - A Model for Large-Scale Multi-View Tweet Topic Classification. Vivek Kulkarni, Kenny Leung, Aria Haghighi | |
Self-Aware Feedback-Based Self-Learning in Large-Scale Conversational AI. Pragaash Ponnusamy, Clint Solomon Mathialagan, Gustavo Aguilar, Chengyuan Ma, Chenlei Guo | |
Aspect-based Analysis of Advertising Appeals for Search Engine Advertising. Soichiro Murakami, Peinan Zhang, Sho Hoshino, Hidetaka Kamigaito, Hiroya Takamura, Manabu Okumura |
[SRW] Systematicity Emerges in Transformers when Abstract Grammatical Roles Guide Attention. Ayush K Chakravarthy, Jacob Labe Russin, Randall O'Reilly |
[SRW] Grounding in social media: An approach to building a chit-chat dialogue model. Ritvik Choudhary, Daisuke Kawahara |
[SRW] ExtraPhrase: Efficient Data Augmentation for Abstractive Summarization. Mengsay Loem, Sho Takase, Masahiro Kaneko, Naoaki Okazaki |
[SRW] Neural Retriever and Go Beyond: A Thesis Proposal. Man Luo |
[SRW] Improving Classification of Infrequent Cognitive Distortions: Domain-Specific Model vs. Data Augmentation. Xiruo Ding, Kevin Lybarger, Justin Tauscher, Trevor Cohen |
[SRW] Towards Gender Biased Language Classification: A Case Study with British English Archival Metadata Descriptions. Lucy Havens |
[SRW] What "Drives" the Use of Metaphorical Language? Negative Insights from Abstractness, Affect, Discourse Coherence and Contextualized Word Representations. Prisca Piccirilli, Sabine Schulte im Walde |
[SRW] Generate, Evaluate, and Select: A Dialogue System with a Response Evaluator for Diversity-Aware Response Generation. Ryoma Sakaeda, Daisuke Kawahara |
[SRW] Building a Personalized Dialogue System with Prompt-Tuning. Tomohito Kasahara, Daisuke Kawahara, Nguyen Tung, Shengzhe Li, Kenta Shinzato, Toshinori Sato |
[SRW] MM-GATBT: Enriching Multimodal Representation Using Graph Attention Network. Seung Byum Seo, Hyoungwook Nam, Payam Delgosha |
[SRW] ViT5: Pretrained Text-to-Text Transformer for Vietnamese Language Generation. Long Phan, Hieu Tran, Hieu Nguyen, Trieu H. Trinh |
[SRW] Compositional Generalization in Grounded Language Learning via Induced Model Sparsity. Sam Spilsbury, Alexander Ilin |
[SRW] How do people talk about images? A study on open-domain conversations with images.. Yi-Pei Chen, Nobuyuki Shimizu, Takashi Miyazaki, Hideki Nakayama |
[SRW] Preschool Children Speech Recognition for Early Childhood Intervention: Motivation and Challenges. Satwik Dutta, Dwight W. Irvin, John H. L. Hansen |
[SRW] A Simple Approach to Jointly Rank Passages and Select Relevant Sentences in the OBQA Context. Man Luo, Shuguang Chen, Chitta Baral |
[SRW] Multimodal Modeling of Task-Mediated Confusion. Camille Mince, Skye Rhomberg, Cecilia Alm, Reynold Bailey, Alex Ororbia |
[SRW] Machine Narrative Comprehension in Fictional Characters Personality Prediction Task. Yisi Sang, Xiangyang Mou, Mo Yu, Dakuo Wang, Jing Li, Jeffrey Stanton |
[SRW] Divide & Conquer for Entailment-aware Multi-hop Evidence Retrieval. Fan Luo, Mihai Surdeanu |
[SRW] Multimodal large language models for inclusive collaboration learning tasks. Armanda Lewis |
[SRW] Neural Networks in a Product of Hyperbolic Spaces. Jun Takeuchi, Noriki Nishida, Hideki Nakayama |
[SRW] Strong Heuristics for Named Entity Linking. Marko Čuljak, Andreas Spitz, Robert West, Akhil Arora |
[SRW] Unifying Parsing and Tree-Structured Models for Generating Sentence Semantic Representations. Antoine Simoulin, Benoit Crabbé |
[SRW] Defending Compositionality in Emergent Languages. Michal Auersperger, Pavel Pecina |
[SRW] Exploring the Effect of Dialect Mismatched Language Models in Telugu Automatic Speech Recognition. Aditya Yadavalli, Ganesh Sai Mirishkar, Anil Vuppala |
: Jesse Thomason | |
All You May Need for VQA are Image Captions. Soravit Changpinyo, Doron Kukliansy, Idan Szpektor, Xi Chen, Nan Ding, Radu Soricut | |
Imagination-Augmented Natural Language Understanding. Yujie Lu, Wanrong Zhu, Xin Eric Wang, Miguel Eckstein, William Yang Wang | |
Visual Commonsense in Pretrained Unimodal and Multimodal Models. Chenyu Zhang, Benjamin Van Durme, Zhuowan Li, Elias Stengel-Eskin | |
Few-shot Subgoal Planning with Language Models. Lajanugen Logeswaran, Yao Fu, Moontae Lee, Honglak Lee | |
Disentangling Categorization in Multi-agent Emergent Communication. Washington Garcia, Hamilton Scott Clouse, Kevin R. B. Butler | |
CoSIm: Commonsense Reasoning for Counterfactual Scene Imagination. Hyounghun Kim, Abhay Zala, Mohit Bansal |
: Miguel Ballesteros | |
[CL] Tractable Parsing for CCGs of Bounded Degree. Lena Katharina Schiffer, Marco Kuhlmann, Giorgio Satta | |
Template-free Prompt Tuning for Few-shot NER. Ruotian Ma, Xin Zhou, Tao Gui, Yiding Tan, Linyang Li, Qi Zhang, Xuanjing Huang | |
Dynamic Gazetteer Integration in Multilingual Models for Cross-Lingual and Cross-Domain Named Entity Recognition. Besnik Fetahu, Anjie Fang, Oleg Rokhlenko, Shervin Malmasi | |
Unsupervised Cross-Lingual Transfer of Structured Predictors without Source Data. Kemal Kurniawan, Lea Frermann, Philip Schulz, Trevor Cohn | |
Masked Part-Of-Speech Model: Does Modeling Long Context Help Unsupervised POS-tagging?. Xiang Zhou, Shiyue Zhang, Mohit Bansal | |
Dynamic Programming in Rank Space: Scaling Structured Inference with Low-Rank HMMs and PCFGs. Songlin Yang, Wei Liu, Kewei Tu |
: Antonis Anastasopoulos | |
A Balanced Data Approach for Evaluating Cross-Lingual Transfer: Mapping the Linguistic Blood Bank. Dan Malkin, Tomasz Limisiewicz, Gabriel Stanovsky | |
When is BERT Multilingual? Isolating Crucial Ingredients for Cross-lingual Transfer. Ameet Deshpande, Partha Talukdar, Karthik R Narasimhan | |
Lifting the Curse of Multilinguality by Pre-training Modular Transformers. Jonas Pfeiffer, Naman Goyal, Xi Victoria Lin, Xian Li, James Cross, Sebastian Riedel, Mikel Artetxe | |
Combating the Curse of Multilinguality in Cross-Lingual WSD by Aligning Sparse Contextualized Word Representations. Gábor Berend | |
On the Economics of Multilingual Few-shot Learning: Modeling the Cost-Performance Trade-offs of Machine Translated and Manual Data. Kabir Ahuja, Monojit Choudhury, Sandipan Dandapat | |
Bridging the Gap between Language Models and Cross-Lingual Sequence Labeling. Nuo Chen, Linjun Shou, MING GONG, Jian Pei, Daxin Jiang |
: Wei Xu | |
DEMix Layers: Disentangling Domains for Modular Language Modeling. Suchin Gururangan, Mike Lewis, Ari Holtzman, Noah Smith, Luke Zettlemoyer | |
Practice Makes a Solver Perfect: Data Augmentation for Math Word Problem Solvers. Vivek Kumar, Rishabh Maheshwary, Vikram Pudi | |
Quantifying Adaptability in Pre-trained Language Models with 500 Tasks. Belinda Z. Li, Jane A. Yu, Madian Khabsa, Luke Zettlemoyer, Alon Y. Halevy, Jacob Andreas | |
KALA: Knowledge-Augmented Language Model Adaptation. Minki Kang, Jinheon Baek, Sung Ju Hwang | |
Extreme Zero-Shot Learning for Extreme Text Classification. Yuanhao Xiong, Wei-Cheng Chang, Cho-Jui Hsieh, Hsiang-Fu Yu, Inderjit S Dhillon | |
TreeMix: Compositional Constituency-based Data Augmentation for Natural Language Understanding. Le Zhang, Zichao Yang, Diyi Yang |
: Ashish Sabharwal | |
QuALITY: Question Answering with Long Input Texts, Yes!. Richard Yuanzhe Pang, Alicia Parrish, Nitish Joshi, Nikita Nangia, Jason Phang, Angelica Chen, Vishakh Padmakumar, Johnny L Ma, Jana Thompson, He He, Samuel R. Bowman | |
On the Robustness of Reading Comprehension Models to Entity Renaming. Jun Yan, Yang Xiao, Sagnik Mukherjee, Bill Yuchen Lin, Robin Jia, Xiang Ren | |
OmniTab: Pretraining with Natural and Synthetic Data for Few-shot Table-based Question Answering. Zhengbao Jiang, Yi Mao, Pengcheng He, Graham Neubig, Weizhu Chen | |
Modeling Exemplification in Long-form Question Answering via Retrieval. Shufan Wang, Fangyuan Xu, Laure Thompson, Eunsol Choi, Mohit Iyyer | |
JointLK: Joint Reasoning with Language Models and Knowledge Graphs for Commonsense Question Answering. Yueqing Sun, Qi Shi, Le Qi, Yu Zhang | |
Clues Before Answers: Generation-Enhanced Multiple-Choice QA. Zixian Huang, Ao Wu, Jiaying Zhou, Yu Gu, Yue Zhao, Gong Cheng |
: Daphne Ippolito | |
[SRW] Methods for Estimating and Improving Robustness of Language Models. Michal Stefanik | |
[SRW] Retrieval-augmented Generation across Heterogeneous Knowledge. Wenhao Yu | |
[SRW] Neural Retriever and Go Beyond: A Thesis Proposal. Man Luo | |
[SRW] Towards Gender Biased Language Classification: A Case Study with British English Archival Metadata Descriptions. Lucy Havens | |
[SRW] Multimodal large language models for inclusive collaboration learning tasks. Armanda Lewis |
Industry Track Posters |
Scalable and Robust Self-Learning for Skill Routing in Large-Scale Conversational AI Systems. Mohammad Kachuee, Jinseok Nam, Sarthak Ahuja, Jin-Myung Won, SUNGJIN LEE |
AB/BA analysis: A framework for estimating keyword spotting recall improvement while maintaining audio privacy. Raphael Petegrosso, VasistaKrishna Baderdinnni, Thibaud Senechal, Benjamin Bullough |
Temporal Generalization for Spoken Language Understanding. Judith Gaspers, Anoop Kumar, Greg Ver Steeg, Aram Galstyan |
An End-to-End Dialogue Summarization System for Sales Calls. Abedelkadir Asi, Song Wang, Roy Eisenstadt, Dean Geckt, Yarin Kuper, Yi Mao, Royi Ronen |
Controlled Data Generation via Insertion Operations for NLU. Manoj Kumar, Yuval Merhav, Haidar Khan, Rahul Gupta, Anna Rumshisky, Wael Hamza |
Easy and Efficient Transformer: Scalable Inference Solution For Large NLP Model. Li GongZheng LGZ, Yadong Xi, Jingzhen Ding, Duan Wang, Ziyang Luo, Rongsheng Zhang, Bai Liu, Changjie Fan, Xiaoxi Mao, Zeng Zhao |
Efficient Semi-supervised Consistency Training for Natural Language Understanding. George Leung, Joshua Tan |
Distantly Supervised Aspect Clustering And Naming For E-Commerce Reviews. Prateek Sircar, Aniket Chakrabarti, DEEPAK GUPTA, Anirban Majumder |
CULG: Commercial Universal Language Generation. Haonan Li, yameng huang, Yeyun Gong, Jian Jiao, Ruofei Zhang, Timothy Baldwin, Nan Duan |
Constraining word alignments with posterior regularization for label transfer. Thomas Gueudre, Kevin Martin Jose |
Explaining the Effectiveness of Multi-Task Learning for Efficient Knowledge Extraction from Spine MRI Reports. Arijit Sehanobish, McCullen Sandora, Nabila Abraham, Jayashri Pawar, Danielle Torres, Anasuya Das, Murray Becker, Richard Herzog, Benjamin Odry, Ron Vianu |
Asynchronous Convergence in Multi-Task Learning via Knowledge Distillation from Converged Tasks. Weiyi Lu, Sunny Rajagopalan, Priyanka Nigam, Jaspreet Singh, Xiaodi Sun, Yi Xu, Belinda Zeng, Trishul Chilimbi |
Augmenting Training Data for Massive Semantic Matching Models in Low-Traffic E-commerce Stores. Ashutosh Joshi, Shankar Vishwanath, Choon Hui Teo, Vaclav Petricek, Vishy Vishwanathan, Rahul Bhagat, Jonathan May |
Retrieval Based Response Letter Generation For a Customer Care Setting. Biplob Biswas, Renhao Cui, Rajiv Ramnath |
Knowledge extraction from aeronautical messages (NOTAMs) with self-supervised language models for aircraft pilots. Alexandre Arnold, Fares Ernez, Catherine Kobus, Marion-Cécile Martin |
Intent Discovery for Enterprise Virtual Assistants: Applications of Utterance Embedding and Clustering to Intent Mining. Minhua Chen, Badrinath Jayakumar, Michael Johnston, S. Eman Mahmoodi, Daniel Pressel |
Lightweight Transformers for Conversational AI. Daniel Pressel, Wenshuo Liu, Michael Johnston, Minhua Chen |
NER-MQMRC: Formulating Named Entity Recognition as Multi Question Machine Reading Comprehension. Anubhav Shrimal, Avi Jain, Kartik Mehta, Promod Yenigalla |
What Do Users Care About? Detecting Actionable Insights from User Feedback. Kasturi Bhattacharjee, Rashmi Gangadharaiah, Kathleen McKeown, Dan Roth |
Developing a Production System for Purpose of Call Detection in Business Phone Conversations. Elena Khasanova, Pooja Hiranandani, Shayna Gardiner, Cheng Chen, Simon Corston-Oliver, Xue-Yong Fu |
Adversarial Text Normalization. Joanna Bitton, Maya Pavlova, Ivan Evtimov |
Constraint-based Multi-hop Question Answering with Knowledge Graph. Sayantan Mitra, Roshni Ramnani, Shubhashis Sengupta |
Fast Bilingual Grapheme-To-Phoneme Conversion. Hwa-Yeon Kim, Jong-Hwan Kim, Jae-Min Kim |
Knowledge Extraction From Texts Based on Wikidata. Anastasia Shimorina, Johannes Heinecke, Frédéric Herledan |
AIT-QA: Question Answering Dataset over Complex Tables in the Airline Industry. Yannis Katsis, Saneem Ahmed Chemmengath, vishwajeet kumar, Samarth Bharadwaj, MUSTAFA CANIM, Michael Glass, Alfio Gliozzo, Feifei Pan, Jaydeep Sen, Karthik Sankaranarayanan, Soumen Chakrabarti |
Parameter-efficient Continual Learning Framework in Industrial Real-time Text Classification System. Tao Zhu, Zhe Zhao, Weijie Liu, Jiachi Liu, Yiren Chen, Weiquan Mao, Haoyan Liu, Kunbo Ding, Yudong Li, Xuefeng Yang, Kimmo Yan |
Fast and Light-Weight Answer Text Retrieval in Dialogue Systems. Hui Wan, Siva Sankalp Patel, J William Murdock, Saloni Potdar, Sachindra Joshi |
BLINK with Elasticsearch for Efficient Entity Linking in Business Conversations. Md Tahmid Rahman Laskar, Cheng Chen, Aliaksandr Martsinovich, Jonathan Johnston, Xue-Yong Fu, Shashi Bhushan Tn, Simon Corston-Oliver |
Q2R: A Query-to-Resolution System for Natural-Language Queries. Shiau Hong Lim, Laura Wynter |
Identifying Corporate Credit Risk Sentiments from Financial News. Noujoud Ahbali, Xinyuan Liu, Albert Aristotle Nanda, Jamie Stark, Ashit Talukder, Rupinder Paul Khandpur |
Demo Track Posters |
textless-lib: a Library for Textless Spoken Language Processing. Eugene Kharitonov, Jade Copet, Kushal Lakhotia, Tu Anh Nguyen, Paden Tomasello, Ann Lee, Ali Elkahky, Wei-Ning Hsu, Abdelrahman Mohamed, Emmanuel Dupoux, Yossef Mordechay Adi |
Web-based Annotation Interface for Derivational Morphology. Lukáš Kyjánek |
TurkishDelightNLP: A Neural Turkish NLP Toolkit. Huseyin Alecakir, Necva Bölücü, Burcu Can |
ZS4IE: A toolkit for Zero-Shot Information Extraction with simple Verbalizations. Oscar Sainz, Haoling Qiu, Oier Lopez de Lacalle, Eneko Agirre, Bonan Min |
Flowstorm: Open-Source Platform with Hybrid Dialogue Architecture. Jan Pichl, Petr Marek, Jakub Konrád, Petr Lorenc, Ondrej Kobza, Tomáš Zajíček, Jan Šedivý |
Contrastive Explanations of Text Classifiers as a Service. Lorenzo Malandri, Fabio Mercorio, Mario Mezzanzanica, Navid Nobani, Andrea Seveso |
RESIN-11: Schema-guided Event Prediction for 11 Newsworthy Scenarios. Xinya Du, Zixuan Zhang, Sha Li, Pengfei Yu, Hongwei Wang, Tuan Lai, Xudong Lin, Ziqi Wang, Iris Liu, Ben Zhou, Haoyang Wen, Manling Li, Darryl Hannan, Jie Lei, Hyounghun Kim, Rotem Dror, Haoyu Wang, Michael Regan, Qi Zeng, Qing Lyu, Charles Yu, Carl Edwards, Xiaomeng Jin, Yizhu Jiao, Ghazaleh Kazeminejad, Zhenhailong Wang, Chris Callison-Burch, Mohit Bansal, Carl Vondrick, Jiawei Han, Dan Roth, Shih-Fu Chang, Martha Palmer, Heng Ji |
A Human-machine Interface for Few-shot Rule Synthesis for Information Extraction. Robert Vacareanu, George C. G. Barbosa, Enrique Noriega-Atala, Gus Hahn-Powell, Rebecca Sharp, Marco Antonio Valenzuela-Escárcega, Mihai Surdeanu |
SETSum: Summarization and Visualization of Student Evaluations of Teaching. Yinuo Hu, Shiyue Zhang, Viji Sathy, Abigail Panter, Mohit Bansal |
Towards Open-Domain Topic Classification. Hantian Ding, Jinrui Yang, Yuqian Deng, Hongming Zhang, Dan Roth |
SentSpace: Large-Scale Benchmarking and Evaluation of Text using Cognitively Motivated Lexical, Syntactic, and Semantic Features. Greta Tuckute, Aalok Sathe, Mingye Wang, Harley Yoder, Cory Shain, Evelina Fedorenko |
PaddleSpeech: An Easy-to-Use All-in-One Speech Toolkit. Hui Zhang, Tian Yuan, Junkun Chen, Xintong Li, Renjie Zheng, Yuxin Huang, Xiaojie Chen, Enlei Gong, Zeyu Chen, Xiaoguang Hu, Dianhai Yu, Yanjun Ma, Liang Huang |
DadmaTools: Natural Language Processing Toolkit for Persian Language. Romina Etezadi, Mohammad Karrabi, Najmeh Zare, Mohamad Bagher Sajadi, Mohammad Taher Pilehvar |
FAMIE: A Fast Active Learning Framework for Multilingual Information Extraction. Minh Van Nguyen, Nghia Trung Ngo, Bonan Min, Thien Huu Nguyen |
Computational Social Science and Cultural Analytics |
[Findings] Detect Rumors in Microblog Posts for Low-Resource Domains via Adversarial Contrastive Learning. Hongzhan Lin, Jing Ma, Liangliang Chen, Zhiwei Yang, Mingfei Cheng, Guang Chen |
Dialogue and Interactive Systems |
Knowledge-Grounded Dialogue Generation with a Unified Knowledge Representation. Yu Li, Baolin Peng, yelong shen, Yi Mao, Lars Liden, Zhou Yu, Jianfeng Gao |
Learning Dialogue Representations from Consecutive Utterances. Zhihan Zhou, Dejiao Zhang, Wei Xiao, Nicholas Dingwall, Xiaofei Ma, Andrew Arnold, Bing Xiang |
Emp-RFT: Empathetic Response Generation via Recognizing Feature Transitions between Utterances. Wongyu Kim, Youbin Ahn, Donghyun Kim, Kyong-Ho Lee |
Show, Don't Tell: Demonstrations Outperform Descriptions for Schema-Guided Task-Oriented Dialogue. Raghav Gupta, Harrison Lee, Jeffrey Zhao, Yuan Cao, Abhinav Rastogi, Yonghui Wu |
Disentangling Indirect Answers to Yes-No Questions in Real Conversations. Krishna Chaitanya Sanagavarapu, Jathin Pranav Singaraju, Anusha Kakileti, Anirudh Kaza, Aaron Abraham Mathews, Helen Li, Nathan Raul Brito, Eduardo Blanco |
On the Origin of Hallucinations in Conversational Models: Is it the Datasets or the Models?. Nouha Dziri, Sivan Milton, Mo Yu, Osmar Zaiane, Siva Reddy |
[Findings] Instilling Type Knowledge in Language Models via Multi-Task QA. Shuyang Li, Mukund Sridhar, Chandana Satya Prakash, Jin Cao, Wael Hamza, Julian McAuley |
[Findings] A Versatile Adaptive Curriculum Learning Framework for Task-oriented Dialogue Policy Learning. Yang Yang Zhao, Hua Qin, Wang Zhenyu, Changxi Zhu, Shihan Wang |
Database Search Results Disambiguation for Task-Oriented Dialog Systems. Kun Qian, Satwik Kottur, Ahmad Beirami, Shahin Shayandeh, Paul A. Crook, Alborz Geramifard, Zhou Yu, Chinnadhurai Sankar |
Efficient Methods in NLP |
Word Tour: One-dimensional Word Embeddings via the Traveling Salesman Problem. Ryoma Sato |
Causal Distillation for Language Models. Zhengxuan Wu, Atticus Geiger, Joshua Rozner, Elisa Kreiss, Hanson Lu, Thomas Icard, Christopher Potts, Noah Goodman |
[Findings] Attention Fusion: a light yet efficient late fusion mechanism for task adaptation in NLU. Jin Cao, Chandana Satya Prakash, Wael Hamza |
[Findings] Towards Computationally Feasible Deep Active Learning. Akim Tsvigun, Artem Shelmanov, Gleb Kuzmin, Leonid Sanochkin, Daniil Larionov, Gleb Gennadjevich Gusev, Manvel Avetisian, Leonid Zhukov |
[Findings] Pruning Adatperfusion with Lottery Ticket Hypothesis. Jiarun Wu, Qingliang Chen, Zeguan Xiao, Yuliang Gu, Mengsi Sun |
Ethics, Bias, and Fairness |
Measuring Fairness with Biased Rulers: A Comparative Study on Bias Metrics for Pre-trained Language Models. Pieter Delobelle, Ewoenam Kwaku Tokpo, Toon Calders, Bettina Berendt |
Using Natural Sentence Prompts for Understanding Biases in Language Models. Sarah Alnegheimish, Alicia Guo, Yi Sun |
Human-Centered NLP |
Do Deep Neural Nets Display Human-like Attention in Short Answer Scoring?. Zijie Zeng, XINYU LI, Dragan Gasevic, Guanliang Chen |
Towards Process-Oriented, Modular, and Versatile Question Generation that Meets Educational Needs. Xu Wang, Simin Fan, Jessica Houghton, Lu Wang |
Machine-in-the-Loop Rewriting for Creative Image Captioning. Vishakh Padmakumar, He He |
Information Extraction |
Sentence-Level Resampling for Named Entity Recognition. Xiaochen Wang, Yue Wang |
Unified Semantic Typing with Meaningful Label Inference. James Y. Huang, Bangzheng Li, Jiashu Xu, Muhao Chen |
Crossroads, Buildings and Neighborhoods: A Dataset for Fine-grained Location Recognition. Pei Chen, Haotian Xu, Cheng Zhang, Ruihong Huang |
Modeling Task Interactions in Document-Level Joint Entity and Relation Extraction. Liyan Xu, Jinho D. Choi |
[Findings] GraphCache: Message Passing as Caching for Sentence-Level Relation Extraction. Yiwei Wang, Muhao Chen, Wenxuan Zhou, Yujun Cai, Yuxuan Liang, Bryan Hooi |
Information Retrieval and Text Mining |
Seed-Guided Topic Discovery with Out-of-Vocabulary Seeds. Yu Zhang, Yu Meng, Xuan Wang, Sheng Wang, Jiawei Han |
Improving Neural Models for Radiology Report Retrieval with Lexicon-based Automated Annotation. Luyao Shi, Tanveer Syeda-mahmood, Tyler Baldwin |
Is Neural Topic Modelling Better than Clustering? An Empirical Study on Clustering with Contextual Embeddings for Topics. Zihan Zhang, Meng Fang, Ling Chen, Mohammad Reza Namazi Rad |
Interpretability and Analysis of Models for NLP |
Reframing Human-AI Collaboration for Generating Free-Text Explanations. Sarah Wiegreffe, Jack Hessel, Swabha Swayamdipta, Mark Riedl, Yejin Choi |
Implicit n-grams Induced by Recurrence. Xiaobing Sun, Wei Lu |
Locally Aggregated Feature Attribution on Natural Language Model Understanding. Sheng Zhang, Jin Wang, Haitao Jiang, Rui Song |
[Findings] White-box Testing of NLP models with Mask Neuron Coverage. Arshdeep Sekhon, Yangfeng Ji, Matthew Dwyer, Yanjun Qi |
Simple Local Attentions Remain Competitive for Long-Context Tasks. Wenhan Xiong, Barlas Oguz, Anchit Gupta, Xilun Chen, Diana Liskovich, Omer Levy, Scott Yih, Yashar Mehdad |
On the Diversity and Limits of Human Explanations. Chenhao Tan |
Informativeness and Invariance: Two Perspectives on Spurious Correlations in Natural Language. Jacob Eisenstein |
[TACL] Explanation-Based Human Debugging of NLP Models: A Survey. Piyawat Lertvittayakumjorn, Francesca Toni |
Language Generation |
Go Back in Time: Generating Flashbacks in Stories with Event Temporal Prompts. Rujun Han, Hong Chen, Yufei Tian, Nanyun Peng |
[Findings] Syntax Controlled Knowledge Graph-to-Text Generation with Order and Semantic Consistency. Jin Liu, chongfeng fan, zhou Fengyu, Huijuan Xu |
Language Grounding to Vision, Robotics and Beyond |
Exposing the Limits of Video-Text Models through Contrast Sets. Jae Sung Park, Sheng Shen, Ali Farhadi, Trevor Darrell, Yejin Choi, Anna Rohrbach |
Language Resources and Evaluation |
Semantic Diversity in Dialogue with Natural Language Inference. Katherine Stasaski, Marti Hearst |
CS1QA: A Dataset for Assisting Code-based Question Answering in an Introductory Programming Course. Changyoon Lee, Yeon Seonwoo, Alice Oh |
The USMLE® Step 2 Clinical Skills Patient Note Corpus. Victoria Yaneva, Janet Mee, Le An Ha, Polina Harik, Michael Jodoin, Alex J Mechaber |
Transparent Human Evaluation for Image Captioning. Jungo Kasai, Keisuke Sakaguchi, Lavinia Dunagan, Jacob Daniel Morrison, Ronan Le Bras, Yejin Choi, Noah Smith |
ChapterBreak: A Challenge Dataset for Long-Range Language Models. Simeng Sun, Katherine Thai, Mohit Iyyer |
TVShowGuess: Character Comprehension in Stories as Speaker Guessing. Yisi Sang, Xiangyang Mou, Mo Yu, Shunyu Yao, Jing Li, Jeffrey Stanton |
SkillSpan: Hard and Soft Skill Extraction from English Job Postings. Mike Zhang, Kristian Nørgaard Jensen, Sif Dam Sonniks, Barbara Plank |
Extending Multi-Text Sentence Fusion Resources via Pyramid Annotations. Daniela Brook Weiss, Paul Roit, Ori Ernst, Ido Dagan |
[TACL] Czech Grammar Error Correction with a Large and Diverse Corpus. Jakub Náplava, Milan Straka, Jana Straková, Alexandr Rosen |
Linguistic Theories, Cognitive Modeling and Psycholinguistics |
A Computational Acquisition Model for Multimodal Word Categorization. Uri Berger, Gabriel Stanovsky, Omri Abend, Lea Frermann |
[TACL] He Thinks He Knows Better than the Doctors: BERT for Event Factuality Fails on Pragmatics. Nanjiang Jiang, Marie-Catherine de Marneffe |
Machine Translation |
Building Multilingual Machine Translation Systems That Serve Arbitrary XY Translations. Akiko Eriguchi, Shufang Xie, Tao Qin, Hany Hassan |
Quality-Aware Decoding for Neural Machine Translation. Patrick Fernandes, António Farinhas, Ricardo Rei, José G. C. de Souza, Perez Ogayo, Graham Neubig, Andre Martins |
A Study of Syntactic Multi-Modality in Non-Autoregressive Machine Translation. Kexun Zhang, Rui Wang, Xu Tan, Junliang Guo, Yi Ren, Tao Qin, Tie-Yan Liu |
Tricks for Training Sparse Translation Models. Dheeru Dua, Shruti Bhosale, Vedanuj Goswami, James Cross, Mike Lewis, Angela Fan |
[Findings] When do Contrastive Word Alignments Improve Many-to-many Neural Machine Translation?. Zhuoyuan Mao, Chenhui Chu, Raj Dabre, Haiyue Song, Zhen Wan, Sadao Kurohashi |
Multilinguality |
[CL] Investigating Language Relationships in Multilingual Sentence Encoders through the Lens of Linguistic Typology. Rochelle Choenni, Ekaterina Shutova |
NLP Applications |
Cross-document Misinformation Detection based on Event Graph Reasoning. Xueqing Wu, Kung-Hsiang Huang, Yi Fung, Heng Ji |
A Word is Worth A Thousand Dollars: Adversarial Attack on Tweets Fools Stock Prediction. Yong Xie, Dakuo Wang, Pin-Yu Chen, Jinjun Xiong, Sijia Liu, Oluwasanmi O Koyejo |
Privacy-Preserving Text Classification on BERT Embeddings with Homomorphic Encryption. Garam Lee, Minsoo Kim, Jai Hyun Park, seung-won hwang, Jung Hee Cheon |
[Findings] Harmless Transfer Learning for Item Embeddings. Chengyue Gong, Xiaocong Du, Dhruv Choudhary, Bhargav Bhushanam, qiang liu, Arun Kejariwal |
Phonology, Morphology and Word Segmentation |
Grapheme-to-Phoneme Conversion for Thai using Neural Regression Models. Tomohiro Yamasaki |
Question Answering |
[TACL] Narrative Question Answering with Cutting-Edge Open-Domain QA Techniques: A Comprehensive Study. Xiangyang Mou, Chenghao Yang, Mo Yu, Bingsheng Yao, Xiaoxiao Guo, Saloni Potdar, Hui Su |
Semantics: Lexical Semantics |
[Findings] Improving Contextual Representation with Gloss Regularized Pre-training. Yu Lin, Zhecheng An, Peihao Wu, Zejun MA |
Semantics: Sentence-level Semantics and Textual Inference |
SUBS: Subtree Substitution for Compositional Semantic Parsing. Jingfeng Yang, Le Zhang, Diyi Yang |
CoSe-Co: Text Conditioned Generative CommonSense Contextualizer. Rachit Bansal, Milan Aggarwal, Sumit Bhatia, Jivat Neet Kaur, Balaji Krishnamurthy |
MuCPAD: A Multi-Domain Chinese Predicate-Argument Dataset. Yahui Liu, Haoping Yang, Chen Gong, Qingrong Xia, Zhenghua Li, Min Zhang |
MGIMN: Multi-Grained Interactive Matching Network for Few-shot Text Classification. Jianhai Zhang, Mieradilijiang Maimaiti, Gao Xing, Yuanhang Zheng, Ji Zhang |
DocAMR: Multi-Sentence AMR Representation and Evaluation. Tahira Naseem, Austin Blodgett, Sadhana Kumaravel, Tim O'Gorman, Young-Suk Lee, Jeffrey Flanigan, Ramon Fernandez Astudillo, Radu Florian, Salim Roukos, Nathan Schneider |
Improving negation detection with negation-focused pre-training. Thinh Hung Truong, Timothy Baldwin, Trevor Cohn, Karin Verspoor |
Does Pre-training Induce Systematic Inference? How Masked Language Models Acquire Commonsense Knowledge. Ian Porada, Alessandro Sordoni, Jackie CK Cheung |
Partial-input baselines show that NLI models can ignore context, but they don't.. Neha Srikanth, Rachel Rudinger |
[Findings] Analytical Reasoning of Text. Wanjun Zhong, Siyuan Wang, Duyu Tang, Zenan Xu, Daya Guo, Yining Chen, Jiahai Wang, Jian Yin, Ming Zhou, Nan Duan |
[Findings] ATP: AMRize Then Parse! Enhancing AMR Parsing with PseudoAMRs. Liang Chen, Peiyi Wang, Runxin Xu, Tianyu Liu, Zhifang Sui, Baobao Chang |
Sentiment Analysis and Stylistic Analysis |
A Robustly Optimized BMRC for Aspect Sentiment Triplet Extraction. Shu Liu, Kaiwen Li, Zuhe Li |
Data Augmentation with Dual Training for Offensive Span Detection. Nasim Nouri |
Multi-Domain Targeted Sentiment Analysis. Orith Toledo-Ronen, Matan Orbach, Yoav Katz, Noam Slonim |
UserIdentifier: Implicit User Representations for Simple and Effective Personalized Sentiment Analysis. Fatemehsadat Mireshghallah, Vaishnavi Shrivastava, Milad Shokouhi, Taylor Berg-Kirkpatrick, Robert Sim, Dimitrios Dimitriadis |
Analyzing Modality Robustness in Multimodal Sentiment Analysis. Devamanyu Hazarika, Yingting Li, Bo Cheng, Shuai Zhao, Roger Zimmermann, Soujanya Poria |
Speech |
[Findings] End-to-end Spoken Conversational Question Answering: Task, Dataset and Model. Chenyu You, Nuo Chen, Fenglin Liu, Shen Ge, Xian Wu, Yuexian Zou |
Summarization |
TSTR: Too Short to Represent, Summarize with Details! Intro-Guided Extended Summary Generation. Sajad Sotudeh, Nazli Goharian |
Improving Multi-Document Summarization through Referenced Flexible Extraction with Credit-Awareness. Yun-Zhu Song, Yi-Syuan Chen, Hong-Han Shuai |
SueNes: A Weakly Supervised Approach to Evaluating Single-Document Summarization via Negative Sampling. Forrest Sheng Bao, Ge Luo, Hebi Li, Minghui Qiu, Yinfei Yang, Youbiao He, Cen Chen |
Investigating Crowdsourcing Protocols for Evaluating the Factual Consistency of Summaries. Xiangru Tang, Alexander Fabbri, Haoran Li, Ziming Mao, Griffin Thomas Adams, Borui Wang, Asli Celikyilmaz, Yashar Mehdad, Dragomir Radev |
Syntax: Tagging, Chunking, and Parsing |
[CL] The Impact of Edge Displacement Vaserstein Distance on UD Parsing Performance. Mark Anderson, Carlos Gómez-Rodríguez |
: Gabriel Stanovsky | |
When a sentence does not introduce a discourse entity, Transformer-based models still sometimes refer to it. Sebastian Schuster, Tal Linzen | |
Analyzing Encoded Concepts in Transformer Language Models. Hassan Sajjad, Nadir Durrani, Fahim Dalvi, Firoj Alam, Abdul Rafae Khan, Jia Xu | |
Probing via Prompting. Jiaoda Li, Ryan Cotterell, Mrinmaya Sachan | |
GlobEnc: Quantifying Global Token Attribution by Incorporating the Whole Encoder Layer in Transformers. Ali Modarressi, Mohsen Fayyaz, Yadollah Yaghoobzadeh, Mohammad Taher Pilehvar |
: Jessica Ouyang | |
From spoken dialogue to formal summary: An utterance rewriting for dialogue summarization. Yue Fang, Hainan Zhang, Hongshen Chen, Zhuoye Ding, Bo Long, Yanyan Lan, Yanquan Zhou | |
Domain-Oriented Prefix-Tuning: Towards Efficient and Generalizable Fine-tuning for Zero-Shot Dialogue Summarization. Lulu Zhao, Fujia Zheng, Weihao Zeng, Keqing He, Weiran Xu, Huixing Jiang, Wei Wu, Yanan Wu | |
DialSummEval: Revisiting Summarization Evaluation for Dialogues. Mingqi Gao, Xiaojun Wan | |
DACSA: A large-scale Dataset for Automatic summarization of Catalan and Spanish newspaper Articles. Encarna Segarra, Vicent Ahuir, Lluís-F. Hurtado, José Ángel González |
: Qiang Ning | |
Robust Self-Augmentation for Named Entity Recognition with Meta Reweighting. Linzhi Wu, Pengjun Xie, Jie Zhou, Meishan Zhang, Ma Chunping, Guangwei Xu, Min Zhang | |
GMN: Generative Multi-modal Network for Practical Document Information Extraction. Haoyu Cao, Jiefeng Ma, Antai Guo, Yiqing Hu, Hao Liu, Deqiang Jiang, Yinsong Liu, Bo Ren | |
DocEE: A Large-Scale and Fine-grained Benchmark for Document-level Event Extraction. MeiHan Tong, Bin Xu, Shuai Wang, Meihuan Han, Yixin Cao, Jiangqi Zhu, Siyu Chen, Lei Hou, Juanzi Li | |
HiURE: Hierarchical Exemplar Contrastive Learning for Unsupervised Relation Extraction. Shuliang Liu, Xuming Hu, Chenwei Zhang, Shu'ang Li, Lijie Wen, Philip S. Yu |
: Kenneth Heafield | |
Neighbors Are Not Strangers: Improving Non-Autoregressive Translation under Low-Frequency Lexical Constraints. Chun Zeng, Jiangjie Chen, Tianyi Zhuang, Rui Xu, Hao Yang, Qin Ying, shimin tao, Yanghua Xiao | |
Nearest Neighbor Knowledge Distillation for Neural Machine Translation. Zhixian Yang, Renliang Sun, Xiaojun Wan | |
Cross-modal Contrastive Learning for Speech Translation. Rong Ye, Mingxuan Wang, Lei Li | |
One Reference Is Not Enough: Diverse Distillation with Reference Selection for Non-Autoregressive Translation. Chenze Shao, Xuanfu Wu, Yang Feng |
: Zhou Yu | |
Less is More: Learning to Refine Dialogue History for Personalized Dialogue Generation. Hanxun Zhong, Zhicheng Dou, Yutao Zhu, Hongjin Qian, Ji-Rong Wen | |
Diversifying Neural Dialogue Generation via Negative Distillation. Yiwei Li, Shaoxiong Feng, Bin Sun, Kan Li | |
Learning as Conversation: Dialogue Systems Reinforced for Information Acquisition. Pengshan Cai, Hui Wan, Fei Liu, Mo Yu, hong yu, Sachindra Joshi | |
Enhancing Knowledge Selection for Grounded Dialogues via Document Semantic Graphs. Sha Li, Mahdi Namazifar, Di Jin, Mohit Bansal, Heng Ji, Yang Liu, Dilek Hakkani-Tur |
: Marjan Ghazvininejad | |
LaMemo: Language Modeling with Look-Ahead Memory. Haozhe Ji, Rongsheng Zhang, Zhenyu Yang, Zhipeng Hu, Minlie Huang | |
[TACL] Formal Language Recognition by Hard Attention Transformers: Perspectives from Circuit Complexity. Yiding Hao, Dana Angluin, Robert Evan Frank | |
Symbolic Knowledge Distillation: from General Language Models to Commonsense Models. Peter West, Chandra Bhagavatula, Jack Hessel, Jena D. Hwang, Liwei Jiang, Ronan Le Bras, Ximing Lu, Sean Welleck, Yejin Choi | |
WECHSEL: Effective initialization of subword embeddings for cross-lingual transfer of monolingual language models. Benjamin Minixhofer, Fabian Paischer, Navid Rekabsaz |
Computational Social Science and Cultural Analytics |
[SRW] Again, Dozens of Refugees Drowned: A Computational Study of Political Framing Evoked by Presuppositions. Qi Yu |
Political Ideology and Polarization: A Multi-dimensional Approach. Barea Sinno, Bernardo Oviedo, Katherine Atwell, Malihe Alikhani, Junyi Jessy Li |
Combining Humor and Sarcasm for Improving Political Parody Detection. Xiao Ao, Danae Sanchez Villegas, Daniel Preotiuc-Pietro, Nikolaos Aletras |
Counterfactually Augmented Data and Unintended Bias: The Case of Sexism and Hate Speech Detection. Indira Sen, Mattia Samory, Claudia Wagner, Isabelle Augenstein |
Conceptualizing Treatment Leakage in Text-based Causal Inference. Adel Daoud, Connor Thomas Jerzak, Richard Johansson |
[Findings] DISARM: Detecting the Victims Targeted by Harmful Memes. Shivam Sharma, Md Shad Akhtar, Preslav Nakov, Tanmoy Chakraborty |
[Findings] Analyzing the Intensity of Complaints on Social Media. MING FANG, Shi Zong, Jing Li, Xinyu Dai, Shujian Huang, Jiajun Chen |
[Findings] CRUSH: Contextually Regularized and User anchored Self-supervised Hate speech Detection. Souvic Chakraborty, Parag Dutta, Sumegh Roychowdhury, Animesh Mukherjee |
Efficient Methods in NLP |
[SRW] Impact of Training Instance Selection on Domain-Specific Entity Extraction using BERT. Eileen Salhofer, Xing Lan Liu, Roman Kern |
Leaner and Faster: Two-Stage Model Compression for Lightweight Text-Image Retrieval. Siyu Ren, Kenny Q. Zhu |
Exact Paired-Permutation Testing for Structured Test Statistics. Ran Zmigrod, Tim Vieira, Ryan Cotterell |
[Findings] Efficient Learning of Multiple NLP Tasks via Collective Weight Factorization on BERT. Christos Charalampos Papadopoulos, Yannis Panagakis, Manolis Koubarakis, Mihalis Nicolaou |
[Findings] RAIL-KD: RAndom Intermediate Layer Mapping for Knowledge Distillation. Md Akmal Haidar, NITHIN ANCHURI, Mehdi Rezagholizadeh, Abbas Ghaddar, Philippe Langlais, Pascal Poupart |
[Findings] PCEE-BERT: Accelerating BERT Inference via Patient and Confident Early Exiting. Zhen Zhang, Wei Zhu, Jinfan Zhang, Peng Wang, Rize Jin, Tae-Sun Chung |
Ethics, Bias, and Fairness |
[SRW] Text Style Transfer for Bias Mitigation using Masked Language Modeling. Ewoenam Kwaku Tokpo, Toon Calders |
[SRW] Differentially Private Instance Encoding against Privacy Attacks. Shangyu Xie, Yuan Hong |
Triggerless Backdoor Attack for NLP Tasks with Clean Labels. Leilei Gan, Jiwei Li, Tianwei Zhang, Xiaoya Li, Yuxian Meng, Fei Wu, Yi Yang, Shangwei Guo, Chun Fan |
[Findings] An Information-Theoretic Approach and Dataset for Probing Gender Stereotypes in Multilingual Masked Language Models. Victor Steinborn, Philipp Dufter, Haris Jabbar, Hinrich Schuetze |
Human-Centered NLP |
What Makes a Good and Useful Summary? Incorporating Users in Automatic Summarization Research. Maartje Ter Hoeve, Julia Kiseleva, Maarten de Rijke |
[Findings] Quiz Design Task: Helping Teachers Create Quizzes with Automated Question Generation. Philippe Laban, Chien-Sheng Wu, Lidiya Murakhovs'ka, Wenhao Liu, Caiming Xiong |
Language Generation |
[SRW] Methods for Estimating and Improving Robustness of Language Models. Michal Stefanik |
Cross-Domain Detection of GPT-2-Generated Technical Text. Juan Diego Rodriguez, Todd Hay, David Gros, Zain Shamsi, Ravi Srinivasan |
[Findings] Revisiting Generative Commonsense Reasoning: A Pre-Ordering Approach. Chao Zhao, Faeze Brahman, Tenghao Huang, Snigdha Chaturvedi |
[Findings] Learning from Bootstrapping and Stepwise Reinforcement Reward: A Semi-Supervised Framework for Text Style Transfer. Zhengyuan Liu, Nancy F. Chen |
[Findings] Unsupervised Domain Adaptation for Question Generation with DomainData Selection and Self-training. Peide Zhu, Claudia Hauff |
[Findings] Learning Structural Information for Syntax-Controlled Paraphrase Generation. Erguang Yang, Chenglin Bai, Deyi Xiong, Yujie Zhang, Yao Meng, Jinan Xu, Yufeng Chen |
Language Grounding to Vision, Robotics and Beyond |
FOAM: A Follower-aware Speaker Model For Vision-and-Language Navigation. Zi-Yi Dou, Nanyun Peng |
[Findings] KD-VLP: Improving End-to-End Vision-and-Language Pretraining with Object Knowledge Distillation. Yongfei Liu, Chenfei Wu, Shao-Yen Tseng, Vasudev Lal, Xuming He, Nan Duan |
[Findings] Cross-Lingual Cross-Modal Consolidation for Effective Multilingual Video Corpus Moment Retrieval. Jiaheng Liu, Tan Yu, Hanyu Peng, Mingming Sun, Ping Li |
Language Resources and Evaluation |
Beyond Emotion: A Multi-Modal Dataset for Human Desire Understanding. Ao Jia, Yu He, Yazhou Zhang, Sagar Uprety, Dawei Song, Christina Lioma |
Are All the Datasets in Benchmark Necessary? A Pilot Study of Dataset Evaluation for Text Classification. Yang Xiao, Jinlan Fu, See-Kiong Ng, Pengfei Liu |
[Findings] MultiNERD: A Multilingual, Multi-Genre and Fine-Grained Dataset for Named Entity Recognition (and Disambiguation). Simone Tedeschi, Roberto Navigli |
[Findings] ID10M: Idiom Identification in 10 Languages. Simone Tedeschi, Federico Martelli, Roberto Navigli |
Machine Learning for NLP: Classification and Structured Prediction Models |
CIAug: Equipping Interpolative Augmentation with Curriculum Learning. Ramit Sawhney, Ritesh Singh Soun, Shrey Pandit, Megh Thakkar, Sarvagya Malaviya, Yuval Pinter |
EPiDA: An Easy Plug-in Data Augmentation Framework for High Performance Text Classification. Minyi Zhao, Lu Zhang, Yi Xu, Jiandong Ding, Jihong Guan, Shuigeng Zhou |
NLP Applications |
Enhancing Self-Attention with Knowledge-Assisted Attention Maps. Jiangang Bai, Yujing Wang, Hong Sun, Ruonan Wu, Tianmeng Yang, Pengfei Tang, Defu Cao, Mingliang Zhang, Yunhai Tong, Yaming Yang, Jing Bai, Ruofei Zhang, Hao Sun, Wei Shen |
Natural Language Inference with Self-Attention for Veracity Assessment of Pandemic Claims. Miguel Arana-Catania, Elena Kochkina, Arkaitz Zubiaga, Maria Liakata, Robert Procter, Yulan He |
ValCAT: Variable-Length Contextualized Adversarial Transformations Using Encoder-Decoder Language Model. Chuyun Deng, Mingxuan Liu, Yue Qin, Jia Zhang, Hai-Xin Duan, Donghong Sun |
DynamicTOC: Persona-based Table of Contents for Consumption of Long Documents. Himanshu Maheshwari, Nethraa Sivakumar, Shelly Jain, Tanvi Karandikar, Vinay Aggarwal, Navita Goyal, Sumit Shekhar |
Non-Autoregressive Chinese ASR Error Correction with Phonological Training. Zheng Fang, Ruiqing Zhang, Zhongjun He, Hua Wu, Yanan Cao |
[Findings] Measuring and Improving Compositional Generalization in Text-to-SQL via Component Alignment. Yujian Gan, Xinyun Chen, Qiuping Huang, Matthew Purver |
[Findings] CODE-MVP: Learning to Represent Source Code from Multiple Views with Contrastive Pre-Training. Xin Wang, Yasheng Wang, Yao Wan, Jiawei Wang, Pingyi Zhou, Li Li, Hao Wu, Jin Liu |
[Findings] Unbiased Math Word Problems Benchmark for Mitigating Solving Bias. ZhiCheng Yang, Jinghui Qin, Jiaqi Chen, Xiaodan Liang |
[Findings] Pathway2Text: Dataset and Method for Biomedical Pathway Description Generation. Junwei Yang, Zequn Liu, Ming Zhang, Sheng Wang |
[Findings] D2GCLF: Document-to-Graph Classifier for Legal Document Classification. Qiqi Wang, Kaiqi Zhao, Robert Amor, Benjamin Liu, Ruofan Wang |
[Findings] Explore More Guidance: A Task-aware Instruction Network for Sign Language Translation Enhanced with Data Augmentation. Yong Cao, Wei Li, Xianzhi Li, Min Chen, Guangyong Chen, Long Hu, Zhengdao Li, Kai Hwang |
[Findings] Towards Job-Transition-Tag Graph for a Better Job Title Representation Learning. Jun ZHU, CELINE HUDELOT |
Question Answering |
[SRW] Eliciting Complex Relational Knowledge From Masked Language Models. Arun Sundaresan, Ming Hsu, Zhihao Zhang |
Understand before Answer: Improve Temporal Reading Comprehension via Precise Question Understanding. Hao Huang, Xiubo Geng, Guodong Long, Daxin Jiang |
Re2G: Retrieve, Rerank, Generate. Michael Glass, Gaetano Rossiello, Md Faisal Mahbub Chowdhury, Ankita Naik, Pengshan Cai, Alfio Gliozzo |
[Findings] Seeing the wood for the trees: a contrastive regularization method for the low-resource Knowledge Base Question Answering. Junping Liu, Shijie Mei, Xinrong Hu, Xun Yao, JACK Yang, Yi Guo |
[Findings] To Answer or Not To Answer? Improving Machine Reading Comprehension Model with Span-based Contrastive Learning. Yunjie Ji, Liangyu Chen, Chenxiao Dou, Baochang Ma, Xiangang Li |
[Findings] All Information is Valuable: Question Matching over Full Information Transmission Network. Le Qi, Yu Zhang, Qingyu Yin, Guidong Zheng, wen junjie, Jinlong Li, Ting Liu |
[Findings] $Great~Truths~are ~Always ~Simple:$ A Rather Simple Knowledge Encoder for Enhancing the Commonsense Reasoning Capacity of Pre-Trained Models. Jinhao Jiang, Kun Zhou, Ji-Rong Wen, Xin Zhao |
[Findings] Capturing Conversational Interaction for Question Answering via Global History Reasoning. Jin Qian, Bowei Zou, Mengxing Dong, Xiao Li, AiTi Aw, Yu Hong |
[Findings] Continual Machine Reading Comprehension via Uncertainty-aware Fixed Memory and Adversarial Domain Adaptation. Zhijing Wu, Hua Xu, Jingliang Fang, Kai Gao |
Semantics: Sentence-level Semantics and Textual Inference |
Label Definitions Improve Semantic Role Labeling. Li Zhang, Ishan Jindal, Yunyao Li |
Sentiment Analysis and Stylistic Analysis |
[SRW] Static and Dynamic Speaker Modeling based on Graph Neural Network for Emotion Recognition in Conversation. Prakhar Saxena, Yin Jou Huang, Sadao Kurohashi |
Aspect Is Not You Need: No-aspect Differential Sentiment Framework for Aspect-based Sentiment Analysis. Jiahao Cao, Rui Liu, Huailiang Peng, Lei Jiang, Xu Bai |
Generative Cross-Domain Data Augmentation for Aspect and Opinion Co-Extraction. Junjie Li, Jianfei Yu, Rui Xia |
[Findings] A Dual-Channel Framework for Sarcasm Recognition by Detecting Sentiment Conflict. Yiyi Liu, Yequan Wang, Aixin Sun, Xuying Meng, Jing Li, Jiafeng Guo |
[Findings] CLMLF:A Contrastive Learning and Multi-Layer Fusion Method for Multimodal Sentiment Detection. Zhen Li, Bing Xu, Conghui Zhu, Tiejun Zhao |
Speech |
[SRW] Towards Unsupervised Speech Synthesis. Alexander H. Liu, Cheng-I Lai, James R. Glass |
[SRW] Investigating the effectiveness of various speaker embeddings for multi-speaker end-to-end speech synthesis system using small-sized speech data. Sheng-Yao Wang, Yi-Chin Huang |
[SRW] Multiformer: A Head-Configurable Transformer-Based Model for Direct Speech Translation. Gerard Sant, Gerard I. Gállego, Belen Alastruey, Marta Ruiz Costa-jussà |
Quantifying Language Variation Acoustically with Few Resources. Martijn Bartelds, Martijn Wieling |
[Findings] FAtNet: Cost-Effective Approach Towards Mitigating the Linguistic Bias in Speaker Verification Systems. Divya V Sharma, Arun Balaji Buduru |
Syntax: Tagging, Chunking, and Parsing |
[SRW] Simulating Feature Structures with Simple Types. Valentin D. Richard |
[Findings] Penn-Helsinki Parsed Corpus of Early Modern English: First Parsing Results and Analysis. Seth Kulick, Neville Ryant, Beatrice Santorini |
[Findings] SHARP: Search-Based Adversarial Attack for Structured Prediction. Liwen Zhang, Zixia Jia, Wenjuan Han, Zilong Zheng, Kewei Tu |
: Yonatan Belinkov | |
Exploiting Inductive Bias in Transformers for Unsupervised Disentanglement of Syntax and Semantics with VAEs. Ghazi Felhi, Joseph Le Roux, Djamé Seddah | |
Time Waits for No One! Analysis and Challenges of Temporal Misalignment. Kelvin Luu, Daniel Khashabi, Suchin Gururangan, Karishma Mandyam, Noah Smith | |
What do Toothbrushes do in the Kitchen? How Transformers Think our World is Structured. Alexander Henlein, Alexander Mehler | |
A Study of the Attention Abnormality in Trojaned BERTs. Weimin Lyu, Songzhu Zheng, Tengfei Ma, Chao Chen |
: Lu Wang | |
Mitigating Toxic Degeneration with Empathetic Data: Exploring the Relationship Between Toxicity and Empathy. Allison Lahnala, Charles Welch, Béla Neuendorf, Lucie Flek | |
Hatemoji: A Test Suite and Adversarially-Generated Dataset for Benchmarking and Detecting Emoji-Based Hate. Hannah Rose Kirk, Bertie Vidgen, Paul Rottger, Tristan Thrush, Scott A. Hale | |
A Holistic Framework for Analyzing the COVID-19 Vaccine Debate. Maria Leonor Pacheco, Tunazzina Islam, Monal Mahajan, Andrey Shor, Ming Yin, Lyle Ungar, Dan Goldwasser | |
Hate Speech and Counter Speech Detection: Conversational Context Does Matter. Xinchen Yu, Eduardo Blanco, Lingzi Hong |
: Cissi Alm | |
[TACL] Uncertainty Estimation and Reduction of Pre-trained Models for Text Regression. Yuxia Wang, Daniel Beck, Timothy Baldwin, Karin Verspoor | |
[TACL] Heterogeneous Supervised Topic Models. Dhanya Sridhar, Hal Daumé III, David Meir Blei | |
Two Contrasting Data Annotation Paradigms for Subjective NLP Tasks. Paul Rottger, Bertie Vidgen, Dirk Hovy, Janet B. Pierrehumbert | |
On the Machine Learning of Ethical Judgments from Natural Language. Zeerak Talat, Hagen Blix, Josef Valvoda, Maya Indira Ganesh, Ryan Cotterell, Adina Williams |
: Graham Neubig | |
SURF: Semantic-level Unsupervised Reward Function for Machine Translation. Atijit Anuchitanukul, Julia Ive | |
Reducing Disambiguation Biases in NMT by Leveraging Explicit Word Sense Information. Niccolò Campolungo, Tommaso Pasini, Denis Emelin, Roberto Navigli | |
Jam or Cream First? Modeling Ambiguity in Neural Machine Translation with SCONES. Felix Stahlberg, Shankar Kumar | |
Generating Authentic Adversarial Examples beyond Meaning-preserving with Doubly Round-trip Translation. Siyu Lai, Zhen Yang, Fandong Meng, Xue Zhang, Yufeng Chen, Jinan Xu, Jie Zhou |
: Jinho Choi | |
Fine-tuning Pre-trained Language Models for Few-shot Intent Detection: Supervised Pre-training and Isotropization. Haode Zhang, Haowen Liang, Yuwei Zhang, Li-Ming Zhan, Xiao-Ming Wu, Xiaolei Lu, Albert Y.S. Lam | |
You Don’t Know My Favorite Color: Preventing Dialogue Representations from Revealing Speakers’ Private Personas. Haoran Li, Yangqiu Song, Lixin Fan | |
Unsupervised Slot Schema Induction for Task-oriented Dialog. Dian Yu, Mingqiu Wang, Yuan Cao, Izhak Shafran, Laurent El Shafey, Hagen Soltau | |
CHAI: A CHatbot AI for Task-Oriented Dialogue with Offline Reinforcement Learning. Siddharth Verma, Justin Fu, Sherry Yang, Sergey Levine |
: Sopan Khosla | |
Prompt Waywardness: The Curious Case of Discretized Interpretation of Continuous Prompts. Daniel Khashabi, Xinxi Lyu, Sewon Min, Lianhui Qin, Kyle Richardson, Sean Welleck, Hannaneh Hajishirzi, Tushar Khot, Ashish Sabharwal, Sameer Singh, Yejin Choi | |
MetaICL: Learning to Learn In Context. Sewon Min, Mike Lewis, Luke Zettlemoyer, Hannaneh Hajishirzi | |
Learning To Retrieve Prompts for In-Context Learning. Ohad Rubin, Jonathan Herzig, Jonathan Berant | |
IDPG: An Instance-Dependent Prompt Generation Method. Zhuofeng Wu, Sinong Wang, Jiatao Gu, Rui Hou, Yuxiao Dong, V.G.Vinod Vydiswaran, Hao Ma |
Discourse and Pragmatics |
Incorporating Centering Theory into Neural Coreference Resolution. Haixia Chai, Michael Strube |
Efficient Methods in NLP |
[Findings] ALLSH: Active Learning Guided by Local Sensitivity and Hardness. Shujian Zhang, Chengyue Gong, Xingchao Liu, Pengcheng He, Weizhu Chen, Mingyuan Zhou |
Ethics, Bias, and Fairness |
Easy Adaptation to Mitigate Gender Bias in Multilingual Text Classification. Xiaolei Huang |
Socially Aware Bias Measurements for Hindi Language Representations. Vijit Malik, Sunipa Dev, Akihiro Nishi, Nanyun Peng, Kai-Wei Chang |
Recognition of They/Them as Singular Personal Pronouns in Coreference Resolution. Connor Baumler, Rachel Rudinger |
Information Extraction |
[SRW] Dr. Livingstone, I presume? Polishing of foreign character identification in literary texts. Aleksandra Konovalova, Antonio Toral, Kristiina Taivalkoski-Shilov |
[SRW] CSSS: A Novel Candidate Summary Selection Strategy for Summary-level Extractive Summarization. Shuai Gong, Zhenfang Zhu, Wenqing Wu, Zhen Zhao, Dianyuan Zhang |
EmRel: Joint Representation of Entities and Embedded Relations for Multi-triple Extraction. Benfeng Xu, Quan Wang, Yajuan Lyu, Yabing Shi, Yong Zhu, Jie Gao, Zhendong Mao |
CompactIE: Compact Facts in Open Information Extraction. Farima Fatahi Bayat, Nikita Bhutani, H. Jagadish |
Document-Level Relation Extraction with Sentences Importance Estimation and Focusing. Wang Xu, Kehai Chen, Lili Mou, Tiejun Zhao |
ITA: Image-Text Alignments for Multi-Modal Named Entity Recognition. Xinyu Wang, Min Gui, Yong Jiang, Zixia Jia, Nguyen Bach, Tao Wang, Zhongqiang Huang, Kewei Tu |
[Findings] Hierarchical Relation-Guided Type-Sentence Alignment for Long-Tail Relation Extraction with Distant Supervision. Yang Li, Guodong Long, Tao Shen, Jing Jiang |
[Findings] Good Visual Guidance Make A Better Extractor: Hierarchical Visual Prefix for Multimodal Entity and Relation Extraction. Xiang Chen, Ningyu Zhang, Lei Li, Yunzhi Yao, Shumin Deng, Chuanqi Tan, Fei Huang, Luo Si, Huajun Chen |
[Findings] Label Refinement via Contrastive Learning for Distantly-Supervised Named Entity Recognition. Huaiyuan Ying, Shengxuan Luo, Tiantian Dang, Sheng Yu |
Language Grounding to Vision, Robotics and Beyond |
Disentangled Action Recognition with Knowledge Bases. Zhekun Luo, Shalini Ghosh, Devin Guillory, Keizo Kato, Trevor Darrell, Huijuan Xu |
Twitter-COMMs: Detecting Climate, COVID, and Military Multimodal Misinformation. Giscard Biamby, Grace Luo, Trevor Darrell, Anna Rohrbach |
MCSE: Multimodal Contrastive Learning of Sentence Embeddings. Miaoran Zhang, Marius Mosbach, David Ifeoluwa Adelani, Michael A. Hedderich, Dietrich Klakow |
[Findings] Fine-grained Image Captioning with CLIP Reward. Jaemin Cho, Seunghyun Yoon, Ajinkya Kale, Franck Dernoncourt, Trung Bui, Mohit Bansal |
[Findings] CLEAR: Improving Vision-Language Navigation with Cross-Lingual, Environment-Agnostic Representations. Jialu Li, Hao Tan, Mohit Bansal |
[Findings] What kinds of errors do reference resolution models make and what can we learn from them?. Jorge Sánchez, Mauricio Mazuecos, Hernán Maina, Luciana Benotti |
[Findings] Negative Sample is Negative in Its Own Way: Tailoring Negative Sentences for Image-Text Retrieval. Zhihao Fan, zhongyu wei, Zejun Li, Siyuan Wang, Xuanjing Huang, Jianqing Fan |
[Findings] RoViST: Learning Robust Metrics for Visual Storytelling. Eileen Wang, Caren Han, Josiah Poon |
[Findings] CLEAR: Improving Vision-Language Navigation with Cross-Lingual, Environment-Agnostic Representations. Jialu Li, Hao Tan, Mohit Bansal |
Language Resources and Evaluation |
[SRW] Zuo Zhuan Ancient Chinese Dataset for Word Sense Disambiguation. Xiaomeng Pan, Hongfei Wang, Teruaki Oka, Mamoru Komachi |
SwahBERT: Language Model of Swahili. Gati L Martin, Medard Medard Mswahili, Young-Seob Jeong, Jiyoung Woo |
[Findings] BanglaBERT: Language Model Pretraining and Benchmarks for Low-Resource Language Understanding Evaluation in Bangla. Abhik Bhattacharjee, Tahmid Hasan, Wasi Uddin Ahmad, Kazi Samin Mubasshir, Md Saiful Islam, Anindya Iqbal, M. Sohel Rahman, Rifat Shahriyar |
[Findings] EVI: Multilingual Spoken Dialogue Tasks and Dataset for Knowledge-Based Enrolment, Verification, and Identification. Georgios P. Spithourakis, Ivan Vulić, Michał Lis, Inigo Casanueva, Paweł Budzianowski |
[Findings] Detecting Narrative Elements in Informational Text. Effi Levi, Guy Mor, Tamir Sheafer, Shaul Rafael Shenhav |
Multilinguality |
Pretrained Models for Multilingual Federated Learning. Orion Weller, Marc Marone, Vladimir Braverman, Dawn Lawrie, Benjamin Van Durme |
BAD-X: Bilingual Adapters Improve Zero-Shot Cross-Lingual Transfer. Marinela Parović, Goran Glavaš, Ivan Vulić, Anna Korhonen |
Towards Debiasing Translation Artifacts. KOEL DUTTA CHOWDHURY, Rricha Jalota, Cristina España-Bonet, Josef van Genabith |
[Findings] FreeTransfer-X: Safe and Label-Free Cross-Lingual Transfer from Off-the-Shelf Models. Yinpeng Guo, Liangyou Li, Xin Jiang, Qun Liu |
[Findings] Uncertainty-Aware Cross-Lingual Transfer with Pseudo Partial Labels. Shuo Lei, Xuchao Zhang, Jianfeng He, Fanglan Chen, Chang-Tien Lu |
[Findings] Multi-stage Distillation Framework for Cross-Lingual Semantic Similarity Matching. Kunbo Ding, Weijie Liu, Yuejian Fang, Zhe Zhao, Qi Ju, Xuefeng Yang, Rong Tian, Zhu Tao, Haoyan Liu, Han Guo, Xingyu Bai, Weiquan Mao, Yudong Li, Weigang Guo, Taiqiang Wu, Ningyuan Sun |
[Findings] How to Translate Your Samples and Choose Your Shots? Analyzing Translate-train & Few-shot Cross-lingual Transfer. Iman Jundi, Gabriella Lapesa |
NLP Applications |
[SRW] Understanding Long Document with Different Position-Aware Attentions. Hai Pham, Guoxin Wang, Yijuan Lu, Dinei Florencio, Cha Zhang |
A Shoulder to Cry on: Towards A Motivational Virtual Assistant for Assuaging Mental Agony. Tulika Saha, Saichethan Miriyala Reddy, Anindya Sundar Das, Sriparna Saha, Pushpak Bhattacharyya |
TWEETSPIN: Fine-grained Propaganda Detection in Social Media Using Multi-View Representations. Prashanth Vijayaraghavan, Soroush Vosoughi |
Question Answering |
Cooperative Self-training of Machine Reading Comprehension. Hongyin Luo, Shang-Wen Li, Mingye Gao, Seunghak Yu, James R. Glass |
Ask Me Anything in Your Native Language. Nikita Sorokin, Dmitry Abulkhanov, Irina Piontkovskaya, Valentin Malykh |
Yes, No or IDK: The Challenge of Unanswerable Yes/No Questions. Elior Sulem, Jamaal Hay, Dan Roth |
DREAM: Improving Situational QA by First Elaborating the Situation. Yuling Gu, Bhavana Dalvi, Peter Clark |
OPERA: Operation-Pivoted Discrete Reasoning over Text. Yongwei Zhou, Junwei Bao, Chaoqun Duan, Haipeng Sun, jiahui liang, Yifan Wang, Jing Zhao, Youzheng Wu, Xiaodong He, Tiejun Zhao |
TIE: Topological Information Enhanced Structural Reading Comprehension on Web Pages. Zihan Zhao, Lu Chen, Ruisheng Cao, Hongshen Xu, Xingyu Chen, Kai Yu |
Long Context Question Answering via Supervised Contrastive Learning. Avi Caciularu, Ido Dagan, Jacob Goldberger, Arman Cohan |
Dynamic Multistep Reasoning based on Video Scene Graph for Video Question Answering. JianGuo Mao, Wenbin Jiang, Xiangdong Wang, Zhifan Feng, Yajuan Lyu, Hong Liu, Yong Zhu |
A New Concept of Knowledge based Question Answering (KBQA) System for Multi-hop Reasoning. Yu Wang, Vijay Srinivasan, Hongxia Jin |
ProQA: Structural Prompt-based Pre-training for Unified Question Answering. Wanjun Zhong, Yifan Gao, Ning Ding, Yujia Qin, Zhiyuan Liu, Ming Zhou, Jiahai Wang, Jian Yin, Nan Duan |
[Findings] Multi-Hop Open-Domain Question Answering over Structured and Unstructured Knowledge. Yue Feng, Zhen Han, Mingming Sun, Ping Li |
[Findings] Crake: Causal-Enhanced Table-Filler for Question Answering over Large Scale Knowledge Base. Minhao Zhang, Ruoyu Zhang, Yanzeng Li, Lei Zou |
Semantics: Sentence-level Semantics and Textual Inference |
Paragraph-based Transformer Pre-training for Multi-Sentence Inference. Luca Di Liello, Siddhant Garg, Luca Soldaini, Alessandro Moschitti |
Few-Shot Semantic Parsing with Language Models Trained on Code. Richard Shin, Benjamin Van Durme |
Summarization |
[SRW] Few-shot fine-tuning SOTA summarization models for medical dialogues. David Fraile Navarro, Mark Dras, Shlomo Berkovsky |
Reference-free Summarization Evaluation via Semantic Correlation and Compression Ratio. Yizhu Liu, Qi Jia, Kenny Q. Zhu |
[Findings] Data Augmentation for Low-Resource Dialogue Summarization. Yongtai Liu, Joshua Maynez, Gonçalo Simões, Shashi Narayan |
[Findings] OTExtSum: Extractive Text Summarisation with Optimal Transport. Peggy Tang, Kun Hu, Rui Yan, Lei Zhang, Junbin Gao, Zhiyong Wang |
[Findings] Exploring Neural Models for Query-Focused Summarization. Jesse Vig, Alexander Fabbri, Wojciech Maciej Kryscinski, Chien-Sheng Wu, Wenhao Liu |
[Findings] Post-Training Dialogue Summarization using Pseudo-Paraphrasing. Qi Jia, Yizhu Liu, Haifeng Tang, Kenny Q. Zhu |
[Findings] TANet: Thread-Aware Pretraining for Abstractive Conversational Summarization. Ze Yang, Christian WANG, Zhoujin Tian, Wei Wu, Zhoujun Li |
[Findings] Jointly Learning Guidance Induction and Faithful Summary Generation via Conditional Variational Autoencoders. Wang Xu, Tiejun Zhao |
: Vinay Bannihatti Kumar | |
FRUIT: Faithfully Reflecting Updated Information in Text. Robert L. Logan IV, Alexandre Tachard Passos, Sameer Singh, Ming-Wei Chang | |
Persona-Guided Planning for Controlling the Protagonist’s Persona in Story Generation. Zhexin Zhang, Jiaxin Wen, Jian Guan, Minlie Huang | |
Fuse It More Deeply! A Variational Transformer with Layer-Wise Latent Variable Inference for Text Generation. Jinyi Hu, Xiaoyuan Yi, Wenhao Li, Maosong Sun, Xing Xie | |
Overcoming Catastrophic Forgetting During Domain Adaptation of Seq2seq Language Generation. Dingcheng Li, Zheng Chen, Eunah Cho, Jie Hao, Xiaohu Liu, Xing Fan, Chenlei Guo, Yang Liu | |
Zero-shot Sonnet Generation with Discourse-level Planning and Aesthetics Features. Yufei Tian, Nanyun Peng |
: Brian Roark | |
Textless Speech-to-Speech Translation on Real Data. Ann Lee, Hongyu Gong, Paul-Ambroise Duquenne, Holger Schwenk, Peng-Jen Chen, Changhan Wang, Sravya Popuri, Yossi Adi, Juan Pino, Jiatao Gu, Wei-Ning Hsu | |
Unsupervised Stem-based Cross-lingual Part-of-Speech Tagging for Morphologically Rich Low-Resource Languages. Ramy Eskander, Cass Lowry, Sujay Khandagale, Judith Lynn Klavans, Maria Polinsky, Smaranda Muresan | |
Quantifying Synthesis and Fusion and their Impact on Machine Translation. Arturo Oncevay, Duygu Ataman, Niels van Berkel, Barry Haddow, Alexandra Birch, Johannes Bjerva | |
On the Use of External Data for Spoken Named Entity Recognition. Ankita Pasad, Felix Wu, Suwon Shon, Karen Livescu, Kyu Han | |
Empathic Machines: Using Intermediate Features as Levers to Emulate Emotions in Text-To-Speech Systems. Saiteja Kosgi, Sarath Sivaprasad, Niranjan Pedanekar, Anil Kumar Nelakanti, Vineet Gandhi |
: Ben Van Durme | |
Improving Entity Disambiguation by Reasoning over a Knowledge Base. Tom Ayoola, Joseph Fisher, Andrea Pierleoni | |
DocTime: A Document-level Temporal Dependency Graph Parser. Puneet Mathur, Vlad I Morariu, Verena Kaynig-Fittkau, Jiuxiang Gu, Franck Dernoncourt, Quan Hung Tran, Ani Nenkova, Dinesh Manocha, Rajiv Jain | |
SAIS: Supervising and Augmenting Intermediate Steps for Document-Level Relation Extraction. Yuxin Xiao, Zecheng Zhang, Yuning Mao, Carl Yang, Jiawei Han | |
Should We Rely on Entity Mentions for Relation Extraction? Debiasing Relation Extraction with Counterfactual Analysis. Yiwei Wang, Muhao Chen, Wenxuan Zhou, Yujun Cai, Yuxuan Liang, Dayiheng Liu, Baosong Yang, Juncheng Liu, Bryan Hooi | |
MINION: a Large-Scale and Diverse Dataset for Multilingual Event Detection. Amir Pouran Ben Veyseh, Minh Van Nguyen, Franck Dernoncourt, Thien Huu Nguyen | |
GenIE: Generative Information Extraction. Martin Josifoski, Nicola De Cao, Maxime Peyrard, Fabio Petroni, Robert West |
: Swabha Swayamdipta | |
Models in the Loop: Aiding Crowdworkers with Generative Annotation Assistants. Max Bartolo, Tristan Thrush, Sebastian Riedel, Pontus Stenetorp, Robin Jia, Douwe Kiela | |
MultiCite: Modeling realistic citations requires moving beyond the single-sentence single-label setting. Anne Lauscher, Brandon Ko, Bailey Kuehl, Sophie Johnson, Arman Cohan, David Jurgens, Kyle Lo | |
NewsEdits: A News Article Revision Dataset and a Novel Document-Level Reasoning Challenge. Alexander Spangher, Xiang Ren, Jonathan May, Nanyun Peng | |
Explaining Dialogue Evaluation Metrics using Adversarial Behavioral Analysis. Baber Khalid, SUNGJIN LEE | |
Answer Consolidation: Formulation and Benchmarking. Wenxuan Zhou, Qiang Ning, Heba Elfardy, Kevin Small, Muhao Chen | |
[TACL] Data-driven Model Generalizability in Crosslinguistic Low-resource Morphological Segmentation. Zoey Liu, Emily Prud’hommeau |
: Mohit Iyyer | |
Efficient Constituency Tree based Encoding for Natural Language to Bash Translation. Shikhar Bharadwaj, Shirish Shevade | |
Multi-Relational Graph Transformer for Automatic Short Answer Grading. Rajat Agarwal, Varun Khurana, Karish Grover, Mukesh Mohania, Vikram Goyal | |
ConfliBERT: A Pre-trained Language Model for Political Conflict and Violence. Yibo Hu, MohammadSaleh Hosseini, Erick Skorupa Parolin, Javier Osorio, Latifur Khan, Patrick Brandt, Vito D'Orazio | |
Semantically Informed Slang Interpretation. Zhewei Sun, Richard Zemel, Yang Xu | |
Don’t sweat the small stuff, classify the rest: Sample Shielding to protect text classifiers against adversarial attacks. Jonathan Rusert, Padmini Srinivasan | |
GRAM: Fast Fine-tuning of Pre-trained Language Models for Content-based Collaborative Filtering. Yoonseok Yang, Kyu Seok Kim, Minsam Kim, Juneyoung Park |
Computational Social Science and Cultural Analytics |
[Findings] Modeling Ideological Salience and Framing in Polarized Online Groups with Graph Neural Networks and Structured Sparsity. Valentin Hofmann, Xiaowen Dong, Janet B. Pierrehumbert, Hinrich Schuetze |
[Findings] HUE: Pretrained Model and Dataset for Understanding Hanja Documents of Ancient Korea. Haneul Yoo, Jiho Jin, Juhee Son, JinYeong Bak, Kyunghyun Cho, Alice Oh |
Dialogue and Interactive Systems |
[Findings] Empathetic Persuasion: Reinforcing Empathy and Persuasiveness in Dialogue Systems. Azlaan Mustafa Samad, Kshitij Mishra, Mauajama Firdaus, Asif Ekbal |
[Findings] Target-Guided Dialogue Response Generation Using Commonsense and Data Augmentation. Prakhar Gupta, Harsh Jhamtani, Jeffrey Bigham |
[Findings] Balancing Multi-Domain Corpora Learning for Open-Domain Response Generation. Yujie Xing, Jinglun Cai, Nils Barlaug, Peng Liu, Jon Atle Gulla |
[Findings] Context-Aware Language Modeling for Goal-Oriented Dialogue Systems. Charlie Victor Snell, Sherry Yang, Justin Fu, Yi Su, Sergey Levine |
[Findings] KETOD: Knowledge-Enriched Task-Oriented Dialogue. Zhiyu Chen, Bing Liu, Seungwhan Moon, Chinnadhurai Sankar, Paul A. Crook, William Yang Wang |
Discourse and Pragmatics |
[Findings] Improve Discourse Dependency Parsing with Contextualized Representations. Yifei Zhou, Yansong Feng |
Efficient Methods in NLP |
[Findings] Empowering parameter-efficient transfer learning by recognizing the kernel structure in self-attention. Yifan Chen, Devamanyu Hazarika, Mahdi Namazifar, Yang Liu, Di Jin, Dilek Hakkani-Tur |
[Findings] Great Power, Great Responsibility: Recommendations for Reducing Energy for Training Language Models. Joseph McDonald, Baolin Li, Nathan C. Frey, Devesh Tiwari, Vijay Gadepally, Siddharth Samsi |
[Findings] AdapterBias: Parameter-efficient Token-dependent Representation Shift for Adapters in NLP Tasks. Chin-Lun Fu, Zih-Ching Chen, Yun-Ru Lee, Hung-yi Lee |
[Findings] LongT5: Efficient Text-To-Text Transformer for Long Sequences. Mandy Guo, Joshua Ainslie, David Uthus, Santiago Ontanon, Jianmo Ni, Yun-Hsuan Sung, Yinfei Yang |
[Findings] LM-CORE: Language Models with Contextually Relevant External Knowledge. Jivat Neet Kaur, Sumit Bhatia, Milan Aggarwal, Rachit Bansal, Balaji Krishnamurthy |
Ethics, Bias, and Fairness |
[Findings] Cross-Domain Classification of Moral Values. Enrico Liscio, Alin Eugen Dondera, Andrei Geadau, Catholijn M Jonker, Pradeep Kumar Murukannaiah |
[Findings] On Measuring Social Biases in Prompt-Based Multi-Task Learning. Afra Feyza Akyürek, Sejin Paik, Muhammed Yusuf Kocyigit, Seda Akbiyik, Serife Leman Runyun, Derry Wijaya |
Interpretability and Analysis of Models for NLP |
[Findings] Few-Shot Self-Rationalization with Natural Language Prompts. Ana Marasovic, Iz Beltagy, Doug Downey, Matthew E Peters |
[Findings] Entailment Tree Explanations via Iterative Retrieval-Generation Reasoner. Danilo Neves Ribeiro, Shen Wang, Xiaofei Ma, Rui Dong, Xiaokai Wei, Henghui Zhu, Xinchi Chen, Peng Xu, zhiheng huang, Andrew Arnold, Dan Roth |
[Findings] Identifying and Mitigating Spurious Correlations for Improving Robustness in NLP Models. Tianlu Wang, Rohit Sridhar, Diyi Yang, Xuezhi Wang |
[Findings] Exploring the Universal Vulnerability of Prompt-based Learning Paradigm. Lei Xu, Yangyi Chen, Ganqu Cui, Hongcheng Gao, Zhiyuan Liu |
[Findings] On the Limitations of Dataset Balancing: The Lost Battle Against Spurious Correlations. Roy Schwartz, Gabriel Stanovsky |
[Findings] Beyond Distributional Hypothesis: Let Language Models Learn Meaning-Text Correspondence. M.J Jang, Frank Martin Mtumbuka, Thomas Lukasiewicz |
Machine Learning for NLP: Classification and Structured Prediction Models |
[Findings] 'Diversity and Uncertainty in Moderation'' are the Key to Data Selection for Multilingual Few-shot Transfer. Shanu Kumar, Sandipan Dandapat, Monojit Choudhury |
Machine Learning for NLP: Language Modeling and Sequence to Sequence Models |
[Findings] Entity Cloze By Date: What LMs Know About Unseen Entities. Yasumasa Onoe, Michael JQ Zhang, Eunsol Choi, Greg Durrett |
[Findings] Masked Measurement Prediction: Learning to Jointly Predict Quantities and Units from Textual Context. Daniel Spokoyny, Ivan Lee, Zhao Jin, Taylor Berg-Kirkpatrick |
[Findings] Learning Rich Representation of Keyphrases from Text. Mayank Kulkarni, Debanjan Mahata, Ravneet Singh Arora, Rajarshi Bhowmik |
[Findings] Temporal Attention for Language Models. Guy D. Rosin, Kira Radinsky |
[Findings] Lacuna Reconstruction: Self-Supervised Pre-Training for Low-Resource Historical Document Transcription. Nikolai Vogler, Jonathan Parkes Allen, Matthew Thomas Miller, Taylor Berg-Kirkpatrick |
[Findings] Hierarchical Transformers Are More Efficient Language Models. Piotr Nawrot, Szymon Tworkowski, Michał Tyrolski, Lukasz Kaiser, Yuhuai Wu, Christian Szegedy, Henryk Michalewski |
Multilinguality |
[Findings] DOCmT5: Document-Level Pretraining of Multilingual Language Models. Chia-Hsuan Lee, Aditya Siddhant, Viresh Ratnakar, Melvin Johnson |
[Findings] Por Qué Não Utiliser Alla Språk? Mixed Training with Gradient Optimization in Few-Shot Cross-Lingual Transfer. Haoran Xu, Kenton Murray |
[Findings] MTG: A Benchmark Suite for Multilingual Text Generation. Yiran Chen, Zhenqiao Song, Xianze Wu, Danqing Wang, Jingjing Xu, Jiaze Chen, Hao Zhou, Lei Li |
Question Answering |
[Findings] MWP-BERT: Numeracy-Augmented Pre-training for Math Word Problem Solving. Zhenwen Liang, Jipeng Zhang, Lei Wang, Wei QIN, Yunshi Lan, Jie Shao, Xiangliang Zhang |
[Findings] Exploiting Numerical-Contextual Knowledge to Improve Numerical Reasoning in Question Answering. Jeonghwan Kim, Junmo Kang, Kyung-min Kim, Giwon Hong, Sung-Hyon Myaeng |
[Findings] METGEN: A Module-Based Entailment Tree Generation Framework for Answer Explanation. Ruixin Hong, Hongming Zhang, Xintong Yu, Changshui Zhang |
[Findings] Challenges in Generalization in Open Domain Question Answering. Linqing Liu, Patrick Lewis, Sebastian Riedel, Pontus Stenetorp |
[Findings] CCQA: A New Web-Scale Question Answering Dataset for Model Pre-Training. Patrick Huber, Armen Aghajanyan, Barlas Oguz, Dmytro Okhonko, Scott Yih, Sonal Gupta, Xilun Chen |
[Findings] UniK-QA: Unified Representations of Structured and Unstructured Knowledge for Open-Domain Question Answering. Barlas Oguz, Xilun Chen, Vladimir Karpukhin, Stan Peshterliev, Dmytro Okhonko, Michael Sejr Schlichtkrull, Sonal Gupta, Yashar Mehdad, Scott Yih |
[Findings] PerKGQA: Question Answering over Personalized Knowledge Graphs. Ritam Dutt, Kasturi Bhattacharjee, Rashmi Gangadharaiah, Dan Roth, Carolyn Rose |
Semantics: Lexical Semantics |
[Findings] TaCL: Improving BERT Pre-training with Token-aware Contrastive Learning. Yixuan Su, Fangyu Liu, Zaiqiao Meng, Tian Lan, Lei Shu, Ehsan Shareghi, Nigel Collier |
Semantics: Sentence-level Semantics and Textual Inference |
[Findings] A Question-Answer Driven Approach to Reveal Affirmative Interpretations from Verbal Negations. Md Mosharaf Hossain, Luke Holman, Anusha Kakileti, Tiffany Iris Kao, Nathan Raul Brito, Aaron Abraham Mathews, Eduardo Blanco |
[Findings] The Role of Context in Detecting Previously Fact-Checked Claims. Shaden Shaar, Firoj Alam, Giovanni Da San Martino, Preslav Nakov |
[Findings] SEQZERO: Few-shot Compositional Semantic Parsing with Sequential Prompts and Zero-shot Models. Jingfeng Yang, Haoming Jiang, Qingyu Yin, Danqing Zhang, Bing Yin, Diyi Yang |
[Findings] Weakly Supervised Text-to-SQL Parsing through Question Decomposition. Tomer Wolfson, Daniel Deutch, Jonathan Berant |
Sentiment Analysis and Stylistic Analysis |
[Findings] POLITICS: Pretraining with Same-story Article Comparison for Ideology Prediction and Stance Detection. Yujian Liu, Xinliang Frederick Zhang, David Wegsman, Nicholas Beauchamp, Lu Wang |
[Findings] A Survey on Stance Detection for Mis- and Disinformation Identification. Momchil Hardalov, Arnav Arora, Preslav Nakov, Isabelle Augenstein |
: Diyi Yang | |
Selective Differential Privacy for Language Modeling. Weiyan Shi, Aiqi Cui, Evan Li, Ruoxi Jia, Zhou Yu | |
Federated Learning with Noisy User Feedback. Rahul Sharma, Anil Ramakrishna, Ansel MacLaughlin, Anna Rumshisky, Jimit Majmudar, Clement Chung, Salman Avestimehr, Rahul Gupta | |
Provably Confidential Language Modelling. Xuandong Zhao, Lei Li, Yu-Xiang Wang | |
Optimising Equal Opportunity Fairness in Model Training. Aili Shen, Xudong Han, Trevor Cohn, Timothy Baldwin, Lea Frermann | |
How Gender Debiasing Affects Internal Model Representations, and Why It Matters. Hadas Orgad, Seraphina Goldfarb-Tarrant, Yonatan Belinkov | |
Explaining Toxic Text via Knowledge Enhanced Text Generation. Rohit Sridhar, Diyi Yang |
: Nathan Schneider | |
Falsesum: Generating Document-level NLI Examples for Recognizing Factual Inconsistency in Summarization. Prasetya Ajie Utama, Joshua Bambrick, Nafise Sadat Moosavi, Iryna Gurevych | |
Maximum Bayes Smatch Ensemble Distillation for AMR Parsing. Young-Suk Lee, Ramon Fernandez Astudillo, Hoang Thanh Lam, Tahira Naseem, Radu Florian, Salim Roukos | |
Curriculum: A Broad-Coverage Benchmark for Linguistic Phenomena in Natural Language Understanding. Zeming Chen, Qiyue Gao | |
Syn2Vec: Synset Colexification Graphs for Lexical Semantic Similarity. John Harvill, Roxana Girju, Mark A. Hasegawa-Johnson | |
WiC = TSV = WSD: On the Equivalence of Three Semantic Tasks. Bradley Hauer, Grzegorz Kondrak | |
EASE: Entity-Aware Contrastive Learning of Sentence Embedding. Sosuke Nishikawa, Ryokan Ri, Ikuya Yamada, Yoshimasa Tsuruoka, Isao Echizen |
: Jacob Andreas | |
Generalized Quantifiers as a Source of Error in Multilingual NLU Benchmarks. Ruixiang Cui, Daniel Hershcovich, Anders Søgaard | |
What company do words keep? Revisiting the distributional semantics of J.R. Firth & Zellig Harris. Mikael Brunila, Jack LaViolette | |
Learning the Ordering of Coordinate Compounds and Elaborate Expressions in Hmong, Lahu, and Chinese. Chenxuan Cui, Katherine J. Zhang, David R Mortensen | |
Neural Language Taskonomy: Which NLP Tasks are the most Predictive of fMRI Brain Activity?. SUBBA REDDY OOTA, JASHN ARORA, Veeral Agarwal, mounika marreddy, Manish Gupta, Bapi Raju Surampudi | |
Towards Understanding Large-Scale Discourse Structures in Pre-Trained and Fine-Tuned Language Models. Patrick Huber, Giuseppe Carenini | |
Social Norms Guide Reference Resolution. Mitchell Abrams, Matthias Scheutz |
: Kenton Lee | |
A Structured Span Selector. Tianyu Liu, Yuchen Eleanor Jiang, Ryan Cotterell, Mrinmaya Sachan | |
Entity Linking via Explicit Mention-Mention Coreference Modeling. Dhruv Agarwal, Rico Angell, Nicholas Monath, Andrew McCallum | |
Automatic Multi-Label Prompting: Simple and Interpretable Few-Shot Classification. Han Wang, Canwen Xu, Julian McAuley | |
AcTune: Uncertainty-Based Active Self-Training for Active Fine-Tuning of Pretrained Language Models. Yue Yu, Lingkai Kong, Jieyu Zhang, Rongzhi Zhang, Chao Zhang | |
MoEBERT: from BERT to Mixture-of-Experts via Importance-Guided Adaptation. Simiao Zuo, Qingru Zhang, Chen Liang, Pengcheng He, Tuo Zhao, Weizhu Chen | |
Teaching BERT to Wait: Balancing Accuracy and Latency for Streaming Disfluency Detection. Angelica Chen, Vicky Zayats, Daniel David Walker, Dirk Padfield |
: Avi Sil | |
Learning to Retrieve Passages without Supervision. Ori Ram, Gal Shachaf, Omer Levy, Jonathan Berant, Amir Globerson | |
Interpretable Proof Generation via Iterative Backward Reasoning. Hanhao Qu, Yu Cao, Jun Gao, Liang Ding, Ruifeng Xu | |
MultiSpanQA: A Dataset for Multi-Span Question Answering. Haonan Li, Martin Tomko, Maria Vasardani, Timothy Baldwin | |
Evidentiality-guided Generation for Knowledge-Intensive NLP Tasks. Akari Asai, Matt Gardner, Hannaneh Hajishirzi | |
Modularized Transfer Learning with Multiple Knowledge Graphs for Zero-shot Commonsense Reasoning. Yu Jin Kim, Beong-woo Kwak, Youngwook Kim, Reinald Kim Amplayo, seung-won hwang, Jinyoung Yeo | |
[TACL] MuSiQue: Multi-hop Questions via Single-hop Question Composition. Harsh Trivedi, Niranjan Balasubramanian, Tushar Khot, Ashish Sabharwal |
Human-Centered NLP |
[Findings] Opportunities for Human-centered Evaluation of Machine Translation Systems. Daniel J. Liebling, Katherine A Heller, Samantha Robertson, Wesley Deng |
[Findings] One Size Does Not Fit All: The Case for Personalised Word Complexity Models. Sian Gooding, Manuel Tragut |
[Findings] Aligning Generative Language Models with Human Values. Ruibo Liu, Ge Zhang, Xinyu Feng, Soroush Vosoughi |
[Findings] Design Challenges for a Multi-Perspective Search Engine. Sihao Chen, Siyi Liu, Xander Uyttendaele, Yi Zhang, William Bruno, Dan Roth |
Information Extraction |
[Findings] PromptGen: Automatically Generate Prompts using Generative Models. Yue Zhang, Hongliang Fei, Dingcheng Li, Ping Li |
[Findings] Extracting Temporal Event Relation with Syntax-guided Graph Transformer. SHUAICHENG ZHANG, Qiang Ning, Lifu Huang |
[Findings] StATIK: Structure and Text for Inductive Knowledge Graph Completion. Elan Sopher Markowitz, Keshav Balasubramanian, Mehrnoosh Mirtaheri, Murali Annavaram, Aram Galstyan, Greg Ver Steeg |
[Findings] Permutation Invariant Strategy Using Transformer Encoders for Table Understanding. Sarthak Dash, Sugato Bagchi, Nandana Mihindukulasooriya, Alfio Gliozzo |
[Findings] Self-Training with Differentiable Teacher. Simiao Zuo, Yue Yu, Chen Liang, Haoming Jiang, Siawpeng Er, Chao Zhang, Tuo Zhao, Hongyuan Zha |
[Findings] Low-resource Entity Set Expansion: A Comprehensive Study on User-generated Text. Yutong Shao, Nikita Bhutani, Sajjadur Rahman, Estevam Hruschka |
[Findings] Zero-shot Entity Linking with Less Data. G P Shrivatsa Bhargav, Dinesh Khandelwal, Saswati Dana, Dinesh Garg, Pavan Kapanipathi, Salim Roukos, Alexander Gray, L Venkata Subramaniam |
[Findings] Event Detection for Suicide Understanding. Luis Fernando Guzman-Nateras, Viet Dac Lai, Amir Pouran Ben Veyseh, Franck Dernoncourt, Thien Huu Nguyen |
[Findings] Textual Entailment for Event Argument Extraction: Zero- and Few-Shot with Multi-Source Learning. Oscar Sainz, Itziar Gonzalez-Dios, Oier Lopez de Lacalle, Bonan Min, Eneko Agirre |
[Findings] EA$^2$E: Improving Consistency with Event Awareness for Document-Level Argument Extraction. Qi Zeng, Qiusi Zhan, Heng Ji |
[Findings] Dangling-Aware Entity Alignment with Mixed High-Order Proximities. Juncheng Liu, Zequn Sun, Bryan Hooi, Yiwei Wang, Dayiheng Liu, Baosong Yang, Xiaokui Xiao, Muhao Chen |
Information Retrieval and Text Mining |
[Findings] Literature-Augmented Clinical Outcome Prediction. Aakanksha Naik, Sravanthi Parasa, Sergey Feldman, Lucy Lu Wang, Tom Hope |
[Findings] Retrieval-Augmented Multilingual Keyphrase Generation with Retriever-Generator Iterative Training. Yifan Gao, Qingyu Yin, zheng li, Rui Meng, Tong Zhao, Bing Yin, Irwin King, Michael Lyu |
Language Generation |
[Findings] Controllable Sentence Simplification via Operation Classification. Liam Cripwell, Joël Legrand, Claire Gardent |
[Findings] The Case for a Single Model that can Both Generate Continuations and Fill-in-the-Blank. Daphne Ippolito, Liam Dugan, Emily Reif, Ann Yuan, Andy Coenen, Chris Callison-Burch |
Language Grounding to Vision, Robotics and Beyond |
[Findings] Probing the Role of Positional Information in Vision-Language Models. Philipp J. Rösch, Jindřich Libovický |
[Findings] Improving Few-Shot Image Classification Using Machine- and User-Generated Natural Language Descriptions. Kosuke Nishida, Kyosuke Nishida, Shuichi Nishioka |
Language Resources and Evaluation |
[Findings] FedNLP: Benchmarking Federated Learning Methods for Natural Language Processing Tasks. Bill Yuchen Lin, Chaoyang He, Zihang Zeng, Hulin Wang, Yufen Huang, Christophe Dupuy, Rahul Gupta, Mahdi Soltanolkotabi, Xiang Ren, Salman Avestimehr |
[Findings] Challenging America: Modeling language in longer time scales. Jakub Pokrywka, Filip Graliński, Krzysztof Jassem, Karol Kaczmarek, Krzysztof Jan Jurkiewicz, Piotr Wierzchon |
[Findings] PubHealthTab: A Public Health Table-based Dataset for Evidence-based Fact Checking. Mubashara Akhtar, Oana Cocarascu, Elena Simperl |
[Findings] MM-Claims: A Dataset for Multimodal Claim Detection in Social Media. Gullal Singh Cheema, Sherzod Hakimov, Abdul Sittar, Eric Müller-Budack, Christian Otto, Ralph Ewerth |
[Findings] In-BoXBART: Get Instructions into Biomedical Multi-Task Learning. Mihir Parmar, Swaroop Mishra, Mirali Purohit, Man Luo, M. Hassan Murad, Chitta Baral |
[Findings] SemAttack: Natural Textual Attacks via Different Semantic Spaces. Boxin Wang, Chejian Xu, Xiangyu Liu, Yu Cheng, Bo Li |
[Findings] Language Models for Code-switch Detection of te reo Māori and English in a Low-resource Setting. Jesin James, Vithya Yogarajan, Isabella Shields, Catherine Watson, Peter Keegan, Keoni Mahelona, Peter-Lucas Jones |
[Findings] A Dog Is Passing Over The Jet? A Text-Generation Dataset for Korean Commonsense Reasoning and Evaluation. Jaehyung Seo, Seounghoon Lee, Chanjun Park, Yoonna Jang, Hyeonseok Moon, Sugyeong Eo, Seonmin Koo, Heuiseok Lim |
Machine Translation |
[Findings] CoCoA-MT: A Dataset and Benchmark for Contrastive Controlled MT with Application to Formality. Maria Nadejde, Anna Currey, Benjamin Hsu, Xing Niu, Marcello Federico, Georgiana Dinu |
[Findings] BitextEdit: Automatic Bitext Editing for Improved Low-Resource Machine Translation. Eleftheria Briakou, Sida Wang, Luke Zettlemoyer, Marjan Ghazvininejad |
[Findings] Learn To Remember: Transformer with Recurrent Memory for Document-Level Machine Translation. Yukun Feng, Feng Li, Ziang Song, Boyuan Zheng, Philipp Koehn |
NLP Applications |
[Findings] Learning to repair: Repairing model output errors after deployment using a dynamic memory of feedback. Niket Tandon, Aman Madaan, Peter Clark, Yiming Yang |
[Findings] TEAM: A multitask learning based Taxonomy Expansion approach for Attach and Merge. Bornali Phukon, Anasua Mitra, Ranbir Singh Sanasam, Priyankoo Sarmah |
[Findings] Multimodal Intent Discovery from Livestream Videos. Adyasha Maharana, Quan Hung Tran, Franck Dernoncourt, Seunghyun Yoon, Trung Bui, Walter W Chang, Mohit Bansal |
[Findings] Opponent Modeling in Negotiation Dialogues by Related Data Adaptation. Kushal Chawla, Gale Lucas, Jonathan May, Jonathan Gratch |
[Findings] Learning to Embed Multi-Modal Contexts for Situated Conversational Agents. Haeju Lee, Oh Joon Kwon, Yunseon Choi, Minho Park, Ran Han, Yoonhyung Kim, Jinhyeon Kim, Youngjune Lee, Haebin Shin, Kangwook Lee, Kee-Eung Kim |
[Findings] MultiVerS: Improving scientific claim verification with weak supervision and full-document context. David Wadden, Kyle Lo, Lucy Lu Wang, Arman Cohan, Iz Beltagy, Hannaneh Hajishirzi |
[Findings] An Item Response Theory Framework for Persuasion. Anastassia Kornilova, Vladimir Eidelman, Daniel Argyle |
[Findings] Self-Supervised Contrastive Learning with Adversarial Perturbations for Defending Word Substitution-based Attacks. Zhao Meng, Yihan Dong, Mrinmaya Sachan, Roger Wattenhofer |
[Findings] The Limits of Word Level Differential Privacy. Justus Mattern, Benjamin Weggenmann, Florian Kerschbaum |
[Findings] Denoising Neural Network for News Recommendation with Positive and Negative Implicit Feedback. Yunfan Hu, Zhaopeng Qiu, Xian Wu |
[Findings] Query2Particles: Knowledge Graph Reasoning with Particle Embeddings. Jiaxin Bai, Zihao Wang, Hongming Zhang, Yangqiu Song |
Phonology, Morphology and Word Segmentation |
[Findings] Restoring Hebrew Diacritics Without a Dictionary. Elazar Gershuni, Yuval Pinter |
Speech |
[Findings] BehancePR: A Punctuation Restoration Dataset for Livestreaming Video Transcript. Viet Dac Lai, Amir Pouran Ben Veyseh, Franck Dernoncourt, Thien Huu Nguyen |
Summarization |
[Findings] Masked Summarization to Generate Factually Inconsistent Summaries for Improved Factual Consistency Checking. Hwanhee Lee, Kang Min Yoo, Joonsuk Park, Hwaran Lee, Kyomin Jung |
[Findings] Efficient Few-Shot Fine-Tuning for Opinion Summarization. Arthur Brazinskas, Ramesh Nallapati, Mohit Bansal, Markus Dreyer |
[Findings] Make The Most of Prior Data: A Solution for Interactive Text Summarization with Preference Feedback. Duy-Hung Nguyen, Nguyen Viet Dung Nghiem, Bao-Sinh Nguyen, Tien Dung Le, Shahab Sabahi, Minh-Tien Nguyen, Hung Le |