Please ensure Javascript is enabled for purposes of website accessibility

Main Conference Schedule

On this page, you can choose the sessions (and individual papers/posters) of your choice and generate a PDF of your customized schedule. For the best experience, use a non-mobile device with a resolution of at least 1920x1080 and a full-screen browser. For help, simply type “?” while on the page or click on the “Help” button.

The overall schedule structure is final, but the assignment of papers to sessions and order of papers within sessions might still be modified to accommodate the final mode of presentation (virtual or in-person) chosen by the authors.

Regarding Virtual Poster Q&A Sessions: To foster discussion, virtual poster Q&A sessions will be organized in small Zoom rooms that bring together posters on similar themes.

All times are Pacific Daylight Time (GMT-7). Icons: = Session Chair; = Paper Award.

  • Click on the ”+” button or the title of a session to toggle it. Click the “Expand All Sessions ↓” button to expand all sessions in one go. Click again to collapse them.
  • To expand parallel sessions simultaneously, Hold Shift and click on any of them.
  • Hover over the time for any session to see its day and date as a tooltip.
  • Click on a paper or poster to toggle its selection. You can select more than one paper for a time slot.
  • Click the “Download PDF” button at the bottom to download your customized PDF.
timelocationinfo

Sunday, July 10, 2022
Registration and Breakfast7:30 – 9:00Level 3 Foyer & Level 5 Foyer
Tutorials9:00 – 17:30
Welcome Reception18:00 – 20:00
Monday, July 11, 2022
Registration and Breakfast7:30 – 9:00Level 3 Foyer & Level 5 Foyer
Welcome Session8:45 – 9:15Columbia C/D (Overflow: Columbia A & 302 Beckler)
Plenary Invited Talk 1: Batya Friedman: "Shaping Technology with Moral Imagination: Leveraging the Machinery of Value Sensitive Design"9:15 – 10:15Columbia C/D (Overflow: Columbia A & 302 Beckler)
Break10:15 – 10:45Regency A & B
Oral Session 1 + In-person Poster Session 1
10:45 – 12:15
Choose AllRemove All
Learning to Transfer Prompts for Text Generation. Junyi Li, Tianyi Tang, Jian-Yun Nie, Ji-Rong Wen, Xin Zhao
Long-term Control for Dialogue Generation: Methods and Evaluation. Ramya Ramakrishnan, Hashan Buddhika Narangodage, Mauro Schilman, Kilian Q Weinberger, Ryan McDonald
PPL-MCTS: Constrained Textual Generation Through Discriminator-Guided MCTS Decoding. Antoine Chaffin, Vincent Claveau, Ewa Kijak
RSTGen: Imbuing Fine-Grained Interpretable Control into Long-FormText Generators. Rilwan Akanni Adewoyin, Ritabrata Dutta, Yulan He
Robust (Controlled) Table-to-Text Generation with Structure-Aware Equivariance Learning. Fei Wang, Zhewei Xu, Pedro Szekely, Muhao Chen
TRUE: Re-evaluating Factual Consistency Evaluation. Or Honovich, Roee Aharoni, Jonathan Herzig, Hagai Taitelbaum, Doron Kukliansy, Vered Cohen, Thomas Scialom, Idan Szpektor, Avinatan Hassidim, Yossi Matias
10:45 – 12:15
Choose AllRemove All
AnswerSumm: A Manually-Curated Dataset and Pipeline for Answer Summarization. Alexander Fabbri, Xiaojian Wu, Srini Iyer, Haoran Li, Mona T. Diab
QAFactEval: Improved QA-Based Factual Consistency Evaluation for Summarization. Alexander Fabbri, Chien-Sheng Wu, Wenhao Liu, Caiming Xiong
Re-Examining System-Level Correlations of Automatic Summarization Evaluation Metrics. Daniel Deutsch, Rotem Dror, Dan Roth
Massive-scale Decoding for Text Generation using Lattices. Jiacheng Xu, Siddhartha Jonnalagadda, Greg Durrett
FactGraph: Evaluating Factuality in Summarization with Semantic Graph Representations. Leonardo F. R. Ribeiro, Mengwen Liu, Iryna Gurevych, Markus Dreyer, Mohit Bansal
CONFIT: Toward Faithful Dialogue Summarization with Linguistically-Informed Contrastive Fine-tuning. Xiangru Tang, Arjun Nair, Borui Wang, Bingyao Wang, Jai Amit Desai, Aaron Wade, Haoran Li, Asli Celikyilmaz, Yashar Mehdad, Dragomir Radev
10:45 – 12:15
Choose AllRemove All
An Enhanced Span-based Decomposition Method for Few-Shot Sequence Labeling. Peiyi Wang, Runxin Xu, Tianyu Liu, Qingyu Zhou, Yunbo Cao, Baobao Chang, Zhifang Sui
Modeling Multi-Granularity Hierarchical Features for Relation Extraction. Xinnian Liang, Shuangzhi Wu, Mu Li, Zhoujun Li
Contrastive Representation Learning for Cross-Document Coreference Resolution of Events and Entities. Benjamin Hsu, Graham Horwood
Cross-Lingual Event Detection via Optimized Adversarial Training. Luis Fernando Guzman-Nateras, Minh Van Nguyen, Thien Huu Nguyen
Learning to Borrow– Relation Representation for Without-Mention Entity-Pairs for Knowledge Graph Completion. Huda Hakami, Mona Hakami, Angrosh Mandya, Danushka Bollegala
[TACL] Ultra-fine Entity Typing with Indirect Supervision from Natural Language Inference. Bangzheng Li, Wenpeng Yin, Muhao Chen
10:45 – 12:15
Choose AllRemove All
KroneckerBERT: Significant Compression of Pre-trained Language Models Through Kronecker Decomposition and Knowledge Distillation. Marzieh S. Tahaei, Ella Charlaix, Vahid Partovi Nia, Ali Ghodsi, Mehdi Rezagholizadeh
Meta Learning for Natural Language Processing: A Survey. Hung-yi Lee, Shang-Wen Li, Thang Vu
On Transferability of Prompt Tuning for Natural Language Processing. Yusheng Su, Xiaozhi Wang, Yujia Qin, Chi-Min Chan, Yankai Lin, Huadong Wang, Kaiyue Wen, Zhiyuan Liu, Peng Li, Juanzi Li, Lei Hou, Maosong Sun, Jie Zhou
Sparse Distillation: Speeding Up Text Classification by Using Bigger Student Models. Qinyuan Ye, Madian Khabsa, Mike Lewis, Sinong Wang, Xiang Ren, Aaron Jaech
Sketching as a Tool for Understanding and Accelerating Self-attention for Long Sequences. Yifan Chen, Qi Zeng, Dilek Hakkani-Tur, Di Jin, Heng Ji, Yun Yang
 FNet: Mixing Tokens with Fourier Transforms. James Lee-Thorp, Joshua Ainslie, Ilya Eckstein, Santiago Ontanon
10:45 – 12:15
Choose AllRemove All
LUNA: Learning Slot-Turn Alignment for Dialogue State Tracking. Yifan Wang, Jing Zhao, Junwei Bao, Chaoqun Duan, Youzheng Wu, Xiaodong He
Stylized Knowledge-Grounded Dialogue Generation via Disentangled Template Rewriting. Qingfeng Sun, Can Xu, Huang Hu, Yujing Wang, Jian Miao, Xiubo Geng, Yining Chen, Fei Xu, Daxin Jiang
Enhance Incomplete Utterance Restoration by Joint Learning Token Extraction and Text Generation. Shumpei Inoue, Tsungwei Liu, Son Hong Nguyen, Minh-Tien Nguyen
Multi2WOZ: A Robust Multilingual Dataset and Conversational Pretraining for Task-Oriented Dialog. Chia-Chien Hung, Anne Lauscher, Ivan Vulić, Simone Paolo Ponzetto, Goran Glavaš
[TACL] Reducing conversational agents' overconfidence through linguistic calibration. Mielke, Arthur Szlam, Emily Dinan, Y-Lan Boureau
Intent Detection and Discovery from User Logs via Deep Semi-Supervised Contrastive Clustering. Rajat Kumar, Mayur Patidar, VAIBHAV VARSHNEY, Lovekesh Vig, Gautam Shroff
10:45 – 12:15 Regency A & B
Computational Social Science and Cultural Analytics
Political Ideology and Polarization: A Multi-dimensional Approach. Barea Sinno, Bernardo Oviedo, Katherine Atwell, Malihe Alikhani, Junyi Jessy Li
Counterfactually Augmented Data and Unintended Bias: The Case of Sexism and Hate Speech Detection. Indira Sen, Mattia Samory, Claudia Wagner, Isabelle Augenstein
Combining Humor and Sarcasm for Improving Political Parody Detection. Xiao Ao, Danae Sanchez Villegas, Daniel Preotiuc-Pietro, Nikolaos Aletras
Ethics, Bias, and Fairness
Measuring Fairness with Biased Rulers: A Comparative Study on Bias Metrics for Pre-trained Language Models. Pieter Delobelle, Ewoenam Kwaku Tokpo, Toon Calders, Bettina Berendt
Recognition of They/Them as Singular Personal Pronouns in Coreference Resolution. Connor Baumler, Rachel Rudinger
Using Natural Sentence Prompts for Understanding Biases in Language Models. Sarah Alnegheimish, Alicia Guo, Yi Sun
Human-Centered NLP
Towards Process-Oriented, Modular, and Versatile Question Generation that Meets Educational Needs. Xu Wang, Simin Fan, Jessica Houghton, Lu Wang
Machine-in-the-Loop Rewriting for Creative Image Captioning. Vishakh Padmakumar, He He
What Makes a Good and Useful Summary? Incorporating Users in Automatic Summarization Research. Maartje Ter Hoeve, Julia Kiseleva, Maarten de Rijke
Information Retrieval and Text Mining
Seed-Guided Topic Discovery with Out-of-Vocabulary Seeds. Yu Zhang, Yu Meng, Xuan Wang, Sheng Wang, Jiawei Han
Learning Cross-Lingual IR from an English Retriever. Yulong Li, Martin Franz, Md Arafat Sultan, Bhavani Iyer, Young-Suk Lee, Avirup Sil
Collective Relevance Labeling for Passage Retrieval. Jihyuk Kim, Minsoo Kim, seung-won hwang
Improving Neural Models for Radiology Report Retrieval with Lexicon-based Automated Annotation. Luyao Shi, Tanveer Syeda-mahmood, Tyler Baldwin
Interpretability and Analysis of Models for NLP
Residue-Based Natural Language Adversarial Attack Detection. Vyas Raina, Mark Gales
Locally Aggregated Feature Attribution on Natural Language Model Understanding. Sheng Zhang, Jin Wang, Haitao Jiang, Rui Song
Simple Local Attentions Remain Competitive for Long-Context Tasks. Wenhan Xiong, Barlas Oguz, Anchit Gupta, Xilun Chen, Diana Liskovich, Omer Levy, Scott Yih, Yashar Mehdad
Reframing Human-AI Collaboration for Generating Free-Text Explanations. Sarah Wiegreffe, Jack Hessel, Swabha Swayamdipta, Mark Riedl, Yejin Choi
Models In a Spelling Bee: Language Models Implicitly Learn the Character Composition of Tokens. Itay Itzhak, Omer Levy
Informativeness and Invariance: Two Perspectives on Spurious Correlations in Natural Language. Jacob Eisenstein
On the Diversity and Limits of Human Explanations. Chenhao Tan
Same Neurons, Different Languages: Probing Morphosyntax in Multilingual Pre-trained Models. Karolina Stanczak, Edoardo Ponti, Lucas Torroba Hennigen, Ryan Cotterell, Isabelle Augenstein
[TACL] Explanation-Based Human Debugging of NLP Models: A Survey. Piyawat Lertvittayakumjorn, Francesca Toni
Language Grounding to Vision, Robotics and Beyond
Exposing the Limits of Video-Text Models through Contrast Sets. Jae Sung Park, Sheng Shen, Ali Farhadi, Trevor Darrell, Yejin Choi, Anna Rohrbach
FOAM: A Follower-aware Speaker Model For Vision-and-Language Navigation. Zi-Yi Dou, Nanyun Peng
MCSE: Multimodal Contrastive Learning of Sentence Embeddings. Miaoran Zhang, Marius Mosbach, David Ifeoluwa Adelani, Michael A. Hedderich, Dietrich Klakow
Machine Translation
Quality-Aware Decoding for Neural Machine Translation. Patrick Fernandes, António Farinhas, Ricardo Rei, José G. C. de Souza, Perez Ogayo, Graham Neubig, Andre Martins
Cheat Codes to Quantify Missing Source Information in Neural Machine Translation. Proyag Pal, Kenneth Heafield
Language Model Augmented Monotonic Attention for Simultaneous Translation. Sathish Reddy Indurthi, Mohd Abbas Zaidi, Beomseok Lee, Nikhil Kumar Lakumarapu, Sangha Kim
Building Multilingual Machine Translation Systems That Serve Arbitrary XY Translations. Akiko Eriguchi, Shufang Xie, Tao Qin, Hany Hassan
Training Mixed-Domain Translation Models via Federated Learning. Peyman Passban, Tanya Roosta, Rahul Gupta, Ankit Chadha, Clement Chung
Multilinguality
Towards Debiasing Translation Artifacts. KOEL DUTTA CHOWDHURY, Rricha Jalota, Cristina España-Bonet, Josef van Genabith
Pretrained Models for Multilingual Federated Learning. Orion Weller, Marc Marone, Vladimir Braverman, Dawn Lawrie, Benjamin Van Durme
[CL] Investigating Language Relationships in Multilingual Sentence Encoders through the Lens of Linguistic Typology. Rochelle Choenni, Ekaterina Shutova
NLP Applications
DynamicTOC: Persona-based Table of Contents for Consumption of Long Documents. Himanshu Maheshwari, Nethraa Sivakumar, Shelly Jain, Tanvi Karandikar, Vinay Aggarwal, Navita Goyal, Sumit Shekhar
TWEETSPIN: Fine-grained Propaganda Detection in Social Media Using Multi-View Representations. Prashanth Vijayaraghavan, Soroush Vosoughi
A Shoulder to Cry on: Towards A Motivational Virtual Assistant for Assuaging Mental Agony. Tulika Saha, Saichethan Miriyala Reddy, Anindya Sundar Das, Sriparna Saha, Pushpak Bhattacharyya
Cross-document Misinformation Detection based on Event Graph Reasoning. Xueqing Wu, Kung-Hsiang Huang, Yi Fung, Heng Ji
A Word is Worth A Thousand Dollars: Adversarial Attack on Tweets Fools Stock Prediction. Yong Xie, Dakuo Wang, Pin-Yu Chen, Jinjun Xiong, Sijia Liu, Oluwasanmi O Koyejo
Lunch12:15 – 13:15
Industry Panel: Careers in NLP13:15 – 14:15Columbia C/D (Overflow: Columbia A & 302 Beckler)
Break14:15 – 14:30Regency A & B
Oral Session 2 + In-person Poster Session 2
14:30 – 16:00
Choose AllRemove All
ElitePLM: An Empirical Study on General Language Ability Evaluation of Pretrained Language Models. Junyi Li, Tianyi Tang, Zheng Gong, Lixin Yang, Zhuohao Yu, Zhipeng Chen, Jingyuan Wang, Xin Zhao, Ji-Rong Wen
When Does Syntax Mediate Neural Language Model Performance? Evidence from Dropout Probes. Mycal Tucker, Tiwalayo Eisape, Peng Qian, Roger P. Levy, Julie Shah
ExSum: From Local Explanations to Model Understanding. Yilun Zhou, Marco Tulio Ribeiro, Julie Shah
Is "My Favorite New Movie" My Favorite Movie? Probing the Understanding of Recursive Noun Phrases. Qing Lyu, Hua Zheng, Daoxin Li, Li Zhang, Marianna Apidianaki, Chris Callison-Burch
Lifelong Pretraining: Continually Adapting Language Models to Emerging Corpora. Xisen Jin, Dejiao Zhang, Henghui Zhu, Wei Xiao, Shang-Wen Li, Xiaokai Wei, Andrew Arnold, Xiang Ren
Even the Simplest Baseline Needs Careful Re-investigation: A Case Study on XML-CNN. Si-An Chen, JIE-JYUN LIU, Tsung-Han Yang, Hsuan-Tien Lin, Chih-Jen Lin
14:30 – 16:00
Choose AllRemove All
Testing the Ability of Language Models to Interpret Figurative Language. Emmy Liu, Chenxuan Cui, Kenneth Zheng, Graham Neubig
Compositional Task-Oriented Parsing as Abstractive Question Answering. Wenting Zhao, Konstantine Arkoudas, Weiqi Sun, Claire Cardie
Linguistic Frameworks Go Toe-to-Toe at Neuro-Symbolic Language Modeling. Jakob Prange, Nathan Schneider, Lingpeng Kong
Improving Compositional Generalization with Latent Structure and Data Augmentation. Linlu Qiu, Peter Shaw, Panupong Pasupat, Pawel Krzysztof Nowak, Tal Linzen, Fei Sha, Kristina Toutanova
DiffCSE: Difference-based Contrastive Learning for Sentence Embeddings. Yung-Sung Chuang, Rumen Dangovski, Hongyin Luo, Yang Zhang, Shiyu Chang, Marin Soljacic, Shang-Wen Li, Scott Yih, Yoon Kim, James R. Glass
Bilingual Tabular Inference: A Case Study on Indic Languages. Chaitanya Agarwal, Vivek Gupta, Anoop Kunchukuttan, Manish Shrivastava
14:30 – 16:00
Choose AllRemove All
Bidimensional Leaderboards: Generate and Evaluate Language Hand in Hand. Jungo Kasai, Keisuke Sakaguchi, Ronan Le Bras, Lavinia Dunagan, Jacob Daniel Morrison, Alexander Fabbri, Yejin Choi, Noah Smith
CORWA: A Citation-Oriented Related Work Annotation Dataset. Xiangci Li, Biswadip Mandal, Jessica Ouyang
Shedding New Light on the Language of the Dark Web. Youngjin Jin, Eugene Jang, Yongjae Lee, Seungwon Shin, Jin-Woo Chung
A Few Thousand Translations Go a Long Way! Leveraging Pre-trained Models for African News Translation. David Ifeoluwa Adelani, Jesujoba Oluwadara Alabi, Angela Fan, Julia Kreutzer, Xiaoyu Shen, Machel Reid, Dana Ruiter, Dietrich Klakow, Peter Nabende, Ernie Chang, Tajuddeen Gwadabe, Freshia Sackey, Bonaventure F. P. Dossou, Chris Chinenye Emezue, Colin Leong, Michael Beukman, Shamsuddeen Hassan Muhammad, Guyo Dub Jarso, Oreen Yousuf, Andre Niyongabo Rubungo, Gilles HACHEME, Eric Peter Wairagala, Muhammad Umair Nasir, Benjamin Ayoade Ajibade, Tunde Oluwaseyi Ajayi, Yvonne Wambui Gitau, Jade Abbott, Mohamed Ahmed, Millicent Ochieng, Anuoluwapo Aremu, Perez Ogayo, Jonathan Mukiibi, Fatoumata Ouoba Kabore, Godson Koffi KALIPE, Derguene Mbaye, Allahsera Auguste Tapo, Victoire Memdjokam Koagne, Edwin Munkoh-Buabeng, Valencia Wagner, Idris Abdulmumin, Ayodele Awokoya, Happy Buzaaba, Blessing Sibanda, Andiswa Bukula, Sam Manthalu
Does Summary Evaluation Survive Translation to Other Languages?. Spencer Braun, Oleg Vasilyev, Neslihan Iskender, John Bohannon
DISAPERE: A Dataset for Discourse Structure in Peer Review Discussions. Neha Nayak Kennard, Tim O'Gorman, Rajarshi Das, Akshay Sharma, Chhandak Bagchi, Matthew Clinton, Pranay Kumar Yelugam, Hamed Zamani, Andrew McCallum
14:30 – 16:00
Choose AllRemove All
Original or Translated? A Causal Analysis of the Impact of Translationese on Machine Translation Performance. Jingwei Ni, Zhijing Jin, Markus Freitag, Mrinmaya Sachan, Bernhard Schölkopf
On Systematic Style Differences between Unsupervised and Supervised MT and an Application for High-Resource Machine Translation. Kelly Marchisio, Markus Freitag, David Grangier
The Devil is in the Details: On the Pitfalls of Vocabulary Selection in Neural Machine Translation. Tobias Domhan, Eva Hasler, Ke Tran, Sony Trenous, Bill Byrne, Felix Hieber
BlonDe: An Automatic Evaluation Metric for Document-level Machine Translation. Yuchen Eleanor Jiang, Tianyu Liu, Shuming Ma, Dongdong Zhang, Jian Yang, Haoyang Huang, Rico Sennrich, Ryan Cotterell, Mrinmaya Sachan, Ming Zhou
Non-Autoregressive Machine Translation: It's Not as Fast as it Seems. Jindřich Helcl, Barry Haddow, Alexandra Birch
[TACL] High Quality Rather than High Model Probability: Minimum Bayes Risk Decoding with Neural Metrics. Markus Freitag, David Grangier, Qijun Tan, Bowen Liang
14:30 – 16:00
Choose AllRemove All
ScAN: Suicide Attempt and Ideation Events Dataset. Bhanu Pratap Singh Rawat, Samuel Kovaly, Hong Yu, Wilfred Pigeon
DUCK: Rumour Detection on Social Media by Modelling User and Comment Propagation Networks. LIN TIAN, Xiuzhen Zhang, Jey Han Lau
Early Rumor Detection Using Neural Hawkes Process with a New Benchmark Dataset. Fengzhu ZENG, Wei Gao
Frustratingly Easy System Combination for Grammatical Error Correction. Muhammad Reza Qorib, Seung-Hoon Na, Hwee Tou Ng
On the Use of Bert for Automated Essay Scoring: Joint Learning of Multi-Scale Essay Representation. Yongjie Wang, Chuan Wang, Ruobing Li, Hui Lin
KCD: Knowledge Walks and Textual Cues Enhanced Political Perspective Detection in News Media. Wenqian Zhang, Shangbin Feng, Zilong Chen, Zhenyu Lei, Jundong Li, Minnan Luo
14:30 – 16:00 Regency A & B
Dialogue and Interactive Systems
Towards a Progression-Aware Autonomous Dialogue Agent. Abraham Sanders, Tomek Strzalkowski, Mei Si, Albert Chang, Deepanshu Dey, Jonas Braasch, Dakuo Wang
Building a Role Specified Open-Domain Dialogue System Leveraging Large-Scale Language Models. Sanghwan Bae, Donghyun Kwak, Sungdong Kim, Donghoon Ham, Soyoung Kang, Sang-Woo Lee, Woomyoung Park
Emp-RFT: Empathetic Response Generation via Recognizing Feature Transitions between Utterances. Wongyu Kim, Youbin Ahn, Donghyun Kim, Kyong-Ho Lee
Database Search Results Disambiguation for Task-Oriented Dialog Systems. Kun Qian, Satwik Kottur, Ahmad Beirami, Shahin Shayandeh, Paul A. Crook, Alborz Geramifard, Zhou Yu, Chinnadhurai Sankar
Learning Dialogue Representations from Consecutive Utterances. Zhihan Zhou, Dejiao Zhang, Wei Xiao, Nicholas Dingwall, Xiaofei Ma, Andrew Arnold, Bing Xiang
Knowledge-Grounded Dialogue Generation with a Unified Knowledge Representation. Yu Li, Baolin Peng, yelong shen, Yi Mao, Lars Liden, Zhou Yu, Jianfeng Gao
On the Origin of Hallucinations in Conversational Models: Is it the Datasets or the Models?. Nouha Dziri, Sivan Milton, Mo Yu, Osmar Zaiane, Siva Reddy
Show, Don't Tell: Demonstrations Outperform Descriptions for Schema-Guided Task-Oriented Dialogue. Raghav Gupta, Harrison Lee, Jeffrey Zhao, Yuan Cao, Abhinav Rastogi, Yonghui Wu
Generating Repetitions with Appropriate Repeated Words. Toshiki Kawamoto, Hidetaka Kamigaito, Kotaro Funakoshi, Manabu Okumura
ErAConD: Error Annotated Conversational Dialog Dataset for Grammatical Error Correction. Xun Yuan, Derek Pham, Sam Davidson, Zhou Yu
Discourse and Pragmatics
Incorporating Centering Theory into Neural Coreference Resolution. Haixia Chai, Michael Strube
Efficient Methods in NLP
LEA: Meta Knowledge-Driven Self-Attentive Document Embedding for Few-Shot Text Classification. Seungki Hong, Tae Young Jang
Information Extraction
Event Schema Induction with Double Graph Autoencoders. Xiaomeng Jin, Manling Li, Heng Ji
Unified Semantic Typing with Meaningful Label Inference. James Y. Huang, Bangzheng Li, Jiashu Xu, Muhao Chen
Crossroads, Buildings and Neighborhoods: A Dataset for Fine-grained Location Recognition. Pei Chen, Haotian Xu, Cheng Zhang, Ruihong Huang
CompactIE: Compact Facts in Open Information Extraction. Farima Fatahi Bayat, Nikita Bhutani, H. Jagadish
Modeling Task Interactions in Document-Level Joint Entity and Relation Extraction. Liyan Xu, Jinho D. Choi
Language Generation
Go Back in Time: Generating Flashbacks in Stories with Event Temporal Prompts. Rujun Han, Hong Chen, Yufei Tian, Nanyun Peng
Cross-Domain Detection of GPT-2-Generated Technical Text. Juan Diego Rodriguez, Todd Hay, David Gros, Zain Shamsi, Ravi Srinivasan
Learning to Selectively Learn for Weakly Supervised Paraphrase Generation with Model-based Reinforcement Learning. Haiyan Yin, Dingcheng Li, Ping Li
AmbiPun: Generating Humorous Puns with Ambiguous Context. Anirudh Mittal, Yufei Tian, Nanyun Peng
Machine Learning for NLP: Classification and Structured Prediction Models
Inducing and Using Alignments for Transition-based AMR Parsing. Andrew Drozdov, Jiawei Zhou, Radu Florian, Andrew McCallum, Tahira Naseem, Yoon Kim, Ramon Fernandez Astudillo
Consistency Training with Virtual Adversarial Discrete Perturbation. Jungsoo Park, Gyuwan Kim, Jaewoo Kang
Contrastive Learning for Prompt-based Few-shot Language Learners. Yiren Jian, Chongyang Gao, Soroush Vosoughi
Embedding Hallucination for Few-shot Language Fine-tuning. Yiren Jian, Chongyang Gao, Soroush Vosoughi
Exploring the Role of Task Transferability in Large-Scale Multi-Task Learning. Vishakh Padmakumar, Leonard Lausen, Miguel Ballesteros, Sheng Zha, He He, George Karypis
Machine Learning for NLP: Language Modeling and Sequence to Sequence Models
Efficient Hierarchical Domain Adaptation for Pretrained Language Models. Alexandra Chronopoulou, Matthew E Peters, Jesse Dodge
Learning Natural Language Generation with Truncated Reinforcement Learning. Alice Martin, Guillaume Quispe, Charles Ollion, Sylvain Le Corff, Florian Strub, Olivier Pietquin
On the Effect of Pretraining Corpora on In-context Learning by a Large-scale Language Model. Seongjin Shin, Sang-Woo Lee, Hwijeen Ahn, Sungdong Kim, HyoungSeok Kim, Boseop Kim, Kyunghyun Cho, Gichang Lee, Woomyoung Park, Jung-Woo Ha, Nako Sung
Learning to Generate Examples for Semantic Processing Tasks. Danilo Croce, Simone Filice, Giuseppe Castellucci, Roberto Basili
PARADISE: Exploiting Parallel Data for Multilingual Sequence-to-Sequence Pretraining. Machel Reid, Mikel Artetxe
On Curriculum Learning for Commonsense Reasoning. Adyasha Maharana, Mohit Bansal
Summarization
Improving Multi-Document Summarization through Referenced Flexible Extraction with Credit-Awareness. Yun-Zhu Song, Yi-Syuan Chen, Hong-Han Shuai
TSTR: Too Short to Represent, Summarize with Details! Intro-Guided Extended Summary Generation. Sajad Sotudeh, Nazli Goharian
SueNes: A Weakly Supervised Approach to Evaluating Single-Document Summarization via Negative Sampling. Forrest Sheng Bao, Ge Luo, Hebi Li, Minghui Qiu, Yinfei Yang, Youbiao He, Cen Chen
Investigating Crowdsourcing Protocols for Evaluating the Factual Consistency of Summaries. Xiangru Tang, Alexander Fabbri, Haoran Li, Ziming Mao, Griffin Thomas Adams, Borui Wang, Asli Celikyilmaz, Yashar Mehdad, Dragomir Radev
Syntax: Tagging, Chunking, and Parsing
Sort by Structure: Language Model Ranking as Dependency Probing. Max Müller-Eberstein, Rob van der Goot, Barbara Plank
[CL] The Impact of Edge Displacement Vaserstein Distance on UD Parsing Performance. Mark Anderson, Carlos Gómez-Rodríguez
Break16:00 – 16:30Regency A & B
16:30 – 17:30 Columbia C/D (Overflow: Columbia A & 302 Beckler)
 User-Driven Research of Medical Note Generation Software. Tom Knoll, Francesco Moramarco, Alex Papadopoulos Korfiatis, Rachel Young, Claudia Ruffini, Mark Perera, Christian Perstl, Ehud Reiter, Anya Belz, Aleksandar Savkov
 Automatic Correction of Human Translations. Jessy Lin, Geza Kovacs, Aditya Shastry, Joern Wuebker, John DeNero
 NeuroLogic A*esque Decoding: Constrained Text Generation with Lookahead Heuristics. Ximing Lu, Sean Welleck, Peter West, Liwei Jiang, Jungo Kasai, Daniel Khashabi, Ronan Le Bras, Lianhui Qin, Youngjae Yu, Rowan Zellers, Noah Smith, Yejin Choi
Tuesday, July 12, 2022
Registration and Breakfast7:30 – 9:00Level 3 Foyer & Level 5 Foyer
Oral Session 3 + Industry Oral Session 1 + Virtual Poster Q&A Session 1
8:00 – 9:00
Choose AllRemove All
Low Resource Style Transfer via Domain Adaptive Meta Learning. Xiangyang Li, Xiang Long, Yu Xia, Sujian Li
Don't Take It Literally: An Edit-Invariant Sequence Loss for Text Generation. Guangyi Liu, Zichao Yang, Tianhua Tao, Xiaodan Liang, Junwei Bao, Zhen Li, Xiaodong He, Shuguang Cui, Zhiting Hu
Towards Robust and Semantically Organised Latent Representations for Unsupervised Text Style Transfer. Sharan Narasimhan, Suvodip Dey, Maunendra Sankar Desarkar
MOVER: Mask, Over-generate and Rank for Hyperbole Generation. Yunxiang Zhang, Xiaojun Wan
8:00 – 9:00
Choose AllRemove All
[TACL] Fact Checking with Insufficient Evidence. Pepa Atanasova, Jakob Grue Simonsen, Christina Lioma, Isabelle Augenstein
A Double-Graph Based Framework for Frame Semantic Parsing. Ce Zheng, Xudong Chen, Runxin Xu, Baobao Chang
Identifying Implicitly Abusive Remarks about Identity Groups using a Linguistically Informed Approach. Michael Wiegand, Elisabeth Eder, Josef Ruppenhofer
Disentangled Learning of Stance and Aspect Topics for Vaccine Attitude Detection in Social Media. Lixing Zhu, Zheng Fang, Gabriele Pergola, Robert Procter, Yulan He
8:00 – 9:00
Choose AllRemove All
MuCGEC: a Multi-Reference Multi-Source Evaluation Dataset for Chinese Grammatical Error Correction. Yue Zhang, Zhenghua Li, Zuyi Bao, Jiacheng Li, Bo Zhang, Chen Li, Fei Huang, Min Zhang
A Corpus for Understanding and Generating Moral Stories. Jian Guan, Ziqi Liu, Minlie Huang
End-to-End Chinese Speaker Identification. Dian Yu, Ben Zhou, Dong Yu
WALNUT: A Benchmark on Semi-weakly Supervised Learning for Natural Language Understanding. Guoqing Zheng, Giannis Karamanolakis, Kai Shu, Ahmed Hassan Awadallah
8:00 – 9:00
Choose AllRemove All
Learning to Win Lottery Tickets in BERT Transfer via Task-agnostic Mask Training. Yuanxin Liu, Fandong Meng, Zheng Lin, Peng Fu, Yanan Cao, Weiping Wang, Jie Zhou
Knowledge Inheritance for Pre-trained Language Models. Yujia Qin, Yankai Lin, Jing Yi, Jiajie Zhang, Xu Han, Zhengyan Zhang, Yusheng Su, Zhiyuan Liu, Peng Li, Maosong Sun, Jie Zhou
Towards Efficient NLP: A Standard Evaluation and A Strong Baseline. Xiangyang Liu, Tianxiang Sun, JunLiang He, Jiawen Wu, Lingling Wu, Xinyu Zhang, Hao Jiang, Zhao Cao, Xuanjing Huang, Xipeng Qiu
Adaptable Adapters. Nafise Sadat Moosavi, Quentin Delfosse, Kristian Kersting, Iryna Gurevych
8:00 – 9:00
Choose AllRemove All
CHEF: A Pilot Chinese Dataset for Evidence-Based Fact-Checking. Xuming Hu, Zhijiang Guo, GuanYu Wu, Aiwei Liu, Lijie Wen, Philip S. Yu
Progressive Class Semantic Matching for Semi-supervised Text Classification. Haiming Xu, Lingqiao Liu, Ehsan M Abbasnejad
Unsupervised Paraphrasability Prediction for Compound Nominalizations. John Sie Yuen Lee, Ho Hung Lim, Carol Webster
Dual-Channel Evidence Fusion for Fact Verification over Texts and Tables. Nan Hu, Zirui Wu, Yuxuan Lai, Xiao Liu, Yansong Feng
8:00 – 9:00
Choose AllRemove All
: Kasturi Bhattacharjee
CREATER: CTR-driven Advertising Text Generation with Controlled Pre-Training and Contrastive Fine-Tuning. Penghui Wei, Xuanhua Yang, ShaoGuo Liu, Liang Wang, Bo Zheng
Augmenting Poetry Composition with Verse by Verse. David Uthus, Maria Voitovich, R.J. Mical
FPI: Failure Point Isolation in Large-scale Conversational Assistants. Rinat Khaziev, Usman Shahid, Tobias Roeding, Rakesh Chada, Emir Kapanci, Pradeep Natarajan
ReFinED: An Efficient Zero-shot-capable Approach to End-to-End Entity Linking. Tom Ayoola, Shubhi Tyagi, Joseph Fisher, Christos Christodoulopoulos, Andrea Pierleoni
8:00 – 9:00
Dialogue and Interactive Systems
[SRW] Explicit Use of Topicality in Dialogue Response Generation. Takumi Yoshikoshi, Hayato Atarashi, Takashi Kodama, Sadao Kurohashi
[SRW] Automating Human Evaluation of Dialogue Systems. Sujan Reddy A
Generating Repetitions with Appropriate Repeated Words. Toshiki Kawamoto, Hidetaka Kamigaito, Kotaro Funakoshi, Manabu Okumura
EmpHi: Generating Empathetic Responses with Human-like Intents. MAO YAN CHEN, Siheng Li, Yujiu Yang
Towards a Progression-Aware Autonomous Dialogue Agent. Abraham Sanders, Tomek Strzalkowski, Mei Si, Albert Chang, Deepanshu Dey, Jonas Braasch, Dakuo Wang
Representation Learning for Conversational Data using Discourse Mutual Information Maximization. Bishal Santra, Sumegh Roychowdhury, Aishik Mandal, Vasu Gurram, Atharva Naik, Manish Gupta, Pawan Goyal
D2U: Distance-to-Uniform Learning for Out-of-Scope Detection. Eyup Halit Yilmaz, Cagri Toraman
Building a Role Specified Open-Domain Dialogue System Leveraging Large-Scale Language Models. Sanghwan Bae, Donghyun Kwak, Sungdong Kim, Donghoon Ham, Soyoung Kang, Sang-Woo Lee, Woomyoung Park
Revisit Overconfidence for OOD Detection: Reassigned Contrastive Learning with Adaptive Class-dependent Threshold. Yanan Wu, Keqing He, Yuanmeng Yan, QiXiang Gao, Zhiyuan Zeng, Fujia Zheng, Lulu Zhao, Huixing Jiang, Wei Wu, Weiran Xu
AISFG: Abundant Information Slot Filling Generator. Yang Yan, Junda Ye, Zhongbao Zhang, Liwen Wang
Mining Clues from Incomplete Utterance: A Query-enhanced Network for Incomplete Utterance Rewriting. Shuzheng Si, Shuang Zeng, Baobao Chang
Meet Your Favorite Character: Open-domain Chatbot Mimicking Fictional Characters with only a Few Utterances. Seungju Han, Beomsu Kim, Jin Yong Yoo, Seokjun Seo, Sangbum Kim, Enkhbayar Erdenee, Buru Chang
[Findings] Learning to Execute Actions or Ask Clarification Questions. Zhengxiang Shi, Yue Feng, Aldo Lipani
[Findings] BORT: Back and Denoising Reconstruction for End-to-End Task-Oriented Dialog. Haipeng Sun, Junwei Bao, Youzheng Wu, Xiaodong He
[Findings] Am I Me or You? State-of-the-Art Dialogue Models Cannot Maintain an Identity. Kurt Shuster, Jack Urbanek, Arthur Szlam, Jason E Weston
[Findings] DialoKG: Knowledge-Structure Aware Task-Oriented Dialogue Generation. Md Rashad Al Hasan Rony, Ricardo Usbeck, Jens Lehmann
[Findings] Zero-shot Cross-lingual Conversational Semantic Role Labeling. Han Wu, Haochen Tan, Kun Xu, Shuqi LIU, Lianwei Wu, Linqi Song
[Findings] A Framework to Generate High-Quality Datapoints for Multiple Novel Intent Detection. Ankan Mullick, Sukannya Purkayastha, Pawan Goyal, Niloy Ganguly
[Findings] Anti-Overestimation Dialogue Policy Learning for Task-Completion Dialogue System. Chang Tian, Wenpeng Yin, Marie-Francine Moens
[Findings] Prompt Augmented Generative Replay via Supervised Contrastive Learning for Lifelong Intent Detection. VAIBHAV VARSHNEY, Mayur Patidar, Rajat Kumar, Lovekesh Vig, Gautam Shroff
[Findings] NLU++: A Multi-Label, Slot-Rich, Generalisable Dataset for Natural Language Understanding in Task-Oriented Dialogue. Inigo Casanueva, Ivan Vulić, Georgios P. Spithourakis, Paweł Budzianowski
Information Extraction
Relation-Specific Attentions over Entity Mentions for Enhanced Document-Level Relation Extraction. Jiaxin Yu, Deqing Yang, Shuyu Tian
Hero-Gang Neural Model For Named Entity Recognition. Jinpeng Hu, Yaling Shen, Yang Liu, Xiang Wan, Tsung-Hui Chang
Modal Dependency Parsing via Language Model Priming. Jiarui Yao, Nianwen Xue, Bonan Min
Document-Level Event Argument Extraction by Leveraging Redundant Information and Closed Boundary Loss. Hanzhang Zhou, Kezhi Mao
Global Entity Disambiguation with BERT. Ikuya Yamada, Koki Washio, Hiroyuki Shindo, Yuji Matsumoto
Does it Really Generalize Well on Unseen Data? Systematic Evaluation of Relational Triple Extraction Methods. Juhyuk Lee, Min-Joong Lee, June Yong Yang, Eunho Yang
Generative Biomedical Entity Linking via Knowledge Base-Guided Pre-training and Synonyms-Aware Fine-tuning. Hongyi Yuan, Zheng Yuan, Sheng Yu
RAAT: Relation-Augmented Attention Transformer for Relation Modeling in Document-Level Event Extraction. Yuan Liang, Zhuoxuan Jiang, di yin, Bo Ren
[Findings] Improving Few-Shot Relation Classification by Prototypical Representation Learning with Definition Text. Li Zhenzhen, Yuyang Zhang, Jian-Yun Nie, Dongsheng Li
[Findings] Dependency Position Encoding for Relation Extraction. Qiushi Guo, Xin Wang, Dehong Gao
[Findings] XLTime: A Cross-Lingual Knowledge Transfer Framework for Temporal Expression Extraction. Yuwei Cao, William Groves, Tanay Kumar Saha, Joel R. Tetreault, Alex Jaimes, Hao Peng, Philip S. Yu
[Findings] A Label-Aware Autoregressive Framework for Cross-Domain NER. Jinpeng Hu, He Zhao, Dan dan Guo, Xiang Wan, Tsung-Hui Chang
[Findings] Learning Discriminative Representations for Open Relation Extraction with Instance Ranking and Label Calibration. Shusen Wang, Bin Duan, Yanan Wu, Yajing Xu
[Findings] RCL: Relation Contrastive Learning for Zero-Shot Relation Extraction. Shusen Wang, Bosen Zhang, Yajing Xu, Yanan Wu, Bo Xiao
[Findings] Zero-Shot Event Detection Based on Ordered Contrastive Learning and Prompt-Based Prediction. Senhui Zhang, Tao Ji, Wendi Ji, Xiaoling Wang
Information Retrieval and Text Mining
SKILL: Structured Knowledge Infusion for Large Language Models. Fedor Moiseev, Zhe Dong, Enrique Alfonseca, Martin Jaggi
Collective Relevance Labeling for Passage Retrieval. Jihyuk Kim, Minsoo Kim, seung-won hwang
[Findings] Domain-matched Pre-training Tasks for Dense Retrieval. Barlas Oguz, Kushal Lakhotia, Anchit Gupta, Patrick Lewis, Vladimir Karpukhin, Aleksandra Piktus, Xilun Chen, Sebastian Riedel, Scott Yih, Sonal Gupta, Yashar Mehdad
[Findings] CL-ReLKT: Cross-lingual Language Knowledge Transfer for Multilingual Retrieval Question Answering. Peerat Limkonchotiwat, Wuttikorn Ponwitayarat, Can Udomcharoenchaikit, Ekapol Chuangsuwanich, Sarana Nutanong
[Findings] Weakly Supervised Text Classification using Supervision Signals from a Language Model. Ziqian Zeng, Weimin Ni, Tianqing Fang, Xiang Li, Xinran Zhao, Yangqiu Song
Interpretability and Analysis of Models for NLP
[SRW] Probe-Less Probing of BERT's Layer-Wise Linguistic Knowledge with Masked Word Prediction. Tatsuya Aoyama, Nathan Schneider
Same Neurons, Different Languages: Probing Morphosyntax in Multilingual Pre-trained Models. Karolina Stanczak, Edoardo Ponti, Lucas Torroba Hennigen, Ryan Cotterell, Isabelle Augenstein
How Conservative are Language Models? Adapting to the Introduction of Gender-Neutral Pronouns. Stephanie Brandl, Ruixiang Cui, Anders Søgaard
Residue-Based Natural Language Adversarial Attack Detection. Vyas Raina, Mark Gales
Models In a Spelling Bee: Language Models Implicitly Learn the Character Composition of Tokens. Itay Itzhak, Omer Levy
[Findings] Phrase-level Textual Adversarial Attack with Label Preservation. Yibin Lei, Yu Cao, Dianqi Li, Tianyi Zhou, Meng Fang, Mykola Pechenizkiy
Linguistic Theories, Cognitive Modeling and Psycholinguistics
Abstraction not Memory: BERT and the English Article System. Harish Tayyar Madabushi, Dagmar Divjak, Petar Milin
Machine Learning for NLP: Classification and Structured Prediction Models
Inducing and Using Alignments for Transition-based AMR Parsing. Andrew Drozdov, Jiawei Zhou, Radu Florian, Andrew McCallum, Tahira Naseem, Yoon Kim, Ramon Fernandez Astudillo
Label Anchored Contrastive Learning for Language Understanding. Zhenyu Zhang, Yuming Zhao, Meng Chen, Xiaodong He
Improving Constituent Representation with Hypertree Neural Networks. Hao Zhou, Gongshen Liu, Kewei Tu
Generic and Trend-aware Curriculum Learning for Relation Extraction. Nidhi Vakil, Hadi Amiri
On the Effectiveness of Sentence Encoding for Intent Detection Meta-Learning. Tingting Ma, Qianhui Wu, Zhiwei Yu, Tiejun Zhao, Chin-Yew Lin
A Data Cartography based MixUp for Pre-trained Language Models. Seo Yeon Park, Cornelia Caragea
Embedding Hallucination for Few-shot Language Fine-tuning. Yiren Jian, Chongyang Gao, Soroush Vosoughi
Contrastive Learning for Prompt-based Few-shot Language Learners. Yiren Jian, Chongyang Gao, Soroush Vosoughi
Consistency Training with Virtual Adversarial Discrete Perturbation. Jungsoo Park, Gyuwan Kim, Jaewoo Kang
Embarrassingly Simple Performance Prediction for Abductive Natural Language Inference. Emīls Kadiķis, Vaibhav Srivastav, Roman Klinger
Machine Learning for NLP: Language Modeling and Sequence to Sequence Models
[SRW] Regularized Training of Nearest Neighbor Language Models. Jean-Francois Ton, Walter Talbott, Shuangfei Zhai, Joshua M. Susskind
On Curriculum Learning for Commonsense Reasoning. Adyasha Maharana, Mohit Bansal
Efficient Hierarchical Domain Adaptation for Pretrained Language Models. Alexandra Chronopoulou, Matthew E Peters, Jesse Dodge
Improving In-Context Few-Shot Learning via Self-Supervised Training. Mingda Chen, Jingfei Du, Ramakanth Pasunuru, Todor Mihaylov, Srini Iyer, Veselin Stoyanov, Zornitsa Kozareva
Learning to Generate Examples for Semantic Processing Tasks. Danilo Croce, Simone Filice, Giuseppe Castellucci, Roberto Basili
On the Effect of Pretraining Corpora on In-context Learning by a Large-scale Language Model. Seongjin Shin, Sang-Woo Lee, Hwijeen Ahn, Sungdong Kim, HyoungSeok Kim, Boseop Kim, Kyunghyun Cho, Gichang Lee, Woomyoung Park, Jung-Woo Ha, Nako Sung
[Findings] A Generative Language Model for Few-shot Aspect-Based Sentiment Analysis. Ehsan Hosseini-Asl, Wenhao Liu, Caiming Xiong
[Findings] RGL: A Simple yet Effective Relation Graph Augmented Prompt-based Tuning Approach for Few-Shot Learning. Yaqing Wang, Xin Tian, Haoyi Xiong, Yueyang Li, Zeyu Chen, Sheng Guo, Dejing Dou
[Findings] Speeding Up Entmax. Maxat Tezekbayev, Vassilina Nikoulina, Matthias Gallé, Zhenisbek Assylbekov
[Findings] MixQG: Neural Question Generation with Mixed Answer Types. Lidiya Murakhovs'ka, Chien-Sheng Wu, Philippe Laban, Tong Niu, Wenhao Liu, Caiming Xiong
[Findings] SeaD: End-to-end Text-to-SQL Generation with Schema-aware Denoising. Kuan Xu, Yongbo Wang, Yongliang Wang, Zihao Wang, Zujie Wen, Yang Dong
Machine Translation
[SRW] Analysing the Correlation between Lexical Ambiguity and Translation Quality in a Multimodal Setting using WordNet. Ali Hatami, Paul Buitelaar, Mihael Arcan
Language Model Augmented Monotonic Attention for Simultaneous Translation. Sathish Reddy Indurthi, Mohd Abbas Zaidi, Beomseok Lee, Nikhil Kumar Lakumarapu, Sangha Kim
On Synthetic Data for Back Translation. Jiahao Xu, Yubin Ruan, Wei Bi, Guoping Huang, Shuming Shi, Lihui Chen, Lemao Liu
Non-Autoregressive Neural Machine Translation with Consistency Regularization Optimized Variational Framework. Minghao Zhu, Junli Wang, Chungang Yan
Cheat Codes to Quantify Missing Source Information in Neural Machine Translation. Proyag Pal, Kenneth Heafield
Training Mixed-Domain Translation Models via Federated Learning. Peyman Passban, Tanya Roosta, Rahul Gupta, Ankit Chadha, Clement Chung
Bi-SimCut: A Simple Strategy for Boosting Neural Machine Translation. Pengzhi Gao, Zhongjun He, Hua Wu, Haifeng Wang
[Findings] Latent Group Dropout for Multilingual and Multidomain Machine Translation. Minh-Quang PHAM, François Yvon, Josep Crego
Break9:00 – 9:15Regency A & B
Plenary Panel: The Place of Linguistics and Symbolic Structures9:15 – 10:15Columbia C/D (Overflow: Columbia A & 302 Beckler)
Break10:15 – 10:45Regency A & B
Oral Session 4 + In-person Poster Session 3 + SRW Panel Discussion for Starting Researchers
10:45 – 12:15
Choose AllRemove All
Measure and Improve Robustness in NLP Models: A Survey. Xuezhi Wang, Haohan Wang, Diyi Yang
Using Paraphrases to Study Properties of Contextual Embeddings. Laura Burdick, Jonathan K Kummerfeld, Rada Mihalcea
Can Rationalization Improve Robustness?. Howard Chen, Jacqueline He, Karthik R Narasimhan, Danqi Chen
Necessity and Sufficiency for Explaining Text Classifiers: A Case Study in Hate Speech Detection. Esma Balkir, Isar Nejadgholi, Kathleen C. Fraser, Svetlana Kiritchenko
What do tokens know about their characters and how do they know it?. Ayush Kaushal, Kyle Mahowald
Do Prompt-Based Models Really Understand the Meaning of Their Prompts?. Albert Webson, Ellie Pavlick
10:45 – 12:15
Choose AllRemove All
NeuS: Neutral Multi-News Summarization for Mitigating Framing Bias. Nayeon Lee, Yejin Bang, Tiezheng YU, Andrea Madotto, Pascale Fung
Joint Learning-based Heterogeneous Graph Attention Network for Timeline Summarization. Jingyi You, Dongyuan Li, Hidetaka Kamigaito, Kotaro Funakoshi, Manabu Okumura
Interactive Query-Assisted Summarization via Deep Reinforcement Learning. Ori Shapira, Ramakanth Pasunuru, Mohit Bansal, Ido Dagan, Yael Amsterdamer
FactPEGASUS: Factuality-Aware Pre-training and Fine-tuning for Abstractive Summarization. David Wan, Mohit Bansal
Proposition-Level Clustering for Multi-Document Summarization. Ori Ernst, Avi Caciularu, Ori Shapira, Ramakanth Pasunuru, Mohit Bansal, Jacob Goldberger, Ido Dagan
[TACL] A Multi-Level Optimization Framework for End-to-End Text Augmentation. Sai Ashish Somayajula, Linfeng Song, Pengtao Xie
10:45 – 12:15
Choose AllRemove All
CERES: Pretraining of Graph-Conditioned Transformer for Semi-Structured Session Data. Rui Feng, Chen Luo, Qingyu Yin, Bing Yin, Tuo Zhao, Chao Zhang
GPL: Generative Pseudo Labeling for Unsupervised Domain Adaptation of Dense Retrieval. Kexin Wang, Nandan Thakur, Nils Reimers, Iryna Gurevych
Boosted Dense Retriever. Patrick Lewis, Barlas Oguz, Wenhan Xiong, Fabio Petroni, Scott Yih, Sebastian Riedel
A Dataset for N-ary Relation Extraction of Drug Combinations. Aryeh Tiktinsky, Vijay Viswanathan, Danna Niezni, Dana Meron Azagury, Yosi Shamay, Hillel Taub-Tabib, Tom Hope, Yoav Goldberg
ColBERTv2: Effective and Efficient Retrieval via Lightweight Late Interaction. Keshav Santhanam, Omar Khattab, Jon Saad-Falcon, Christopher Potts, Matei Zaharia
Multi-Vector Models with Textual Guidance for Fine-Grained Scientific Document Similarity. Sheshera Mysore, Arman Cohan, Tom Hope
10:45 – 12:15
Choose AllRemove All
KAT: A Knowledge Augmented Transformer for Vision-and-Language. Liangke Gui, Borui Wang, Qiuyuan Huang, Alexander G Hauptmann, Yonatan Bisk, Jianfeng Gao
Do Trajectories Encode Verb Meaning?. Dylan Ebert, Chen Sun, Ellie Pavlick
Diagnosing Vision-and-Language Navigation: What Really Matters. Wanrong Zhu, Yuankai Qi, Pradyumna Narayana, Kazoo Sone, Sugato Basu, Xin Eric Wang, Qi Wu, Miguel Eckstein, William Yang Wang
Connecting the Dots between Audio and Text without Parallel Data through Visual Knowledge Transfer. Yanpeng Zhao, Jack Hessel, Youngjae Yu, Ximing Lu, Rowan Zellers, Yejin Choi
Guiding Visual Question Generation. Nihir Vedd, Zixu Wang, Marek Rei, yishu miao, Lucia Specia
Interactive Symbol Grounding with Complex Referential Expressions. Rimvydas Rubavicius, Alex Lascarides
10:45 – 12:15
Choose AllRemove All
Learning to Express in Knowledge-Grounded Conversation. Xueliang Zhao, Tingchen Fu, Chongyang Tao, Wei Wu, Dongyan Zhao, Rui Yan
Partner Personas Generation for Dialogue Response Generation. Hongyuan Lu, Wai Lam, Hong Cheng, Helen M. Meng
Robust Conversational Agents against Imperceptible Toxicity Triggers. Ninareh Mehrabi, Ahmad Beirami, Fred Morstatter, Aram Galstyan
VGNMN: Video-grounded Neural Module Networks for Video-Grounded Dialogue Systems. Hung Le, Nancy F. Chen, Steven HOI
Multimodal Dialogue State Tracking. Hung Le, Nancy F. Chen, Steven HOI
Commonsense and Named Entity Aware Knowledge Grounded Dialogue Generation. Deeksha varshney, Akshara Prabhakar, Asif Ekbal
10:45 – 12:15 Regency A & B
Language Resources and Evaluation
SkillSpan: Hard and Soft Skill Extraction from English Job Postings. Mike Zhang, Kristian Nørgaard Jensen, Sif Dam Sonniks, Barbara Plank
Transparent Human Evaluation for Image Captioning. Jungo Kasai, Keisuke Sakaguchi, Lavinia Dunagan, Jacob Daniel Morrison, Ronan Le Bras, Yejin Choi, Noah Smith
TVShowGuess: Character Comprehension in Stories as Speaker Guessing. Yisi Sang, Xiangyang Mou, Mo Yu, Shunyu Yao, Jing Li, Jeffrey Stanton
CS1QA: A Dataset for Assisting Code-based Question Answering in an Introductory Programming Course. Changyoon Lee, Yeon Seonwoo, Alice Oh
Semantic Diversity in Dialogue with Natural Language Inference. Katherine Stasaski, Marti Hearst
Extending Multi-Text Sentence Fusion Resources via Pyramid Annotations. Daniela Brook Weiss, Paul Roit, Ori Ernst, Ido Dagan
ChapterBreak: A Challenge Dataset for Long-Range Language Models. Simeng Sun, Katherine Thai, Mohit Iyyer
The USMLE® Step 2 Clinical Skills Patient Note Corpus. Victoria Yaneva, Janet Mee, Le An Ha, Polina Harik, Michael Jodoin, Alex J Mechaber
[TACL] Czech Grammar Error Correction with a Large and Diverse Corpus. Jakub Náplava, Milan Straka, Jana Straková, Alexandr Rosen
Linguistic Theories, Cognitive Modeling and Psycholinguistics
A Computational Acquisition Model for Multimodal Word Categorization. Uri Berger, Gabriel Stanovsky, Omri Abend, Lea Frermann
[TACL] He Thinks He Knows Better than the Doctors: BERT for Event Factuality Fails on Pragmatics. Nanjiang Jiang, Marie-Catherine de Marneffe
Question Answering
ProQA: Structural Prompt-based Pre-training for Unified Question Answering. Wanjun Zhong, Yifan Gao, Ning Ding, Yujia Qin, Zhiyuan Liu, Ming Zhou, Jiahai Wang, Jian Yin, Nan Duan
DREAM: Improving Situational QA by First Elaborating the Situation. Yuling Gu, Bhavana Dalvi, Peter Clark
A New Concept of Knowledge based Question Answering (KBQA) System for Multi-hop Reasoning. Yu Wang, Vijay Srinivasan, Hongxia Jin
Yes, No or IDK: The Challenge of Unanswerable Yes/No Questions. Elior Sulem, Jamaal Hay, Dan Roth
Long Context Question Answering via Supervised Contrastive Learning. Avi Caciularu, Ido Dagan, Jacob Goldberger, Arman Cohan
[TACL] Narrative Question Answering with Cutting-Edge Open-Domain QA Techniques: A Comprehensive Study. Xiangyang Mou, Chenghao Yang, Mo Yu, Bingsheng Yao, Xiaoxiao Guo, Saloni Potdar, Hui Su
Semantics: Sentence-level Semantics and Textual Inference
DocAMR: Multi-Sentence AMR Representation and Evaluation. Tahira Naseem, Austin Blodgett, Sadhana Kumaravel, Tim O'Gorman, Young-Suk Lee, Jeffrey Flanigan, Ramon Fernandez Astudillo, Radu Florian, Salim Roukos, Nathan Schneider
CoSe-Co: Text Conditioned Generative CommonSense Contextualizer. Rachit Bansal, Milan Aggarwal, Sumit Bhatia, Jivat Neet Kaur, Balaji Krishnamurthy
Batch-Softmax Contrastive Loss for Pairwise Sentence Scoring Tasks. Anton Chernyavskiy, Dmitry Ilvovsky, Pavel Kalinin, Preslav Nakov
Label Definitions Improve Semantic Role Labeling. Li Zhang, Ishan Jindal, Yunyao Li
Few-Shot Semantic Parsing with Language Models Trained on Code. Richard Shin, Benjamin Van Durme
Partial-input baselines show that NLI models can ignore context, but they don't.. Neha Srikanth, Rachel Rudinger
Improving negation detection with negation-focused pre-training. Thinh Hung Truong, Timothy Baldwin, Trevor Cohn, Karin Verspoor
Paragraph-based Transformer Pre-training for Multi-Sentence Inference. Luca Di Liello, Siddhant Garg, Luca Soldaini, Alessandro Moschitti
SUBS: Subtree Substitution for Compositional Semantic Parsing. Jingfeng Yang, Le Zhang, Diyi Yang
Sentiment Analysis and Stylistic Analysis
Multi-Domain Targeted Sentiment Analysis. Orith Toledo-Ronen, Matan Orbach, Yoav Katz, Noam Slonim
UserIdentifier: Implicit User Representations for Simple and Effective Personalized Sentiment Analysis. Fatemehsadat Mireshghallah, Vaishnavi Shrivastava, Milad Shokouhi, Taylor Berg-Kirkpatrick, Robert Sim, Dimitrios Dimitriadis
Data Augmentation with Dual Training for Offensive Span Detection. Nasim Nouri
Analyzing Modality Robustness in Multimodal Sentiment Analysis. Devamanyu Hazarika, Yingting Li, Bo Cheng, Shuai Zhao, Roger Zimmermann, Soujanya Poria
Speech
Quantifying Language Variation Acoustically with Few Resources. Martijn Bartelds, Martijn Wieling
Lunch12:15 – 14:15
Business Meeting12:15 – 14:15Columbia A
Oral Session 5 + Industry Oral Session 2 + SRW In-person Poster Session
14:15 – 15:45
Choose AllRemove All
What Factors Should Paper-Reviewer Assignments Rely On? Community Perspectives on Issues and Ideals in Conference Peer-Review. Terne Sasha Thorn Jakobsen, Anna Rogers
Theory-Grounded Measurement of U.S. Social Stereotypes in English Language Models. Yang Trista Cao, Anna Sotnikova, Hal Daumé III, Rachel Rudinger, Linda Zou
Benchmarking Intersectional Biases in NLP. John P. Lalor, Yi Yang, Kendall Smith, Nicole Forsgren, Ahmed Abbasi
Annotators with Attitudes: How Annotator Beliefs And Identities Bias Toxic Language Detection. Maarten Sap, Swabha Swayamdipta, Laura Vianna, Xuhui Zhou, Yejin Choi, Noah Smith
Features or Spurious Artifacts? Data-centric Baselines for Fair and Robust Hate Speech Detection. Alan Ramponi, Sara Tonelli
Gender Bias in Masked Language Models for Multiple Languages. Masahiro Kaneko, Aizhan Imankulova, Danushka Bollegala, Naoaki Okazaki
14:15 – 15:45
Choose AllRemove All
Putting the Con in Context: Identifying Deceptive Actors in the Game of Mafia. Samee Omotayo Ibraheem, Gaoyue Zhou, John DeNero
CoMPM: Context Modeling with Speaker's Pre-trained Memory Tracking for Emotion Recognition in Conversation. Joosung Lee, Wooin Lee
COGMEN: COntextualized GNN based Multimodal Emotion recognitioN. Abhinav Joshi, Ashwani Bhat, Ayush Jain, Atin Vikram Singh, Ashutosh Modi
Domain Confused Contrastive Learning for Unsupervised Domain Adaptation. Quanyu Long, Tianze Luo, Wenya Wang, Sinno Pan
Text Style Transfer via Optimal Transport. Nasim Nouri
SSEGCN: Syntactic and Semantic Enhanced Graph Convolutional Network for Aspect-based Sentiment Analysis. Zheng Zhang, Zili Zhou, Yanna Wang
14:15 – 15:45
Choose AllRemove All
DEGREE: A Data-Efficient Generation-Based Event Extraction Model. I-Hung Hsu, Kuan-Hao Huang, Elizabeth Boschee, Scott Miller, Prem Natarajan, Kai-Wei Chang, Nanyun Peng
[TACL] Text-based NP Enrichment. Yanai Elazar, Victoria Basmov, Yoav Goldberg
Few-Shot Document-Level Relation Extraction. Nicholas Popovic, Michael Färber
Joint Extraction of Entities, Relations, and Events via Modeling Inter-Instance and Inter-Label Dependencies. Minh Van Nguyen, Bonan Min, Franck Dernoncourt, Thien Huu Nguyen
Hyperbolic Relevance Matching for Neural Keyphrase Extraction. Mingyang Song, Yi Feng, Liping Jing
A Two-Stream AMR-enhanced Model for Document-level Event Argument Extraction. Runxin Xu, Peiyi Wang, Tianyu Liu, Shuang Zeng, Baobao Chang, Zhifang Sui
14:15 – 15:45
Choose AllRemove All
Mapping the Design Space of Human-AI Interaction in Text Summarization. Ruijia Cheng, Alison Smith-Renner, Ke Zhang, Joel R. Tetreault, Alejandro Jaimes
Deconstructing NLG Evaluation: Evaluation Practices, Assumptions, and Their Implications. Kaitlyn Zhou, Su Lin Blodgett, Adam Trischler, Hal Daumé III, Kaheer Suleman, Alexandra Olteanu
User-Centric Gender Rewriting. Bashar Alhafni, Nizar Habash, Houda Bouamor
Explaining Why: How Instructions and User Interfaces Impact Annotator Rationales When Labeling Text Data. Jamar L. Sullivan Jr., Will Brackenbury, Andrew McNutt, Kevin Bryson, Kwam Byll, Yuxin Chen, Michael Littman, Chenhao Tan, Blase Ur
An Exploration of Post-Editing Effectiveness in Text Summarization. Vivian Lai, Alison Smith-Renner, Ke Zhang, Ruijia Cheng, Wenjuan Zhang, Joel R. Tetreault, Alejandro Jaimes
The Why and The How: A Survey on Natural Language Interaction in Visualization. Henrik Voigt, Ozge Alacam, Monique Meuschke, Kai Lawonn, Sina Zarrieß
14:15 – 15:45
Choose AllRemove All
Cryptocurrency Bubble Detection: A New Stock Market Dataset, Financial Task & Hyperbolic Models. Ramit Sawhney, Shivam Agarwal, Vivek Mittal, Paolo Rosso, Vikram Nanda, Sudheer Chava
Many Hands Make Light Work: Using Essay Traits to Automatically Score Essays. Rahul Kumar, Sandeep Mathias, Sriparna Saha, Pushpak Bhattacharyya
Aligning to Social Norms and Values in Interactive Narratives. Prithviraj Ammanabrolu, Liwei Jiang, Maarten Sap, Hannaneh Hajishirzi, Yejin Choi
LITE: Intent-based Task Representation Learning Using Weak Supervision. Naoki Otani, Michael Gamon, Sujay Kumar Jauhar, Mei Yang, Sri Raghu Malireddi, Oriana Riva
Forecasting COVID-19 Caseloads Using Unsupervised Embedding Clusters of Social Media Posts. Felix Drinkall, Stefan Zohren, Janet B. Pierrehumbert
Context-Aware Abbreviation Expansion Using Large Language Models. Shanqing Cai, Subhashini Venugopalan, Katrin Tomanek, Ajit Narayanan, Meredith Ringel Morris, Michael Brenner
14:15 – 15:45
Choose AllRemove All
: Sandesh Swamy
Self-supervised Product Title Rewrite for Product Listing Ads. Xue Zhao, Dayiheng Liu, Junwei Ding, Liang Yao, Mahone Yan, Huibo wang, Wenqing Yao
Local-to-global learning for iterative training of production SLU models on new features. Yulia Grishina, Daniil Sorokin
Medical Coding with Biomedical Transformer Ensembles and Zero/Few-shot Learning. Angelo Ziletti, Alan Akbik, Christoph Berns, Thomas Herold, Marion Legler, Martina Viell
CTM - A Model for Large-Scale Multi-View Tweet Topic Classification. Vivek Kulkarni, Kenny Leung, Aria Haghighi
Self-Aware Feedback-Based Self-Learning in Large-Scale Conversational AI. Pragaash Ponnusamy, Clint Solomon Mathialagan, Gustavo Aguilar, Chengyuan Ma, Chenlei Guo
Aspect-based Analysis of Advertising Appeals for Search Engine Advertising. Soichiro Murakami, Peinan Zhang, Sho Hoshino, Hidetaka Kamigaito, Hiroya Takamura, Manabu Okumura
14:15 – 15:45 Regency A & B
[SRW] Systematicity Emerges in Transformers when Abstract Grammatical Roles Guide Attention. Ayush K Chakravarthy, Jacob Labe Russin, Randall O'Reilly
[SRW] Grounding in social media: An approach to building a chit-chat dialogue model. Ritvik Choudhary, Daisuke Kawahara
[SRW] ExtraPhrase: Efficient Data Augmentation for Abstractive Summarization. Mengsay Loem, Sho Takase, Masahiro Kaneko, Naoaki Okazaki
[SRW] Neural Retriever and Go Beyond: A Thesis Proposal. Man Luo
[SRW] Improving Classification of Infrequent Cognitive Distortions: Domain-Specific Model vs. Data Augmentation. Xiruo Ding, Kevin Lybarger, Justin Tauscher, Trevor Cohen
[SRW] Towards Gender Biased Language Classification: A Case Study with British English Archival Metadata Descriptions. Lucy Havens
[SRW] What "Drives" the Use of Metaphorical Language? Negative Insights from Abstractness, Affect, Discourse Coherence and Contextualized Word Representations. Prisca Piccirilli, Sabine Schulte im Walde
[SRW] Generate, Evaluate, and Select: A Dialogue System with a Response Evaluator for Diversity-Aware Response Generation. Ryoma Sakaeda, Daisuke Kawahara
[SRW] Building a Personalized Dialogue System with Prompt-Tuning. Tomohito Kasahara, Daisuke Kawahara, Nguyen Tung, Shengzhe Li, Kenta Shinzato, Toshinori Sato
[SRW] MM-GATBT: Enriching Multimodal Representation Using Graph Attention Network. Seung Byum Seo, Hyoungwook Nam, Payam Delgosha
[SRW] ViT5: Pretrained Text-to-Text Transformer for Vietnamese Language Generation. Long Phan, Hieu Tran, Hieu Nguyen, Trieu H. Trinh
[SRW] Compositional Generalization in Grounded Language Learning via Induced Model Sparsity. Sam Spilsbury, Alexander Ilin
[SRW] How do people talk about images? A study on open-domain conversations with images.. Yi-Pei Chen, Nobuyuki Shimizu, Takashi Miyazaki, Hideki Nakayama
[SRW] Preschool Children Speech Recognition for Early Childhood Intervention: Motivation and Challenges. Satwik Dutta, Dwight W. Irvin, John H. L. Hansen
[SRW] A Simple Approach to Jointly Rank Passages and Select Relevant Sentences in the OBQA Context. Man Luo, Shuguang Chen, Chitta Baral
[SRW] Multimodal Modeling of Task-Mediated Confusion. Camille Mince, Skye Rhomberg, Cecilia Alm, Reynold Bailey, Alex Ororbia
[SRW] Machine Narrative Comprehension in Fictional Characters Personality Prediction Task. Yisi Sang, Xiangyang Mou, Mo Yu, Dakuo Wang, Jing Li, Jeffrey Stanton
[SRW] Divide & Conquer for Entailment-aware Multi-hop Evidence Retrieval. Fan Luo, Mihai Surdeanu
[SRW] Multimodal large language models for inclusive collaboration learning tasks. Armanda Lewis
[SRW] Neural Networks in a Product of Hyperbolic Spaces. Jun Takeuchi, Noriki Nishida, Hideki Nakayama
[SRW] Strong Heuristics for Named Entity Linking. Marko Čuljak, Andreas Spitz, Robert West, Akhil Arora
[SRW] Unifying Parsing and Tree-Structured Models for Generating Sentence Semantic Representations. Antoine Simoulin, Benoit Crabbé
[SRW] Defending Compositionality in Emergent Languages. Michal Auersperger, Pavel Pecina
[SRW] Exploring the Effect of Dialect Mismatched Language Models in Telugu Automatic Speech Recognition. Aditya Yadavalli, Ganesh Sai Mirishkar, Anil Vuppala
Break15:45 – 16:15Regency A & B
Oral Session 6 + SRW Thesis Proposals Session + Industry/Demo In-person Poster Session + Virtual Poster Q&A Session 2
16:15 – 17:45
Choose AllRemove All
All You May Need for VQA are Image Captions. Soravit Changpinyo, Doron Kukliansy, Idan Szpektor, Xi Chen, Nan Ding, Radu Soricut
Imagination-Augmented Natural Language Understanding. Yujie Lu, Wanrong Zhu, Xin Eric Wang, Miguel Eckstein, William Yang Wang
Visual Commonsense in Pretrained Unimodal and Multimodal Models. Chenyu Zhang, Benjamin Van Durme, Zhuowan Li, Elias Stengel-Eskin
Few-shot Subgoal Planning with Language Models. Lajanugen Logeswaran, Yao Fu, Moontae Lee, Honglak Lee
Disentangling Categorization in Multi-agent Emergent Communication. Washington Garcia, Hamilton Scott Clouse, Kevin R. B. Butler
CoSIm: Commonsense Reasoning for Counterfactual Scene Imagination. Hyounghun Kim, Abhay Zala, Mohit Bansal
16:15 – 17:45
Choose AllRemove All
[CL] Tractable Parsing for CCGs of Bounded Degree. Lena Katharina Schiffer, Marco Kuhlmann, Giorgio Satta
Template-free Prompt Tuning for Few-shot NER. Ruotian Ma, Xin Zhou, Tao Gui, Yiding Tan, Linyang Li, Qi Zhang, Xuanjing Huang
Dynamic Gazetteer Integration in Multilingual Models for Cross-Lingual and Cross-Domain Named Entity Recognition. Besnik Fetahu, Anjie Fang, Oleg Rokhlenko, Shervin Malmasi
Unsupervised Cross-Lingual Transfer of Structured Predictors without Source Data. Kemal Kurniawan, Lea Frermann, Philip Schulz, Trevor Cohn
Masked Part-Of-Speech Model: Does Modeling Long Context Help Unsupervised POS-tagging?. Xiang Zhou, Shiyue Zhang, Mohit Bansal
Dynamic Programming in Rank Space: Scaling Structured Inference with Low-Rank HMMs and PCFGs. Songlin Yang, Wei Liu, Kewei Tu
16:15 – 17:45
Choose AllRemove All
 A Balanced Data Approach for Evaluating Cross-Lingual Transfer: Mapping the Linguistic Blood Bank. Dan Malkin, Tomasz Limisiewicz, Gabriel Stanovsky
When is BERT Multilingual? Isolating Crucial Ingredients for Cross-lingual Transfer. Ameet Deshpande, Partha Talukdar, Karthik R Narasimhan
Lifting the Curse of Multilinguality by Pre-training Modular Transformers. Jonas Pfeiffer, Naman Goyal, Xi Victoria Lin, Xian Li, James Cross, Sebastian Riedel, Mikel Artetxe
Combating the Curse of Multilinguality in Cross-Lingual WSD by Aligning Sparse Contextualized Word Representations. Gábor Berend
On the Economics of Multilingual Few-shot Learning: Modeling the Cost-Performance Trade-offs of Machine Translated and Manual Data. Kabir Ahuja, Monojit Choudhury, Sandipan Dandapat
Bridging the Gap between Language Models and Cross-Lingual Sequence Labeling. Nuo Chen, Linjun Shou, MING GONG, Jian Pei, Daxin Jiang
16:15 – 17:45
Choose AllRemove All
DEMix Layers: Disentangling Domains for Modular Language Modeling. Suchin Gururangan, Mike Lewis, Ari Holtzman, Noah Smith, Luke Zettlemoyer
Practice Makes a Solver Perfect: Data Augmentation for Math Word Problem Solvers. Vivek Kumar, Rishabh Maheshwary, Vikram Pudi
Quantifying Adaptability in Pre-trained Language Models with 500 Tasks. Belinda Z. Li, Jane A. Yu, Madian Khabsa, Luke Zettlemoyer, Alon Y. Halevy, Jacob Andreas
KALA: Knowledge-Augmented Language Model Adaptation. Minki Kang, Jinheon Baek, Sung Ju Hwang
Extreme Zero-Shot Learning for Extreme Text Classification. Yuanhao Xiong, Wei-Cheng Chang, Cho-Jui Hsieh, Hsiang-Fu Yu, Inderjit S Dhillon
TreeMix: Compositional Constituency-based Data Augmentation for Natural Language Understanding. Le Zhang, Zichao Yang, Diyi Yang
16:15 – 17:45
Choose AllRemove All
QuALITY: Question Answering with Long Input Texts, Yes!. Richard Yuanzhe Pang, Alicia Parrish, Nitish Joshi, Nikita Nangia, Jason Phang, Angelica Chen, Vishakh Padmakumar, Johnny L Ma, Jana Thompson, He He, Samuel R. Bowman
On the Robustness of Reading Comprehension Models to Entity Renaming. Jun Yan, Yang Xiao, Sagnik Mukherjee, Bill Yuchen Lin, Robin Jia, Xiang Ren
OmniTab: Pretraining with Natural and Synthetic Data for Few-shot Table-based Question Answering. Zhengbao Jiang, Yi Mao, Pengcheng He, Graham Neubig, Weizhu Chen
Modeling Exemplification in Long-form Question Answering via Retrieval. Shufan Wang, Fangyuan Xu, Laure Thompson, Eunsol Choi, Mohit Iyyer
JointLK: Joint Reasoning with Language Models and Knowledge Graphs for Commonsense Question Answering. Yueqing Sun, Qi Shi, Le Qi, Yu Zhang
Clues Before Answers: Generation-Enhanced Multiple-Choice QA. Zixian Huang, Ao Wu, Jiaying Zhou, Yu Gu, Yue Zhao, Gong Cheng
16:15 – 17:45
Choose AllRemove All
[SRW] Methods for Estimating and Improving Robustness of Language Models. Michal Stefanik
[SRW] Retrieval-augmented Generation across Heterogeneous Knowledge. Wenhao Yu
[SRW] Neural Retriever and Go Beyond: A Thesis Proposal. Man Luo
[SRW] Towards Gender Biased Language Classification: A Case Study with British English Archival Metadata Descriptions. Lucy Havens
[SRW] Multimodal large language models for inclusive collaboration learning tasks. Armanda Lewis
16:15 – 17:45 Regency A & B
Industry Track Posters
Scalable and Robust Self-Learning for Skill Routing in Large-Scale Conversational AI Systems. Mohammad Kachuee, Jinseok Nam, Sarthak Ahuja, Jin-Myung Won, SUNGJIN LEE
AB/BA analysis: A framework for estimating keyword spotting recall improvement while maintaining audio privacy. Raphael Petegrosso, VasistaKrishna Baderdinnni, Thibaud Senechal, Benjamin Bullough
Temporal Generalization for Spoken Language Understanding. Judith Gaspers, Anoop Kumar, Greg Ver Steeg, Aram Galstyan
An End-to-End Dialogue Summarization System for Sales Calls. Abedelkadir Asi, Song Wang, Roy Eisenstadt, Dean Geckt, Yarin Kuper, Yi Mao, Royi Ronen
Controlled Data Generation via Insertion Operations for NLU. Manoj Kumar, Yuval Merhav, Haidar Khan, Rahul Gupta, Anna Rumshisky, Wael Hamza
Easy and Efficient Transformer: Scalable Inference Solution For Large NLP Model. Li GongZheng LGZ, Yadong Xi, Jingzhen Ding, Duan Wang, Ziyang Luo, Rongsheng Zhang, Bai Liu, Changjie Fan, Xiaoxi Mao, Zeng Zhao
Efficient Semi-supervised Consistency Training for Natural Language Understanding. George Leung, Joshua Tan
Distantly Supervised Aspect Clustering And Naming For E-Commerce Reviews. Prateek Sircar, Aniket Chakrabarti, DEEPAK GUPTA, Anirban Majumder
CULG: Commercial Universal Language Generation. Haonan Li, yameng huang, Yeyun Gong, Jian Jiao, Ruofei Zhang, Timothy Baldwin, Nan Duan
Constraining word alignments with posterior regularization for label transfer. Thomas Gueudre, Kevin Martin Jose
Explaining the Effectiveness of Multi-Task Learning for Efficient Knowledge Extraction from Spine MRI Reports. Arijit Sehanobish, McCullen Sandora, Nabila Abraham, Jayashri Pawar, Danielle Torres, Anasuya Das, Murray Becker, Richard Herzog, Benjamin Odry, Ron Vianu
Asynchronous Convergence in Multi-Task Learning via Knowledge Distillation from Converged Tasks. Weiyi Lu, Sunny Rajagopalan, Priyanka Nigam, Jaspreet Singh, Xiaodi Sun, Yi Xu, Belinda Zeng, Trishul Chilimbi
Augmenting Training Data for Massive Semantic Matching Models in Low-Traffic E-commerce Stores. Ashutosh Joshi, Shankar Vishwanath, Choon Hui Teo, Vaclav Petricek, Vishy Vishwanathan, Rahul Bhagat, Jonathan May
Retrieval Based Response Letter Generation For a Customer Care Setting. Biplob Biswas, Renhao Cui, Rajiv Ramnath
Knowledge extraction from aeronautical messages (NOTAMs) with self-supervised language models for aircraft pilots. Alexandre Arnold, Fares Ernez, Catherine Kobus, Marion-Cécile Martin
Intent Discovery for Enterprise Virtual Assistants: Applications of Utterance Embedding and Clustering to Intent Mining. Minhua Chen, Badrinath Jayakumar, Michael Johnston, S. Eman Mahmoodi, Daniel Pressel
Lightweight Transformers for Conversational AI. Daniel Pressel, Wenshuo Liu, Michael Johnston, Minhua Chen
NER-MQMRC: Formulating Named Entity Recognition as Multi Question Machine Reading Comprehension. Anubhav Shrimal, Avi Jain, Kartik Mehta, Promod Yenigalla
What Do Users Care About? Detecting Actionable Insights from User Feedback. Kasturi Bhattacharjee, Rashmi Gangadharaiah, Kathleen McKeown, Dan Roth
Developing a Production System for Purpose of Call Detection in Business Phone Conversations. Elena Khasanova, Pooja Hiranandani, Shayna Gardiner, Cheng Chen, Simon Corston-Oliver, Xue-Yong Fu
Adversarial Text Normalization. Joanna Bitton, Maya Pavlova, Ivan Evtimov
Constraint-based Multi-hop Question Answering with Knowledge Graph. Sayantan Mitra, Roshni Ramnani, Shubhashis Sengupta
Fast Bilingual Grapheme-To-Phoneme Conversion. Hwa-Yeon Kim, Jong-Hwan Kim, Jae-Min Kim
Knowledge Extraction From Texts Based on Wikidata. Anastasia Shimorina, Johannes Heinecke, Frédéric Herledan
AIT-QA: Question Answering Dataset over Complex Tables in the Airline Industry. Yannis Katsis, Saneem Ahmed Chemmengath, vishwajeet kumar, Samarth Bharadwaj, MUSTAFA CANIM, Michael Glass, Alfio Gliozzo, Feifei Pan, Jaydeep Sen, Karthik Sankaranarayanan, Soumen Chakrabarti
Parameter-efficient Continual Learning Framework in Industrial Real-time Text Classification System. Tao Zhu, Zhe Zhao, Weijie Liu, Jiachi Liu, Yiren Chen, Weiquan Mao, Haoyan Liu, Kunbo Ding, Yudong Li, Xuefeng Yang, Kimmo Yan
Fast and Light-Weight Answer Text Retrieval in Dialogue Systems. Hui Wan, Siva Sankalp Patel, J William Murdock, Saloni Potdar, Sachindra Joshi
BLINK with Elasticsearch for Efficient Entity Linking in Business Conversations. Md Tahmid Rahman Laskar, Cheng Chen, Aliaksandr Martsinovich, Jonathan Johnston, Xue-Yong Fu, Shashi Bhushan Tn, Simon Corston-Oliver
Q2R: A Query-to-Resolution System for Natural-Language Queries. Shiau Hong Lim, Laura Wynter
Identifying Corporate Credit Risk Sentiments from Financial News. Noujoud Ahbali, Xinyuan Liu, Albert Aristotle Nanda, Jamie Stark, Ashit Talukder, Rupinder Paul Khandpur
Demo Track Posters
textless-lib: a Library for Textless Spoken Language Processing. Eugene Kharitonov, Jade Copet, Kushal Lakhotia, Tu Anh Nguyen, Paden Tomasello, Ann Lee, Ali Elkahky, Wei-Ning Hsu, Abdelrahman Mohamed, Emmanuel Dupoux, Yossef Mordechay Adi
Web-based Annotation Interface for Derivational Morphology. Lukáš Kyjánek
TurkishDelightNLP: A Neural Turkish NLP Toolkit. Huseyin Alecakir, Necva Bölücü, Burcu Can
ZS4IE: A toolkit for Zero-Shot Information Extraction with simple Verbalizations. Oscar Sainz, Haoling Qiu, Oier Lopez de Lacalle, Eneko Agirre, Bonan Min
Flowstorm: Open-Source Platform with Hybrid Dialogue Architecture. Jan Pichl, Petr Marek, Jakub Konrád, Petr Lorenc, Ondrej Kobza, Tomáš Zajíček, Jan Šedivý
Contrastive Explanations of Text Classifiers as a Service. Lorenzo Malandri, Fabio Mercorio, Mario Mezzanzanica, Navid Nobani, Andrea Seveso
RESIN-11: Schema-guided Event Prediction for 11 Newsworthy Scenarios. Xinya Du, Zixuan Zhang, Sha Li, Pengfei Yu, Hongwei Wang, Tuan Lai, Xudong Lin, Ziqi Wang, Iris Liu, Ben Zhou, Haoyang Wen, Manling Li, Darryl Hannan, Jie Lei, Hyounghun Kim, Rotem Dror, Haoyu Wang, Michael Regan, Qi Zeng, Qing Lyu, Charles Yu, Carl Edwards, Xiaomeng Jin, Yizhu Jiao, Ghazaleh Kazeminejad, Zhenhailong Wang, Chris Callison-Burch, Mohit Bansal, Carl Vondrick, Jiawei Han, Dan Roth, Shih-Fu Chang, Martha Palmer, Heng Ji
A Human-machine Interface for Few-shot Rule Synthesis for Information Extraction. Robert Vacareanu, George C. G. Barbosa, Enrique Noriega-Atala, Gus Hahn-Powell, Rebecca Sharp, Marco Antonio Valenzuela-Escárcega, Mihai Surdeanu
SETSum: Summarization and Visualization of Student Evaluations of Teaching. Yinuo Hu, Shiyue Zhang, Viji Sathy, Abigail Panter, Mohit Bansal
Towards Open-Domain Topic Classification. Hantian Ding, Jinrui Yang, Yuqian Deng, Hongming Zhang, Dan Roth
SentSpace: Large-Scale Benchmarking and Evaluation of Text using Cognitively Motivated Lexical, Syntactic, and Semantic Features. Greta Tuckute, Aalok Sathe, Mingye Wang, Harley Yoder, Cory Shain, Evelina Fedorenko
PaddleSpeech: An Easy-to-Use All-in-One Speech Toolkit. Hui Zhang, Tian Yuan, Junkun Chen, Xintong Li, Renjie Zheng, Yuxin Huang, Xiaojie Chen, Enlei Gong, Zeyu Chen, Xiaoguang Hu, Dianhai Yu, Yanjun Ma, Liang Huang
DadmaTools: Natural Language Processing Toolkit for Persian Language. Romina Etezadi, Mohammad Karrabi, Najmeh Zare, Mohamad Bagher Sajadi, Mohammad Taher Pilehvar
FAMIE: A Fast Active Learning Framework for Multilingual Information Extraction. Minh Van Nguyen, Nghia Trung Ngo, Bonan Min, Thien Huu Nguyen
16:15 – 17:45
Computational Social Science and Cultural Analytics
[Findings] Detect Rumors in Microblog Posts for Low-Resource Domains via Adversarial Contrastive Learning. Hongzhan Lin, Jing Ma, Liangliang Chen, Zhiwei Yang, Mingfei Cheng, Guang Chen
Dialogue and Interactive Systems
Knowledge-Grounded Dialogue Generation with a Unified Knowledge Representation. Yu Li, Baolin Peng, yelong shen, Yi Mao, Lars Liden, Zhou Yu, Jianfeng Gao
Learning Dialogue Representations from Consecutive Utterances. Zhihan Zhou, Dejiao Zhang, Wei Xiao, Nicholas Dingwall, Xiaofei Ma, Andrew Arnold, Bing Xiang
Emp-RFT: Empathetic Response Generation via Recognizing Feature Transitions between Utterances. Wongyu Kim, Youbin Ahn, Donghyun Kim, Kyong-Ho Lee
Show, Don't Tell: Demonstrations Outperform Descriptions for Schema-Guided Task-Oriented Dialogue. Raghav Gupta, Harrison Lee, Jeffrey Zhao, Yuan Cao, Abhinav Rastogi, Yonghui Wu
Disentangling Indirect Answers to Yes-No Questions in Real Conversations. Krishna Chaitanya Sanagavarapu, Jathin Pranav Singaraju, Anusha Kakileti, Anirudh Kaza, Aaron Abraham Mathews, Helen Li, Nathan Raul Brito, Eduardo Blanco
On the Origin of Hallucinations in Conversational Models: Is it the Datasets or the Models?. Nouha Dziri, Sivan Milton, Mo Yu, Osmar Zaiane, Siva Reddy
[Findings] Instilling Type Knowledge in Language Models via Multi-Task QA. Shuyang Li, Mukund Sridhar, Chandana Satya Prakash, Jin Cao, Wael Hamza, Julian McAuley
[Findings] A Versatile Adaptive Curriculum Learning Framework for Task-oriented Dialogue Policy Learning. Yang Yang Zhao, Hua Qin, Wang Zhenyu, Changxi Zhu, Shihan Wang
Efficient Methods in NLP
Word Tour: One-dimensional Word Embeddings via the Traveling Salesman Problem. Ryoma Sato
Causal Distillation for Language Models. Zhengxuan Wu, Atticus Geiger, Joshua Rozner, Elisa Kreiss, Hanson Lu, Thomas Icard, Christopher Potts, Noah Goodman
[Findings] Attention Fusion: a light yet efficient late fusion mechanism for task adaptation in NLU. Jin Cao, Chandana Satya Prakash, Wael Hamza
[Findings] Towards Computationally Feasible Deep Active Learning. Akim Tsvigun, Artem Shelmanov, Gleb Kuzmin, Leonid Sanochkin, Daniil Larionov, Gleb Gennadjevich Gusev, Manvel Avetisian, Leonid Zhukov
[Findings] Pruning Adatperfusion with Lottery Ticket Hypothesis. Jiarun Wu, Qingliang Chen, Zeguan Xiao, Yuliang Gu, Mengsi Sun
Human-Centered NLP
Do Deep Neural Nets Display Human-like Attention in Short Answer Scoring?. Zijie Zeng, XINYU LI, Dragan Gasevic, Guanliang Chen
Towards Process-Oriented, Modular, and Versatile Question Generation that Meets Educational Needs. Xu Wang, Simin Fan, Jessica Houghton, Lu Wang
Information Extraction
Sentence-Level Resampling for Named Entity Recognition. Xiaochen Wang, Yue Wang
Unified Semantic Typing with Meaningful Label Inference. James Y. Huang, Bangzheng Li, Jiashu Xu, Muhao Chen
Crossroads, Buildings and Neighborhoods: A Dataset for Fine-grained Location Recognition. Pei Chen, Haotian Xu, Cheng Zhang, Ruihong Huang
Modeling Task Interactions in Document-Level Joint Entity and Relation Extraction. Liyan Xu, Jinho D. Choi
Information Retrieval and Text Mining
Seed-Guided Topic Discovery with Out-of-Vocabulary Seeds. Yu Zhang, Yu Meng, Xuan Wang, Sheng Wang, Jiawei Han
Improving Neural Models for Radiology Report Retrieval with Lexicon-based Automated Annotation. Luyao Shi, Tanveer Syeda-mahmood, Tyler Baldwin
Is Neural Topic Modelling Better than Clustering? An Empirical Study on Clustering with Contextual Embeddings for Topics. Zihan Zhang, Meng Fang, Ling Chen, Mohammad Reza Namazi Rad
Interpretability and Analysis of Models for NLP
Reframing Human-AI Collaboration for Generating Free-Text Explanations. Sarah Wiegreffe, Jack Hessel, Swabha Swayamdipta, Mark Riedl, Yejin Choi
Implicit n-grams Induced by Recurrence. Xiaobing Sun, Wei Lu
Locally Aggregated Feature Attribution on Natural Language Model Understanding. Sheng Zhang, Jin Wang, Haitao Jiang, Rui Song
[Findings] White-box Testing of NLP models with Mask Neuron Coverage. Arshdeep Sekhon, Yangfeng Ji, Matthew Dwyer, Yanjun Qi
Language Generation
Go Back in Time: Generating Flashbacks in Stories with Event Temporal Prompts. Rujun Han, Hong Chen, Yufei Tian, Nanyun Peng
[Findings] Syntax Controlled Knowledge Graph-to-Text Generation with Order and Semantic Consistency. Jin Liu, chongfeng fan, zhou Fengyu, Huijuan Xu
Language Resources and Evaluation
Semantic Diversity in Dialogue with Natural Language Inference. Katherine Stasaski, Marti Hearst
CS1QA: A Dataset for Assisting Code-based Question Answering in an Introductory Programming Course. Changyoon Lee, Yeon Seonwoo, Alice Oh
The USMLE® Step 2 Clinical Skills Patient Note Corpus. Victoria Yaneva, Janet Mee, Le An Ha, Polina Harik, Michael Jodoin, Alex J Mechaber
Transparent Human Evaluation for Image Captioning. Jungo Kasai, Keisuke Sakaguchi, Lavinia Dunagan, Jacob Daniel Morrison, Ronan Le Bras, Yejin Choi, Noah Smith
ChapterBreak: A Challenge Dataset for Long-Range Language Models. Simeng Sun, Katherine Thai, Mohit Iyyer
TVShowGuess: Character Comprehension in Stories as Speaker Guessing. Yisi Sang, Xiangyang Mou, Mo Yu, Shunyu Yao, Jing Li, Jeffrey Stanton
Machine Translation
Building Multilingual Machine Translation Systems That Serve Arbitrary XY Translations. Akiko Eriguchi, Shufang Xie, Tao Qin, Hany Hassan
Quality-Aware Decoding for Neural Machine Translation. Patrick Fernandes, António Farinhas, Ricardo Rei, José G. C. de Souza, Perez Ogayo, Graham Neubig, Andre Martins
A Study of Syntactic Multi-Modality in Non-Autoregressive Machine Translation. Kexun Zhang, Rui Wang, Xu Tan, Junliang Guo, Yi Ren, Tao Qin, Tie-Yan Liu
Tricks for Training Sparse Translation Models. Dheeru Dua, Shruti Bhosale, Vedanuj Goswami, James Cross, Mike Lewis, Angela Fan
[Findings] When do Contrastive Word Alignments Improve Many-to-many Neural Machine Translation?. Zhuoyuan Mao, Chenhui Chu, Raj Dabre, Haiyue Song, Zhen Wan, Sadao Kurohashi
NLP Applications
Cross-document Misinformation Detection based on Event Graph Reasoning. Xueqing Wu, Kung-Hsiang Huang, Yi Fung, Heng Ji
A Word is Worth A Thousand Dollars: Adversarial Attack on Tweets Fools Stock Prediction. Yong Xie, Dakuo Wang, Pin-Yu Chen, Jinjun Xiong, Sijia Liu, Oluwasanmi O Koyejo
Privacy-Preserving Text Classification on BERT Embeddings with Homomorphic Encryption. Garam Lee, Minsoo Kim, Jai Hyun Park, seung-won hwang, Jung Hee Cheon
[Findings] Harmless Transfer Learning for Item Embeddings. Chengyue Gong, Xiaocong Du, Dhruv Choudhary, Bhargav Bhushanam, qiang liu, Arun Kejariwal
Phonology, Morphology and Word Segmentation
Grapheme-to-Phoneme Conversion for Thai using Neural Regression Models. Tomohiro Yamasaki
Semantics: Lexical Semantics
[Findings] Improving Contextual Representation with Gloss Regularized Pre-training. Yu Lin, Zhecheng An, Peihao Wu, Zejun MA
Semantics: Sentence-level Semantics and Textual Inference
SUBS: Subtree Substitution for Compositional Semantic Parsing. Jingfeng Yang, Le Zhang, Diyi Yang
CoSe-Co: Text Conditioned Generative CommonSense Contextualizer. Rachit Bansal, Milan Aggarwal, Sumit Bhatia, Jivat Neet Kaur, Balaji Krishnamurthy
MuCPAD: A Multi-Domain Chinese Predicate-Argument Dataset. Yahui Liu, Haoping Yang, Chen Gong, Qingrong Xia, Zhenghua Li, Min Zhang
MGIMN: Multi-Grained Interactive Matching Network for Few-shot Text Classification. Jianhai Zhang, Mieradilijiang Maimaiti, Gao Xing, Yuanhang Zheng, Ji Zhang
DocAMR: Multi-Sentence AMR Representation and Evaluation. Tahira Naseem, Austin Blodgett, Sadhana Kumaravel, Tim O'Gorman, Young-Suk Lee, Jeffrey Flanigan, Ramon Fernandez Astudillo, Radu Florian, Salim Roukos, Nathan Schneider
Improving negation detection with negation-focused pre-training. Thinh Hung Truong, Timothy Baldwin, Trevor Cohn, Karin Verspoor
Does Pre-training Induce Systematic Inference? How Masked Language Models Acquire Commonsense Knowledge. Ian Porada, Alessandro Sordoni, Jackie CK Cheung
Partial-input baselines show that NLI models can ignore context, but they don't.. Neha Srikanth, Rachel Rudinger
[Findings] Analytical Reasoning of Text. Wanjun Zhong, Siyuan Wang, Duyu Tang, Zenan Xu, Daya Guo, Yining Chen, Jiahai Wang, Jian Yin, Ming Zhou, Nan Duan
Sentiment Analysis and Stylistic Analysis
A Robustly Optimized BMRC for Aspect Sentiment Triplet Extraction. Shu Liu, Kaiwen Li, Zuhe Li
Data Augmentation with Dual Training for Offensive Span Detection. Nasim Nouri
Multi-Domain Targeted Sentiment Analysis. Orith Toledo-Ronen, Matan Orbach, Yoav Katz, Noam Slonim
UserIdentifier: Implicit User Representations for Simple and Effective Personalized Sentiment Analysis. Fatemehsadat Mireshghallah, Vaishnavi Shrivastava, Milad Shokouhi, Taylor Berg-Kirkpatrick, Robert Sim, Dimitrios Dimitriadis
Speech
[Findings] End-to-end Spoken Conversational Question Answering: Task, Dataset and Model. Chenyu You, Nuo Chen, Fenglin Liu, Shen Ge, Xian Wu, Yuexian Zou
Summarization
TSTR: Too Short to Represent, Summarize with Details! Intro-Guided Extended Summary Generation. Sajad Sotudeh, Nazli Goharian
Improving Multi-Document Summarization through Referenced Flexible Extraction with Credit-Awareness. Yun-Zhu Song, Yi-Syuan Chen, Hong-Han Shuai
SueNes: A Weakly Supervised Approach to Evaluating Single-Document Summarization via Negative Sampling. Forrest Sheng Bao, Ge Luo, Hebi Li, Minghui Qiu, Yinfei Yang, Youbiao He, Cen Chen
Investigating Crowdsourcing Protocols for Evaluating the Factual Consistency of Summaries. Xiangru Tang, Alexander Fabbri, Haoran Li, Ziming Mao, Griffin Thomas Adams, Borui Wang, Asli Celikyilmaz, Yashar Mehdad, Dragomir Radev
Social Event19:00 – 22:00Museum of Pop Culture (MoPOP)
Wednesday, July 13, 2022
Registration and Breakfast7:30 – 9:00Level 3 Foyer & Level 5 Foyer
Oral Session 7 + Virtual Poster Q&A Session 3
8:00 – 9:00
Choose AllRemove All
When a sentence does not introduce a discourse entity, Transformer-based models still sometimes refer to it. Sebastian Schuster, Tal Linzen
Analyzing Encoded Concepts in Transformer Language Models. Hassan Sajjad, Nadir Durrani, Fahim Dalvi, Firoj Alam, Abdul Rafae Khan, Jia Xu
Probing via Prompting. Jiaoda Li, Ryan Cotterell, Mrinmaya Sachan
GlobEnc: Quantifying Global Token Attribution by Incorporating the Whole Encoder Layer in Transformers. Ali Modarressi, Mohsen Fayyaz, Yadollah Yaghoobzadeh, Mohammad Taher Pilehvar
8:00 – 9:00
Choose AllRemove All
From spoken dialogue to formal summary: An utterance rewriting for dialogue summarization. Yue Fang, Hainan Zhang, Hongshen Chen, Zhuoye Ding, Bo Long, Yanyan Lan, Yanquan Zhou
Domain-Oriented Prefix-Tuning: Towards Efficient and Generalizable Fine-tuning for Zero-Shot Dialogue Summarization. Lulu Zhao, Fujia Zheng, Weihao Zeng, Keqing He, Weiran Xu, Huixing Jiang, Wei Wu, Yanan Wu
DialSummEval: Revisiting Summarization Evaluation for Dialogues. Mingqi Gao, Xiaojun Wan
DACSA: A large-scale Dataset for Automatic summarization of Catalan and Spanish newspaper Articles. Encarna Segarra, Vicent Ahuir, Lluís-F. Hurtado, José Ángel González
8:00 – 9:00
Choose AllRemove All
Robust Self-Augmentation for Named Entity Recognition with Meta Reweighting. Linzhi Wu, Pengjun Xie, Jie Zhou, Meishan Zhang, Ma Chunping, Guangwei Xu, Min Zhang
GMN: Generative Multi-modal Network for Practical Document Information Extraction. Haoyu Cao, Jiefeng Ma, Antai Guo, Yiqing Hu, Hao Liu, Deqiang Jiang, Yinsong Liu, Bo Ren
DocEE: A Large-Scale and Fine-grained Benchmark for Document-level Event Extraction. MeiHan Tong, Bin Xu, Shuai Wang, Meihuan Han, Yixin Cao, Jiangqi Zhu, Siyu Chen, Lei Hou, Juanzi Li
HiURE: Hierarchical Exemplar Contrastive Learning for Unsupervised Relation Extraction. Shuliang Liu, Xuming Hu, Chenwei Zhang, Shu'ang Li, Lijie Wen, Philip S. Yu
8:00 – 9:00
Choose AllRemove All
Neighbors Are Not Strangers: Improving Non-Autoregressive Translation under Low-Frequency Lexical Constraints. Chun Zeng, Jiangjie Chen, Tianyi Zhuang, Rui Xu, Hao Yang, Qin Ying, shimin tao, Yanghua Xiao
Nearest Neighbor Knowledge Distillation for Neural Machine Translation. Zhixian Yang, Renliang Sun, Xiaojun Wan
Cross-modal Contrastive Learning for Speech Translation. Rong Ye, Mingxuan Wang, Lei Li
One Reference Is Not Enough: Diverse Distillation with Reference Selection for Non-Autoregressive Translation. Chenze Shao, Xuanfu Wu, Yang Feng
8:00 – 9:00
Choose AllRemove All
Less is More: Learning to Refine Dialogue History for Personalized Dialogue Generation. Hanxun Zhong, Zhicheng Dou, Yutao Zhu, Hongjin Qian, Ji-Rong Wen
Diversifying Neural Dialogue Generation via Negative Distillation. Yiwei Li, Shaoxiong Feng, Bin Sun, Kan Li
Learning as Conversation: Dialogue Systems Reinforced for Information Acquisition. Pengshan Cai, Hui Wan, Fei Liu, Mo Yu, hong yu, Sachindra Joshi
Enhancing Knowledge Selection for Grounded Dialogues via Document Semantic Graphs. Sha Li, Mahdi Namazifar, Di Jin, Mohit Bansal, Heng Ji, Yang Liu, Dilek Hakkani-Tur
8:00 – 9:00
Choose AllRemove All
LaMemo: Language Modeling with Look-Ahead Memory. Haozhe Ji, Rongsheng Zhang, Zhenyu Yang, Zhipeng Hu, Minlie Huang
[TACL] Formal Language Recognition by Hard Attention Transformers: Perspectives from Circuit Complexity. Yiding Hao, Dana Angluin, Robert Evan Frank
Symbolic Knowledge Distillation: from General Language Models to Commonsense Models. Peter West, Chandra Bhagavatula, Jack Hessel, Jena D. Hwang, Liwei Jiang, Ronan Le Bras, Ximing Lu, Sean Welleck, Yejin Choi
WECHSEL: Effective initialization of subword embeddings for cross-lingual transfer of monolingual language models. Benjamin Minixhofer, Fabian Paischer, Navid Rekabsaz
8:00 – 9:00
Computational Social Science and Cultural Analytics
[SRW] Again, Dozens of Refugees Drowned: A Computational Study of Political Framing Evoked by Presuppositions. Qi Yu
Political Ideology and Polarization: A Multi-dimensional Approach. Barea Sinno, Bernardo Oviedo, Katherine Atwell, Malihe Alikhani, Junyi Jessy Li
Combining Humor and Sarcasm for Improving Political Parody Detection. Xiao Ao, Danae Sanchez Villegas, Daniel Preotiuc-Pietro, Nikolaos Aletras
Counterfactually Augmented Data and Unintended Bias: The Case of Sexism and Hate Speech Detection. Indira Sen, Mattia Samory, Claudia Wagner, Isabelle Augenstein
Conceptualizing Treatment Leakage in Text-based Causal Inference. Adel Daoud, Connor Thomas Jerzak, Richard Johansson
[Findings] DISARM: Detecting the Victims Targeted by Harmful Memes. Shivam Sharma, Md Shad Akhtar, Preslav Nakov, Tanmoy Chakraborty
[Findings] Analyzing the Intensity of Complaints on Social Media. MING FANG, Shi Zong, Jing Li, Xinyu Dai, Shujian Huang, Jiajun Chen
[Findings] CRUSH: Contextually Regularized and User anchored Self-supervised Hate speech Detection. Souvic Chakraborty, Parag Dutta, Sumegh Roychowdhury, Animesh Mukherjee
Efficient Methods in NLP
[SRW] Impact of Training Instance Selection on Domain-Specific Entity Extraction using BERT. Eileen Salhofer, Xing Lan Liu, Roman Kern
Leaner and Faster: Two-Stage Model Compression for Lightweight Text-Image Retrieval. Siyu Ren, Kenny Q. Zhu
Exact Paired-Permutation Testing for Structured Test Statistics. Ran Zmigrod, Tim Vieira, Ryan Cotterell
[Findings] Efficient Learning of Multiple NLP Tasks via Collective Weight Factorization on BERT. Christos Charalampos Papadopoulos, Yannis Panagakis, Manolis Koubarakis, Mihalis Nicolaou
[Findings] RAIL-KD: RAndom Intermediate Layer Mapping for Knowledge Distillation. Md Akmal Haidar, NITHIN ANCHURI, Mehdi Rezagholizadeh, Abbas Ghaddar, Philippe Langlais, Pascal Poupart
Ethics, Bias, and Fairness
[SRW] Text Style Transfer for Bias Mitigation using Masked Language Modeling. Ewoenam Kwaku Tokpo, Toon Calders
[SRW] Differentially Private Instance Encoding against Privacy Attacks. Shangyu Xie, Yuan Hong
Triggerless Backdoor Attack for NLP Tasks with Clean Labels. Leilei Gan, Jiwei Li, Tianwei Zhang, Xiaoya Li, Yuxian Meng, Fei Wu, Yi Yang, Shangwei Guo, Chun Fan
[Findings] An Information-Theoretic Approach and Dataset for Probing Gender Stereotypes in Multilingual Masked Language Models. Victor Steinborn, Philipp Dufter, Haris Jabbar, Hinrich Schuetze
Human-Centered NLP
What Makes a Good and Useful Summary? Incorporating Users in Automatic Summarization Research. Maartje Ter Hoeve, Julia Kiseleva, Maarten de Rijke
[Findings] Quiz Design Task: Helping Teachers Create Quizzes with Automated Question Generation. Philippe Laban, Chien-Sheng Wu, Lidiya Murakhovs'ka, Wenhao Liu, Caiming Xiong
Language Generation
[SRW] Methods for Estimating and Improving Robustness of Language Models. Michal Stefanik
Cross-Domain Detection of GPT-2-Generated Technical Text. Juan Diego Rodriguez, Todd Hay, David Gros, Zain Shamsi, Ravi Srinivasan
[Findings] Revisiting Generative Commonsense Reasoning: A Pre-Ordering Approach. Chao Zhao, Faeze Brahman, Tenghao Huang, Snigdha Chaturvedi
[Findings] Learning from Bootstrapping and Stepwise Reinforcement Reward: A Semi-Supervised Framework for Text Style Transfer. Zhengyuan Liu, Nancy F. Chen
Language Grounding to Vision, Robotics and Beyond
FOAM: A Follower-aware Speaker Model For Vision-and-Language Navigation. Zi-Yi Dou, Nanyun Peng
[Findings] KD-VLP: Improving End-to-End Vision-and-Language Pretraining with Object Knowledge Distillation. Yongfei Liu, Chenfei Wu, Shao-Yen Tseng, Vasudev Lal, Xuming He, Nan Duan
[Findings] Cross-Lingual Cross-Modal Consolidation for Effective Multilingual Video Corpus Moment Retrieval. Jiaheng Liu, Tan Yu, Hanyu Peng, Mingming Sun, Ping Li
Language Resources and Evaluation
Beyond Emotion: A Multi-Modal Dataset for Human Desire Understanding. Ao Jia, Yu He, Yazhou Zhang, Sagar Uprety, Dawei Song, Christina Lioma
Are All the Datasets in Benchmark Necessary? A Pilot Study of Dataset Evaluation for Text Classification. Yang Xiao, Jinlan Fu, See-Kiong Ng, Pengfei Liu
[Findings] MultiNERD: A Multilingual, Multi-Genre and Fine-Grained Dataset for Named Entity Recognition (and Disambiguation). Simone Tedeschi, Roberto Navigli
[Findings] ID10M: Idiom Identification in 10 Languages. Simone Tedeschi, Federico Martelli, Roberto Navigli
Machine Learning for NLP: Classification and Structured Prediction Models
CIAug: Equipping Interpolative Augmentation with Curriculum Learning. Ramit Sawhney, Ritesh Singh Soun, Shrey Pandit, Megh Thakkar, Sarvagya Malaviya, Yuval Pinter
EPiDA: An Easy Plug-in Data Augmentation Framework for High Performance Text Classification. Minyi Zhao, Lu Zhang, Yi Xu, Jiandong Ding, Jihong Guan, Shuigeng Zhou
NLP Applications
Enhancing Self-Attention with Knowledge-Assisted Attention Maps. Jiangang Bai, Yujing Wang, Hong Sun, Ruonan Wu, Tianmeng Yang, Pengfei Tang, Defu Cao, Mingliang Zhang, Yunhai Tong, Yaming Yang, Jing Bai, Ruofei Zhang, Hao Sun, Wei Shen
Natural Language Inference with Self-Attention for Veracity Assessment of Pandemic Claims. Miguel Arana-Catania, Elena Kochkina, Arkaitz Zubiaga, Maria Liakata, Robert Procter, Yulan He
ValCAT: Variable-Length Contextualized Adversarial Transformations Using Encoder-Decoder Language Model. Chuyun Deng, Mingxuan Liu, Yue Qin, Jia Zhang, Hai-Xin Duan, Donghong Sun
DynamicTOC: Persona-based Table of Contents for Consumption of Long Documents. Himanshu Maheshwari, Nethraa Sivakumar, Shelly Jain, Tanvi Karandikar, Vinay Aggarwal, Navita Goyal, Sumit Shekhar
Non-Autoregressive Chinese ASR Error Correction with Phonological Training. Zheng Fang, Ruiqing Zhang, Zhongjun He, Hua Wu, Yanan Cao
[Findings] Measuring and Improving Compositional Generalization in Text-to-SQL via Component Alignment. Yujian Gan, Xinyun Chen, Qiuping Huang, Matthew Purver
[Findings] CODE-MVP: Learning to Represent Source Code from Multiple Views with Contrastive Pre-Training. Xin Wang, Yasheng Wang, Yao Wan, Jiawei Wang, Pingyi Zhou, Li Li, Hao Wu, Jin Liu
[Findings] Unbiased Math Word Problems Benchmark for Mitigating Solving Bias. ZhiCheng Yang, Jinghui Qin, Jiaqi Chen, Xiaodan Liang
[Findings] Pathway2Text: Dataset and Method for Biomedical Pathway Description Generation. Junwei Yang, Zequn Liu, Ming Zhang, Sheng Wang
[Findings] D2GCLF: Document-to-Graph Classifier for Legal Document Classification. Qiqi Wang, Kaiqi Zhao, Robert Amor, Benjamin Liu, Ruofan Wang
[Findings] Explore More Guidance: A Task-aware Instruction Network for Sign Language Translation Enhanced with Data Augmentation. Yong Cao, Wei Li, Xianzhi Li, Min Chen, Guangyong Chen, Long Hu, Zhengdao Li, Kai Hwang
[Findings] Query2Particles: Knowledge Graph Reasoning with Particle Embeddings. Jiaxin Bai, Zihao Wang, Hongming Zhang, Yangqiu Song
Question Answering
[SRW] Eliciting Complex Relational Knowledge From Masked Language Models. Arun Sundaresan, Ming Hsu, Zhihao Zhang
Understand before Answer: Improve Temporal Reading Comprehension via Precise Question Understanding. Hao Huang, Xiubo Geng, Guodong Long, Daxin Jiang
Re2G: Retrieve, Rerank, Generate. Michael Glass, Gaetano Rossiello, Md Faisal Mahbub Chowdhury, Ankita Naik, Pengshan Cai, Alfio Gliozzo
[Findings] Seeing the wood for the trees: a contrastive regularization method for the low-resource Knowledge Base Question Answering. Junping Liu, Shijie Mei, Xinrong Hu, Xun Yao, JACK Yang, Yi Guo
[Findings] To Answer or Not To Answer? Improving Machine Reading Comprehension Model with Span-based Contrastive Learning. Yunjie Ji, Liangyu Chen, Chenxiao Dou, Baochang Ma, Xiangang Li
[Findings] All Information is Valuable: Question Matching over Full Information Transmission Network. Le Qi, Yu Zhang, Qingyu Yin, Guidong Zheng, wen junjie, Jinlong Li, Ting Liu
[Findings] $Great~Truths~are ~Always ~Simple:$ A Rather Simple Knowledge Encoder for Enhancing the Commonsense Reasoning Capacity of Pre-Trained Models. Jinhao Jiang, Kun Zhou, Ji-Rong Wen, Xin Zhao
[Findings] Capturing Conversational Interaction for Question Answering via Global History Reasoning. Jin Qian, Bowei Zou, Mengxing Dong, Xiao Li, AiTi Aw, Yu Hong
[Findings] Continual Machine Reading Comprehension via Uncertainty-aware Fixed Memory and Adversarial Domain Adaptation. Zhijing Wu, Hua Xu, Jingliang Fang, Kai Gao
Semantics: Sentence-level Semantics and Textual Inference
Label Definitions Improve Semantic Role Labeling. Li Zhang, Ishan Jindal, Yunyao Li
Sentiment Analysis and Stylistic Analysis
[SRW] Static and Dynamic Speaker Modeling based on Graph Neural Network for Emotion Recognition in Conversation. Prakhar Saxena, Yin Jou Huang, Sadao Kurohashi
Aspect Is Not You Need: No-aspect Differential Sentiment Framework for Aspect-based Sentiment Analysis. Jiahao Cao, Rui Liu, Huailiang Peng, Lei Jiang, Xu Bai
Generative Cross-Domain Data Augmentation for Aspect and Opinion Co-Extraction. Junjie Li, Jianfei Yu, Rui Xia
[Findings] A Dual-Channel Framework for Sarcasm Recognition by Detecting Sentiment Conflict. Yiyi Liu, Yequan Wang, Aixin Sun, Xuying Meng, Jing Li, Jiafeng Guo
[Findings] CLMLF:A Contrastive Learning and Multi-Layer Fusion Method for Multimodal Sentiment Detection. Zhen Li, Bing Xu, Conghui Zhu, Tiejun Zhao
Speech
[SRW] Towards Unsupervised Speech Synthesis. Alexander H. Liu, Cheng-I Lai, James R. Glass
[SRW] Investigating the effectiveness of various speaker embeddings for multi-speaker end-to-end speech synthesis system using small-sized speech data. Sheng-Yao Wang, Yi-Chin Huang
[SRW] Multiformer: A Head-Configurable Transformer-Based Model for Direct Speech Translation. Gerard Sant, Gerard I. Gállego, Belen Alastruey, Marta Ruiz Costa-jussà
Quantifying Language Variation Acoustically with Few Resources. Martijn Bartelds, Martijn Wieling
[Findings] FAtNet: Cost-Effective Approach Towards Mitigating the Linguistic Bias in Speaker Verification Systems. Divya V Sharma, Arun Balaji Buduru
Syntax: Tagging, Chunking, and Parsing
[SRW] Simulating Feature Structures with Simple Types. Valentin D. Richard
[Findings] Penn-Helsinki Parsed Corpus of Early Modern English: First Parsing Results and Analysis. Seth Kulick, Neville Ryant, Beatrice Santorini
[Findings] SHARP: Search-Based Adversarial Attack for Structured Prediction. Liwen Zhang, Zixia Jia, Wenjuan Han, Zilong Zheng, Kewei Tu
Break9:00 – 9:15Regency A & B
Oral Session 8 + Virtual Poster Q&A Session 4
9:15 – 10:15
Choose AllRemove All
Exploiting Inductive Bias in Transformers for Unsupervised Disentanglement of Syntax and Semantics with VAEs. Ghazi Felhi, Joseph Le Roux, Djamé Seddah
Time Waits for No One! Analysis and Challenges of Temporal Misalignment. Kelvin Luu, Daniel Khashabi, Suchin Gururangan, Karishma Mandyam, Noah Smith
What do Toothbrushes do in the Kitchen? How Transformers Think our World is Structured. Alexander Henlein, Alexander Mehler
A Study of the Attention Abnormality in Trojaned BERTs. Weimin Lyu, Songzhu Zheng, Tengfei Ma, Chao Chen
9:15 – 10:15
Choose AllRemove All
Mitigating Toxic Degeneration with Empathetic Data: Exploring the Relationship Between Toxicity and Empathy. Allison Lahnala, Charles Welch, Béla Neuendorf, Lucie Flek
Hatemoji: A Test Suite and Adversarially-Generated Dataset for Benchmarking and Detecting Emoji-Based Hate. Hannah Rose Kirk, Bertie Vidgen, Paul Rottger, Tristan Thrush, Scott A. Hale
A Holistic Framework for Analyzing the COVID-19 Vaccine Debate. Maria Leonor Pacheco, Tunazzina Islam, Monal Mahajan, Andrey Shor, Ming Yin, Lyle Ungar, Dan Goldwasser
Hate Speech and Counter Speech Detection: Conversational Context Does Matter. Xinchen Yu, Eduardo Blanco, Lingzi Hong
9:15 – 10:15
Choose AllRemove All
[TACL] Uncertainty Estimation and Reduction of Pre-trained Models for Text Regression. Yuxia Wang, Daniel Beck, Timothy Baldwin, Karin Verspoor
[TACL] Heterogeneous Supervised Topic Models. Dhanya Sridhar, Hal Daumé III, David Meir Blei
Two Contrasting Data Annotation Paradigms for Subjective NLP Tasks. Paul Rottger, Bertie Vidgen, Dirk Hovy, Janet B. Pierrehumbert
On the Machine Learning of Ethical Judgments from Natural Language. Zeerak Talat, Hagen Blix, Josef Valvoda, Maya Indira Ganesh, Ryan Cotterell, Adina Williams
9:15 – 10:15
Choose AllRemove All
SURF: Semantic-level Unsupervised Reward Function for Machine Translation. Atijit Anuchitanukul, Julia Ive
Reducing Disambiguation Biases in NMT by Leveraging Explicit Word Sense Information. Niccolò Campolungo, Tommaso Pasini, Denis Emelin, Roberto Navigli
Jam or Cream First? Modeling Ambiguity in Neural Machine Translation with SCONES. Felix Stahlberg, Shankar Kumar
Generating Authentic Adversarial Examples beyond Meaning-preserving with Doubly Round-trip Translation. Siyu Lai, Zhen Yang, Fandong Meng, Xue Zhang, Yufeng Chen, Jinan Xu, Jie Zhou
9:15 – 10:15
Choose AllRemove All
Fine-tuning Pre-trained Language Models for Few-shot Intent Detection: Supervised Pre-training and Isotropization. Haode Zhang, Haowen Liang, Yuwei Zhang, Li-Ming Zhan, Xiao-Ming Wu, Xiaolei Lu, Albert Y.S. Lam
You Don’t Know My Favorite Color: Preventing Dialogue Representations from Revealing Speakers’ Private Personas. Haoran Li, Yangqiu Song, Lixin Fan
Unsupervised Slot Schema Induction for Task-oriented Dialog. Dian Yu, Mingqiu Wang, Yuan Cao, Izhak Shafran, Laurent El Shafey, Hagen Soltau
CHAI: A CHatbot AI for Task-Oriented Dialogue with Offline Reinforcement Learning. Siddharth Verma, Justin Fu, Sherry Yang, Sergey Levine
9:15 – 10:15
Choose AllRemove All
Prompt Waywardness: The Curious Case of Discretized Interpretation of Continuous Prompts. Daniel Khashabi, Xinxi Lyu, Sewon Min, Lianhui Qin, Kyle Richardson, Sean Welleck, Hannaneh Hajishirzi, Tushar Khot, Ashish Sabharwal, Sameer Singh, Yejin Choi
MetaICL: Learning to Learn In Context. Sewon Min, Mike Lewis, Luke Zettlemoyer, Hannaneh Hajishirzi
Learning To Retrieve Prompts for In-Context Learning. Ohad Rubin, Jonathan Herzig, Jonathan Berant
IDPG: An Instance-Dependent Prompt Generation Method. Zhuofeng Wu, Sinong Wang, Jiatao Gu, Rui Hou, Yuxiao Dong, V.G.Vinod Vydiswaran, Hao Ma
9:15 – 10:15
Discourse and Pragmatics
Incorporating Centering Theory into Neural Coreference Resolution. Haixia Chai, Michael Strube
Efficient Methods in NLP
[Findings] ALLSH: Active Learning Guided by Local Sensitivity and Hardness. Shujian Zhang, Chengyue Gong, Xingchao Liu, Pengcheng He, Weizhu Chen, Mingyuan Zhou
Ethics, Bias, and Fairness
Easy Adaptation to Mitigate Gender Bias in Multilingual Text Classification. Xiaolei Huang
Socially Aware Bias Measurements for Hindi Language Representations. Vijit Malik, Sunipa Dev, Akihiro Nishi, Nanyun Peng, Kai-Wei Chang
Recognition of They/Them as Singular Personal Pronouns in Coreference Resolution. Connor Baumler, Rachel Rudinger
Information Extraction
[SRW] Dr. Livingstone, I presume? Polishing of foreign character identification in literary texts. Aleksandra Konovalova, Antonio Toral, Kristiina Taivalkoski-Shilov
[SRW] CSSS: A Novel Candidate Summary Selection Strategy for Summary-level Extractive Summarization. Shuai Gong, Zhenfang Zhu, Wenqing Wu, Zhen Zhao, Dianyuan Zhang
EmRel: Joint Representation of Entities and Embedded Relations for Multi-triple Extraction. Benfeng Xu, Quan Wang, Yajuan Lyu, Yabing Shi, Yong Zhu, Jie Gao, Zhendong Mao
CompactIE: Compact Facts in Open Information Extraction. Farima Fatahi Bayat, Nikita Bhutani, H. Jagadish
Document-Level Relation Extraction with Sentences Importance Estimation and Focusing. Wang Xu, Kehai Chen, Lili Mou, Tiejun Zhao
ITA: Image-Text Alignments for Multi-Modal Named Entity Recognition. Xinyu Wang, Min Gui, Yong Jiang, Zixia Jia, Nguyen Bach, Tao Wang, Zhongqiang Huang, Kewei Tu
[Findings] Hierarchical Relation-Guided Type-Sentence Alignment for Long-Tail Relation Extraction with Distant Supervision. Yang Li, Guodong Long, Tao Shen, Jing Jiang
[Findings] Good Visual Guidance Make A Better Extractor: Hierarchical Visual Prefix for Multimodal Entity and Relation Extraction. Xiang Chen, Ningyu Zhang, Lei Li, Yunzhi Yao, Shumin Deng, Chuanqi Tan, Fei Huang, Luo Si, Huajun Chen
[Findings] Label Refinement via Contrastive Learning for Distantly-Supervised Named Entity Recognition. Huaiyuan Ying, Shengxuan Luo, Tiantian Dang, Sheng Yu
Language Grounding to Vision, Robotics and Beyond
Disentangled Action Recognition with Knowledge Bases. Zhekun Luo, Shalini Ghosh, Devin Guillory, Keizo Kato, Trevor Darrell, Huijuan Xu
Twitter-COMMs: Detecting Climate, COVID, and Military Multimodal Misinformation. Giscard Biamby, Grace Luo, Trevor Darrell, Anna Rohrbach
MCSE: Multimodal Contrastive Learning of Sentence Embeddings. Miaoran Zhang, Marius Mosbach, David Ifeoluwa Adelani, Michael A. Hedderich, Dietrich Klakow
[Findings] Fine-grained Image Captioning with CLIP Reward. Jaemin Cho, Seunghyun Yoon, Ajinkya Kale, Franck Dernoncourt, Trung Bui, Mohit Bansal
[Findings] CLEAR: Improving Vision-Language Navigation with Cross-Lingual, Environment-Agnostic Representations. Jialu Li, Hao Tan, Mohit Bansal
[Findings] What kinds of errors do reference resolution models make and what can we learn from them?. Jorge Sánchez, Mauricio Mazuecos, Hernán Maina, Luciana Benotti
[Findings] Negative Sample is Negative in Its Own Way: Tailoring Negative Sentences for Image-Text Retrieval. Zhihao Fan, zhongyu wei, Zejun Li, Siyuan Wang, Xuanjing Huang, Jianqing Fan
[Findings] RoViST: Learning Robust Metrics for Visual Storytelling. Eileen Wang, Caren Han, Josiah Poon
Language Resources and Evaluation
[SRW] Zuo Zhuan Ancient Chinese Dataset for Word Sense Disambiguation. Xiaomeng Pan, Hongfei Wang, Teruaki Oka, Mamoru Komachi
SwahBERT: Language Model of Swahili. Gati L Martin, Medard Medard Mswahili, Young-Seob Jeong, Jiyoung Woo
[Findings] BanglaBERT: Language Model Pretraining and Benchmarks for Low-Resource Language Understanding Evaluation in Bangla. Abhik Bhattacharjee, Tahmid Hasan, Wasi Uddin Ahmad, Kazi Samin Mubasshir, Md Saiful Islam, Anindya Iqbal, M. Sohel Rahman, Rifat Shahriyar
[Findings] EVI: Multilingual Spoken Dialogue Tasks and Dataset for Knowledge-Based Enrolment, Verification, and Identification. Georgios P. Spithourakis, Ivan Vulić, Michał Lis, Inigo Casanueva, Paweł Budzianowski
[Findings] Detecting Narrative Elements in Informational Text. Effi Levi, Guy Mor, Tamir Sheafer, Shaul Rafael Shenhav
Multilinguality
Pretrained Models for Multilingual Federated Learning. Orion Weller, Marc Marone, Vladimir Braverman, Dawn Lawrie, Benjamin Van Durme
BAD-X: Bilingual Adapters Improve Zero-Shot Cross-Lingual Transfer. Marinela Parović, Goran Glavaš, Ivan Vulić, Anna Korhonen
Towards Debiasing Translation Artifacts. KOEL DUTTA CHOWDHURY, Rricha Jalota, Cristina España-Bonet, Josef van Genabith
[Findings] FreeTransfer-X: Safe and Label-Free Cross-Lingual Transfer from Off-the-Shelf Models. Yinpeng Guo, Liangyou Li, Xin Jiang, Qun Liu
[Findings] Uncertainty-Aware Cross-Lingual Transfer with Pseudo Partial Labels. Shuo Lei, Xuchao Zhang, Jianfeng He, Fanglan Chen, Chang-Tien Lu
[Findings] Multi-stage Distillation Framework for Cross-Lingual Semantic Similarity Matching. Kunbo Ding, Weijie Liu, Yuejian Fang, Zhe Zhao, Qi Ju, Xuefeng Yang, Rong Tian, Zhu Tao, Haoyan Liu, Han Guo, Xingyu Bai, Weiquan Mao, Yudong Li, Weigang Guo, Taiqiang Wu, Ningyuan Sun
NLP Applications
[SRW] Understanding Long Document with Different Position-Aware Attentions. Hai Pham, Guoxin Wang, Yijuan Lu, Dinei Florencio, Cha Zhang
A Shoulder to Cry on: Towards A Motivational Virtual Assistant for Assuaging Mental Agony. Tulika Saha, Saichethan Miriyala Reddy, Anindya Sundar Das, Sriparna Saha, Pushpak Bhattacharyya
TWEETSPIN: Fine-grained Propaganda Detection in Social Media Using Multi-View Representations. Prashanth Vijayaraghavan, Soroush Vosoughi
Question Answering
Cooperative Self-training of Machine Reading Comprehension. Hongyin Luo, Shang-Wen Li, Mingye Gao, Seunghak Yu, James R. Glass
Ask Me Anything in Your Native Language. Nikita Sorokin, Dmitry Abulkhanov, Irina Piontkovskaya, Valentin Malykh
Yes, No or IDK: The Challenge of Unanswerable Yes/No Questions. Elior Sulem, Jamaal Hay, Dan Roth
DREAM: Improving Situational QA by First Elaborating the Situation. Yuling Gu, Bhavana Dalvi, Peter Clark
OPERA: Operation-Pivoted Discrete Reasoning over Text. Yongwei Zhou, Junwei Bao, Chaoqun Duan, Haipeng Sun, jiahui liang, Yifan Wang, Jing Zhao, Youzheng Wu, Xiaodong He, Tiejun Zhao
TIE: Topological Information Enhanced Structural Reading Comprehension on Web Pages. Zihan Zhao, Lu Chen, Ruisheng Cao, Hongshen Xu, Xingyu Chen, Kai Yu
Long Context Question Answering via Supervised Contrastive Learning. Avi Caciularu, Ido Dagan, Jacob Goldberger, Arman Cohan
Dynamic Multistep Reasoning based on Video Scene Graph for Video Question Answering. JianGuo Mao, Wenbin Jiang, Xiangdong Wang, Zhifan Feng, Yajuan Lyu, Hong Liu, Yong Zhu
A New Concept of Knowledge based Question Answering (KBQA) System for Multi-hop Reasoning. Yu Wang, Vijay Srinivasan, Hongxia Jin
ProQA: Structural Prompt-based Pre-training for Unified Question Answering. Wanjun Zhong, Yifan Gao, Ning Ding, Yujia Qin, Zhiyuan Liu, Ming Zhou, Jiahai Wang, Jian Yin, Nan Duan
[Findings] Multi-Hop Open-Domain Question Answering over Structured and Unstructured Knowledge. Yue Feng, Zhen Han, Mingming Sun, Ping Li
[Findings] Crake: Causal-Enhanced Table-Filler for Question Answering over Large Scale Knowledge Base. Minhao Zhang, Ruoyu Zhang, Yanzeng Li, Lei Zou
Semantics: Sentence-level Semantics and Textual Inference
Paragraph-based Transformer Pre-training for Multi-Sentence Inference. Luca Di Liello, Siddhant Garg, Luca Soldaini, Alessandro Moschitti
Few-Shot Semantic Parsing with Language Models Trained on Code. Richard Shin, Benjamin Van Durme
Summarization
[SRW] Few-shot fine-tuning SOTA summarization models for medical dialogues. David Fraile Navarro, Mark Dras, Shlomo Berkovsky
Reference-free Summarization Evaluation via Semantic Correlation and Compression Ratio. Yizhu Liu, Qi Jia, Kenny Q. Zhu
[Findings] Data Augmentation for Low-Resource Dialogue Summarization. Yongtai Liu, Joshua Maynez, Gonçalo Simões, Shashi Narayan
[Findings] OTExtSum: Extractive Text Summarisation with Optimal Transport. Peggy Tang, Kun Hu, Rui Yan, Lei Zhang, Junbin Gao, Zhiyong Wang
[Findings] Exploring Neural Models for Query-Focused Summarization. Jesse Vig, Alexander Fabbri, Wojciech Maciej Kryscinski, Chien-Sheng Wu, Wenhao Liu
[Findings] Post-Training Dialogue Summarization using Pseudo-Paraphrasing. Qi Jia, Yizhu Liu, Haifeng Tang, Kenny Q. Zhu
[Findings] TANet: Thread-Aware Pretraining for Abstractive Conversational Summarization. Ze Yang, Christian WANG, Zhoujin Tian, Wei Wu, Zhoujun Li
Break10:15 – 10:45Regency A & B
Oral Session 9 + Findings In-person Poster Session 1
10:45 – 12:15
Choose AllRemove All
 FRUIT: Faithfully Reflecting Updated Information in Text. Robert L. Logan IV, Alexandre Tachard Passos, Sameer Singh, Ming-Wei Chang
Persona-Guided Planning for Controlling the Protagonist’s Persona in Story Generation. Zhexin Zhang, Jiaxin Wen, Jian Guan, Minlie Huang
Fuse It More Deeply! A Variational Transformer with Layer-Wise Latent Variable Inference for Text Generation. Jinyi Hu, Xiaoyuan Yi, Wenhao Li, Maosong Sun, Xing Xie
Overcoming Catastrophic Forgetting During Domain Adaptation of Seq2seq Language Generation. Dingcheng Li, Zheng Chen, Eunah Cho, Jie Hao, Xiaohu Liu, Xing Fan, Chenlei Guo, Yang Liu
Zero-shot Sonnet Generation with Discourse-level Planning and Aesthetics Features. Yufei Tian, Nanyun Peng
10:45 – 12:15
Choose AllRemove All
Textless Speech-to-Speech Translation on Real Data. Ann Lee, Hongyu Gong, Paul-Ambroise Duquenne, Holger Schwenk, Peng-Jen Chen, Changhan Wang, Sravya Popuri, Yossi Adi, Juan Pino, Jiatao Gu, Wei-Ning Hsu
Unsupervised Stem-based Cross-lingual Part-of-Speech Tagging for Morphologically Rich Low-Resource Languages. Ramy Eskander, Cass Lowry, Sujay Khandagale, Judith Lynn Klavans, Maria Polinsky, Smaranda Muresan
Quantifying Synthesis and Fusion and their Impact on Machine Translation. Arturo Oncevay, Duygu Ataman, Niels van Berkel, Barry Haddow, Alexandra Birch, Johannes Bjerva
On the Use of External Data for Spoken Named Entity Recognition. Ankita Pasad, Felix Wu, Suwon Shon, Karen Livescu, Kyu Han
Empathic Machines: Using Intermediate Features as Levers to Emulate Emotions in Text-To-Speech Systems. Saiteja Kosgi, Sarath Sivaprasad, Niranjan Pedanekar, Anil Kumar Nelakanti, Vineet Gandhi
10:45 – 12:15
Choose AllRemove All
Improving Entity Disambiguation by Reasoning over a Knowledge Base. Tom Ayoola, Joseph Fisher, Andrea Pierleoni
DocTime: A Document-level Temporal Dependency Graph Parser. Puneet Mathur, Vlad I Morariu, Verena Kaynig-Fittkau, Jiuxiang Gu, Franck Dernoncourt, Quan Hung Tran, Ani Nenkova, Dinesh Manocha, Rajiv Jain
SAIS: Supervising and Augmenting Intermediate Steps for Document-Level Relation Extraction. Yuxin Xiao, Zecheng Zhang, Yuning Mao, Carl Yang, Jiawei Han
Should We Rely on Entity Mentions for Relation Extraction? Debiasing Relation Extraction with Counterfactual Analysis. Yiwei Wang, Muhao Chen, Wenxuan Zhou, Yujun Cai, Yuxuan Liang, Dayiheng Liu, Baosong Yang, Juncheng Liu, Bryan Hooi
MINION: a Large-Scale and Diverse Dataset for Multilingual Event Detection. Amir Pouran Ben Veyseh, Minh Van Nguyen, Franck Dernoncourt, Thien Huu Nguyen
GenIE: Generative Information Extraction. Martin Josifoski, Nicola De Cao, Maxime Peyrard, Fabio Petroni, Robert West
10:45 – 12:15
Choose AllRemove All
Models in the Loop: Aiding Crowdworkers with Generative Annotation Assistants. Max Bartolo, Tristan Thrush, Sebastian Riedel, Pontus Stenetorp, Robin Jia, Douwe Kiela
MultiCite: Modeling realistic citations requires moving beyond the single-sentence single-label setting. Anne Lauscher, Brandon Ko, Bailey Kuehl, Sophie Johnson, Arman Cohan, David Jurgens, Kyle Lo
 NewsEdits: A News Article Revision Dataset and a Novel Document-Level Reasoning Challenge. Alexander Spangher, Xiang Ren, Jonathan May, Nanyun Peng
Explaining Dialogue Evaluation Metrics using Adversarial Behavioral Analysis. Baber Khalid, SUNGJIN LEE
Answer Consolidation: Formulation and Benchmarking. Wenxuan Zhou, Qiang Ning, Heba Elfardy, Kevin Small, Muhao Chen
[TACL] Data-driven Model Generalizability in Crosslinguistic Low-resource Morphological Segmentation. Zoey Liu, Emily Prud’hommeau
10:45 – 12:15
Choose AllRemove All
Efficient Constituency Tree based Encoding for Natural Language to Bash Translation. Shikhar Bharadwaj, Shirish Shevade
Multi-Relational Graph Transformer for Automatic Short Answer Grading. Rajat Agarwal, Varun Khurana, Karish Grover, Mukesh Mohania, Vikram Goyal
ConfliBERT: A Pre-trained Language Model for Political Conflict and Violence. Yibo Hu, MohammadSaleh Hosseini, Erick Skorupa Parolin, Javier Osorio, Latifur Khan, Patrick Brandt, Vito D'Orazio
Semantically Informed Slang Interpretation. Zhewei Sun, Richard Zemel, Yang Xu
Don’t sweat the small stuff, classify the rest: Sample Shielding to protect text classifiers against adversarial attacks. Jonathan Rusert, Padmini Srinivasan
GRAM: Fast Fine-tuning of Pre-trained Language Models for Content-based Collaborative Filtering. Yoonseok Yang, Kyu Seok Kim, Minsam Kim, Juneyoung Park
10:45 – 12:15 Regency A & B
Computational Social Science and Cultural Analytics
[Findings] Modeling Ideological Salience and Framing in Polarized Online Groups with Graph Neural Networks and Structured Sparsity. Valentin Hofmann, Xiaowen Dong, Janet B. Pierrehumbert, Hinrich Schuetze
[Findings] HUE: Pretrained Model and Dataset for Understanding Hanja Documents of Ancient Korea. Haneul Yoo, Jiho Jin, Juhee Son, JinYeong Bak, Kyunghyun Cho, Alice Oh
Dialogue and Interactive Systems
[Findings] Empathetic Persuasion: Reinforcing Empathy and Persuasiveness in Dialogue Systems. Azlaan Mustafa Samad, Kshitij Mishra, Mauajama Firdaus, Asif Ekbal
[Findings] Target-Guided Dialogue Response Generation Using Commonsense and Data Augmentation. Prakhar Gupta, Harsh Jhamtani, Jeffrey Bigham
[Findings] Balancing Multi-Domain Corpora Learning for Open-Domain Response Generation. Yujie Xing, Jinglun Cai, Nils Barlaug, Peng Liu, Jon Atle Gulla
[Findings] Context-Aware Language Modeling for Goal-Oriented Dialogue Systems. Charlie Victor Snell, Sherry Yang, Justin Fu, Yi Su, Sergey Levine
[Findings] KETOD: Knowledge-Enriched Task-Oriented Dialogue. Zhiyu Chen, Bing Liu, Seungwhan Moon, Chinnadhurai Sankar, Paul A. Crook, William Yang Wang
Discourse and Pragmatics
[Findings] Improve Discourse Dependency Parsing with Contextualized Representations. Yifei Zhou, Yansong Feng
Efficient Methods in NLP
[Findings] Empowering parameter-efficient transfer learning by recognizing the kernel structure in self-attention. Yifan Chen, Devamanyu Hazarika, Mahdi Namazifar, Yang Liu, Di Jin, Dilek Hakkani-Tur
[Findings] Great Power, Great Responsibility: Recommendations for Reducing Energy for Training Language Models. Joseph McDonald, Baolin Li, Nathan C. Frey, Devesh Tiwari, Vijay Gadepally, Siddharth Samsi
[Findings] AdapterBias: Parameter-efficient Token-dependent Representation Shift for Adapters in NLP Tasks. Chin-Lun Fu, Zih-Ching Chen, Yun-Ru Lee, Hung-yi Lee
[Findings] LongT5: Efficient Text-To-Text Transformer for Long Sequences. Mandy Guo, Joshua Ainslie, David Uthus, Santiago Ontanon, Jianmo Ni, Yun-Hsuan Sung, Yinfei Yang
[Findings] LM-CORE: Language Models with Contextually Relevant External Knowledge. Jivat Neet Kaur, Sumit Bhatia, Milan Aggarwal, Rachit Bansal, Balaji Krishnamurthy
Ethics, Bias, and Fairness
[Findings] Cross-Domain Classification of Moral Values. Enrico Liscio, Alin Eugen Dondera, Andrei Geadau, Catholijn M Jonker, Pradeep Kumar Murukannaiah
[Findings] On Measuring Social Biases in Prompt-Based Multi-Task Learning. Afra Feyza Akyürek, Sejin Paik, Muhammed Yusuf Kocyigit, Seda Akbiyik, Serife Leman Runyun, Derry Wijaya
Interpretability and Analysis of Models for NLP
[Findings] Few-Shot Self-Rationalization with Natural Language Prompts. Ana Marasovic, Iz Beltagy, Doug Downey, Matthew E Peters
[Findings] Entailment Tree Explanations via Iterative Retrieval-Generation Reasoner. Danilo Neves Ribeiro, Shen Wang, Xiaofei Ma, Rui Dong, Xiaokai Wei, Henghui Zhu, Xinchi Chen, Peng Xu, zhiheng huang, Andrew Arnold, Dan Roth
[Findings] Identifying and Mitigating Spurious Correlations for Improving Robustness in NLP Models. Tianlu Wang, Rohit Sridhar, Diyi Yang, Xuezhi Wang
[Findings] Exploring the Universal Vulnerability of Prompt-based Learning Paradigm. Lei Xu, Yangyi Chen, Ganqu Cui, Hongcheng Gao, Zhiyuan Liu
[Findings] On the Limitations of Dataset Balancing: The Lost Battle Against Spurious Correlations. Roy Schwartz, Gabriel Stanovsky
[Findings] Beyond Distributional Hypothesis: Let Language Models Learn Meaning-Text Correspondence. M.J Jang, Frank Martin Mtumbuka, Thomas Lukasiewicz
Machine Learning for NLP: Classification and Structured Prediction Models
[Findings] 'Diversity and Uncertainty in Moderation'' are the Key to Data Selection for Multilingual Few-shot Transfer. Shanu Kumar, Sandipan Dandapat, Monojit Choudhury
Machine Learning for NLP: Language Modeling and Sequence to Sequence Models
[Findings] Entity Cloze By Date: What LMs Know About Unseen Entities. Yasumasa Onoe, Michael JQ Zhang, Eunsol Choi, Greg Durrett
[Findings] Masked Measurement Prediction: Learning to Jointly Predict Quantities and Units from Textual Context. Daniel Spokoyny, Ivan Lee, Zhao Jin, Taylor Berg-Kirkpatrick
[Findings] Learning Rich Representation of Keyphrases from Text. Mayank Kulkarni, Debanjan Mahata, Ravneet Singh Arora, Rajarshi Bhowmik
[Findings] Temporal Attention for Language Models. Guy D. Rosin, Kira Radinsky
[Findings] Lacuna Reconstruction: Self-Supervised Pre-Training for Low-Resource Historical Document Transcription. Nikolai Vogler, Jonathan Parkes Allen, Matthew Thomas Miller, Taylor Berg-Kirkpatrick
[Findings] Hierarchical Transformers Are More Efficient Language Models. Piotr Nawrot, Szymon Tworkowski, Michał Tyrolski, Lukasz Kaiser, Yuhuai Wu, Christian Szegedy, Henryk Michalewski
Multilinguality
[Findings] DOCmT5: Document-Level Pretraining of Multilingual Language Models. Chia-Hsuan Lee, Aditya Siddhant, Viresh Ratnakar, Melvin Johnson
[Findings] How to Translate Your Samples and Choose Your Shots? Analyzing Translate-train & Few-shot Cross-lingual Transfer. Iman Jundi, Gabriella Lapesa
[Findings] Por Qué Não Utiliser Alla Språk? Mixed Training with Gradient Optimization in Few-Shot Cross-Lingual Transfer. Haoran Xu, Kenton Murray
[Findings] MTG: A Benchmark Suite for Multilingual Text Generation. Yiran Chen, Zhenqiao Song, Xianze Wu, Danqing Wang, Jingjing Xu, Jiaze Chen, Hao Zhou, Lei Li
Question Answering
[Findings] MWP-BERT: Numeracy-Augmented Pre-training for Math Word Problem Solving. Zhenwen Liang, Jipeng Zhang, Lei Wang, Wei QIN, Yunshi Lan, Jie Shao, Xiangliang Zhang
[Findings] Exploiting Numerical-Contextual Knowledge to Improve Numerical Reasoning in Question Answering. Jeonghwan Kim, Junmo Kang, Kyung-min Kim, Giwon Hong, Sung-Hyon Myaeng
[Findings] METGEN: A Module-Based Entailment Tree Generation Framework for Answer Explanation. Ruixin Hong, Hongming Zhang, Xintong Yu, Changshui Zhang
[Findings] Challenges in Generalization in Open Domain Question Answering. Linqing Liu, Patrick Lewis, Sebastian Riedel, Pontus Stenetorp
[Findings] CCQA: A New Web-Scale Question Answering Dataset for Model Pre-Training. Patrick Huber, Armen Aghajanyan, Barlas Oguz, Dmytro Okhonko, Scott Yih, Sonal Gupta, Xilun Chen
[Findings] UniK-QA: Unified Representations of Structured and Unstructured Knowledge for Open-Domain Question Answering. Barlas Oguz, Xilun Chen, Vladimir Karpukhin, Stan Peshterliev, Dmytro Okhonko, Michael Sejr Schlichtkrull, Sonal Gupta, Yashar Mehdad, Scott Yih
[Findings] PerKGQA: Question Answering over Personalized Knowledge Graphs. Ritam Dutt, Kasturi Bhattacharjee, Rashmi Gangadharaiah, Dan Roth, Carolyn Rose
Semantics: Lexical Semantics
[Findings] TaCL: Improving BERT Pre-training with Token-aware Contrastive Learning. Yixuan Su, Fangyu Liu, Zaiqiao Meng, Tian Lan, Lei Shu, Ehsan Shareghi, Nigel Collier
Semantics: Sentence-level Semantics and Textual Inference
[Findings] A Question-Answer Driven Approach to Reveal Affirmative Interpretations from Verbal Negations. Md Mosharaf Hossain, Luke Holman, Anusha Kakileti, Tiffany Iris Kao, Nathan Raul Brito, Aaron Abraham Mathews, Eduardo Blanco
[Findings] The Role of Context in Detecting Previously Fact-Checked Claims. Shaden Shaar, Firoj Alam, Giovanni Da San Martino, Preslav Nakov
[Findings] SEQZERO: Few-shot Compositional Semantic Parsing with Sequential Prompts and Zero-shot Models. Jingfeng Yang, Haoming Jiang, Qingyu Yin, Danqing Zhang, Bing Yin, Diyi Yang
[Findings] Weakly Supervised Text-to-SQL Parsing through Question Decomposition. Tomer Wolfson, Daniel Deutch, Jonathan Berant
Sentiment Analysis and Stylistic Analysis
[Findings] POLITICS: Pretraining with Same-story Article Comparison for Ideology Prediction and Stance Detection. Yujian Liu, Xinliang Frederick Zhang, David Wegsman, Nicholas Beauchamp, Lu Wang
[Findings] A Survey on Stance Detection for Mis- and Disinformation Identification. Momchil Hardalov, Arnav Arora, Preslav Nakov, Isabelle Augenstein
Lunch12:15 – 14:15
Oral Session 10 + Findings In-person Poster Session 2
14:15 – 15:45
Choose AllRemove All
Selective Differential Privacy for Language Modeling. Weiyan Shi, Aiqi Cui, Evan Li, Ruoxi Jia, Zhou Yu
Federated Learning with Noisy User Feedback. Rahul Sharma, Anil Ramakrishna, Ansel MacLaughlin, Anna Rumshisky, Jimit Majmudar, Clement Chung, Salman Avestimehr, Rahul Gupta
Provably Confidential Language Modelling. Xuandong Zhao, Lei Li, Yu-Xiang Wang
Optimising Equal Opportunity Fairness in Model Training. Aili Shen, Xudong Han, Trevor Cohn, Timothy Baldwin, Lea Frermann
How Gender Debiasing Affects Internal Model Representations, and Why It Matters. Hadas Orgad, Seraphina Goldfarb-Tarrant, Yonatan Belinkov
Explaining Toxic Text via Knowledge Enhanced Text Generation. Rohit Sridhar, Diyi Yang
14:15 – 15:45
Choose AllRemove All
Falsesum: Generating Document-level NLI Examples for Recognizing Factual Inconsistency in Summarization. Prasetya Ajie Utama, Joshua Bambrick, Nafise Sadat Moosavi, Iryna Gurevych
Maximum Bayes Smatch Ensemble Distillation for AMR Parsing. Young-Suk Lee, Ramon Fernandez Astudillo, Hoang Thanh Lam, Tahira Naseem, Radu Florian, Salim Roukos
Curriculum: A Broad-Coverage Benchmark for Linguistic Phenomena in Natural Language Understanding. Zeming Chen, Qiyue Gao
Syn2Vec: Synset Colexification Graphs for Lexical Semantic Similarity. John Harvill, Roxana Girju, Mark A. Hasegawa-Johnson
WiC = TSV = WSD: On the Equivalence of Three Semantic Tasks. Bradley Hauer, Grzegorz Kondrak
EASE: Entity-Aware Contrastive Learning of Sentence Embedding. Sosuke Nishikawa, Ryokan Ri, Ikuya Yamada, Yoshimasa Tsuruoka, Isao Echizen
14:15 – 15:45
Choose AllRemove All
Generalized Quantifiers as a Source of Error in Multilingual NLU Benchmarks. Ruixiang Cui, Daniel Hershcovich, Anders Søgaard
What company do words keep? Revisiting the distributional semantics of J.R. Firth & Zellig Harris. Mikael Brunila, Jack LaViolette
Learning the Ordering of Coordinate Compounds and Elaborate Expressions in Hmong, Lahu, and Chinese. Chenxuan Cui, Katherine J. Zhang, David R Mortensen
Neural Language Taskonomy: Which NLP Tasks are the most Predictive of fMRI Brain Activity?. SUBBA REDDY OOTA, JASHN ARORA, Veeral Agarwal, mounika marreddy, Manish Gupta, Bapi Raju Surampudi
Towards Understanding Large-Scale Discourse Structures in Pre-Trained and Fine-Tuned Language Models. Patrick Huber, Giuseppe Carenini
Social Norms Guide Reference Resolution. Mitchell Abrams, Matthias Scheutz
14:15 – 15:45
Choose AllRemove All
A Structured Span Selector. Tianyu Liu, Yuchen Eleanor Jiang, Ryan Cotterell, Mrinmaya Sachan
Entity Linking via Explicit Mention-Mention Coreference Modeling. Dhruv Agarwal, Rico Angell, Nicholas Monath, Andrew McCallum
Automatic Multi-Label Prompting: Simple and Interpretable Few-Shot Classification. Han Wang, Canwen Xu, Julian McAuley
AcTune: Uncertainty-Based Active Self-Training for Active Fine-Tuning of Pretrained Language Models. Yue Yu, Lingkai Kong, Jieyu Zhang, Rongzhi Zhang, Chao Zhang
MoEBERT: from BERT to Mixture-of-Experts via Importance-Guided Adaptation. Simiao Zuo, Qingru Zhang, Chen Liang, Pengcheng He, Tuo Zhao, Weizhu Chen
Teaching BERT to Wait: Balancing Accuracy and Latency for Streaming Disfluency Detection. Angelica Chen, Vicky Zayats, Daniel David Walker, Dirk Padfield
14:15 – 15:45
Choose AllRemove All
Learning to Retrieve Passages without Supervision. Ori Ram, Gal Shachaf, Omer Levy, Jonathan Berant, Amir Globerson
Interpretable Proof Generation via Iterative Backward Reasoning. Hanhao Qu, Yu Cao, Jun Gao, Liang Ding, Ruifeng Xu
MultiSpanQA: A Dataset for Multi-Span Question Answering. Haonan Li, Martin Tomko, Maria Vasardani, Timothy Baldwin
Evidentiality-guided Generation for Knowledge-Intensive NLP Tasks. Akari Asai, Matt Gardner, Hannaneh Hajishirzi
Modularized Transfer Learning with Multiple Knowledge Graphs for Zero-shot Commonsense Reasoning. Yu Jin Kim, Beong-woo Kwak, Youngwook Kim, Reinald Kim Amplayo, seung-won hwang, Jinyoung Yeo
[TACL] MuSiQue: Multi-hop Questions via Single-hop Question Composition. Harsh Trivedi, Niranjan Balasubramanian, Tushar Khot, Ashish Sabharwal
14:15 – 15:45 Regency A & B
Human-Centered NLP
[Findings] One Size Does Not Fit All: The Case for Personalised Word Complexity Models. Sian Gooding, Manuel Tragut
[Findings] Aligning Generative Language Models with Human Values. Ruibo Liu, Ge Zhang, Xinyu Feng, Soroush Vosoughi
[Findings] Design Challenges for a Multi-Perspective Search Engine. Sihao Chen, Siyi Liu, Xander Uyttendaele, Yi Zhang, William Bruno, Dan Roth
Information Extraction
[Findings] Extracting Temporal Event Relation with Syntax-guided Graph Transformer. SHUAICHENG ZHANG, Qiang Ning, Lifu Huang
[Findings] StATIK: Structure and Text for Inductive Knowledge Graph Completion. Elan Sopher Markowitz, Keshav Balasubramanian, Mehrnoosh Mirtaheri, Murali Annavaram, Aram Galstyan, Greg Ver Steeg
[Findings] Permutation Invariant Strategy Using Transformer Encoders for Table Understanding. Sarthak Dash, Sugato Bagchi, Nandana Mihindukulasooriya, Alfio Gliozzo
[Findings] Self-Training with Differentiable Teacher. Simiao Zuo, Yue Yu, Chen Liang, Haoming Jiang, Siawpeng Er, Chao Zhang, Tuo Zhao, Hongyuan Zha
[Findings] Low-resource Entity Set Expansion: A Comprehensive Study on User-generated Text. Yutong Shao, Nikita Bhutani, Sajjadur Rahman, Estevam Hruschka
[Findings] Zero-shot Entity Linking with Less Data. G P Shrivatsa Bhargav, Dinesh Khandelwal, Saswati Dana, Dinesh Garg, Pavan Kapanipathi, Salim Roukos, Alexander Gray, L Venkata Subramaniam
[Findings] Event Detection for Suicide Understanding. Luis Fernando Guzman-Nateras, Viet Dac Lai, Amir Pouran Ben Veyseh, Franck Dernoncourt, Thien Huu Nguyen
[Findings] Textual Entailment for Event Argument Extraction: Zero- and Few-Shot with Multi-Source Learning. Oscar Sainz, Itziar Gonzalez-Dios, Oier Lopez de Lacalle, Bonan Min, Eneko Agirre
[Findings] EA$^2$E: Improving Consistency with Event Awareness for Document-Level Argument Extraction. Qi Zeng, Qiusi Zhan, Heng Ji
[Findings] Dangling-Aware Entity Alignment with Mixed High-Order Proximities. Juncheng Liu, Zequn Sun, Bryan Hooi, Yiwei Wang, Dayiheng Liu, Baosong Yang, Xiaokui Xiao, Muhao Chen
Information Retrieval and Text Mining
[Findings] Literature-Augmented Clinical Outcome Prediction. Aakanksha Naik, Sravanthi Parasa, Sergey Feldman, Lucy Lu Wang, Tom Hope
[Findings] Retrieval-Augmented Multilingual Keyphrase Generation with Retriever-Generator Iterative Training. Yifan Gao, Qingyu Yin, zheng li, Rui Meng, Tong Zhao, Bing Yin, Irwin King, Michael Lyu
Language Generation
[Findings] Controllable Sentence Simplification via Operation Classification. Liam Cripwell, Joël Legrand, Claire Gardent
[Findings] The Case for a Single Model that can Both Generate Continuations and Fill-in-the-Blank. Daphne Ippolito, Liam Dugan, Emily Reif, Ann Yuan, Andy Coenen, Chris Callison-Burch
Language Grounding to Vision, Robotics and Beyond
[Findings] Probing the Role of Positional Information in Vision-Language Models. Philipp J. Rösch, Jindřich Libovický
[Findings] Improving Few-Shot Image Classification Using Machine- and User-Generated Natural Language Descriptions. Kosuke Nishida, Kyosuke Nishida, Shuichi Nishioka
Language Resources and Evaluation
[Findings] Challenging America: Modeling language in longer time scales. Jakub Pokrywka, Filip Graliński, Krzysztof Jassem, Karol Kaczmarek, Krzysztof Jan Jurkiewicz, Piotr Wierzchon
[Findings] PubHealthTab: A Public Health Table-based Dataset for Evidence-based Fact Checking. Mubashara Akhtar, Oana Cocarascu, Elena Simperl
[Findings] MM-Claims: A Dataset for Multimodal Claim Detection in Social Media. Gullal Singh Cheema, Sherzod Hakimov, Abdul Sittar, Eric Müller-Budack, Christian Otto, Ralph Ewerth
[Findings] In-BoXBART: Get Instructions into Biomedical Multi-Task Learning. Mihir Parmar, Swaroop Mishra, Mirali Purohit, Man Luo, M. Hassan Murad, Chitta Baral
[Findings] SemAttack: Natural Textual Attacks via Different Semantic Spaces. Boxin Wang, Chejian Xu, Xiangyu Liu, Yu Cheng, Bo Li
[Findings] Language Models for Code-switch Detection of te reo Māori and English in a Low-resource Setting. Jesin James, Vithya Yogarajan, Isabella Shields, Catherine Watson, Peter Keegan, Keoni Mahelona, Peter-Lucas Jones
[Findings] A Dog Is Passing Over The Jet? A Text-Generation Dataset for Korean Commonsense Reasoning and Evaluation. Jaehyung Seo, Seounghoon Lee, Chanjun Park, Yoonna Jang, Hyeonseok Moon, Sugyeong Eo, Seonmin Koo, Heuiseok Lim
Machine Translation
[Findings] CoCoA-MT: A Dataset and Benchmark for Contrastive Controlled MT with Application to Formality. Maria Nadejde, Anna Currey, Benjamin Hsu, Xing Niu, Marcello Federico, Georgiana Dinu
[Findings] BitextEdit: Automatic Bitext Editing for Improved Low-Resource Machine Translation. Eleftheria Briakou, Sida Wang, Luke Zettlemoyer, Marjan Ghazvininejad
NLP Applications
[Findings] Learning to repair: Repairing model output errors after deployment using a dynamic memory of feedback. Niket Tandon, Aman Madaan, Peter Clark, Yiming Yang
[Findings] TEAM: A multitask learning based Taxonomy Expansion approach for Attach and Merge. Bornali Phukon, Anasua Mitra, Ranbir Singh Sanasam, Priyankoo Sarmah
[Findings] Multimodal Intent Discovery from Livestream Videos. Adyasha Maharana, Quan Hung Tran, Franck Dernoncourt, Seunghyun Yoon, Trung Bui, Walter W Chang, Mohit Bansal
[Findings] Opponent Modeling in Negotiation Dialogues by Related Data Adaptation. Kushal Chawla, Gale Lucas, Jonathan May, Jonathan Gratch
[Findings] Learning to Embed Multi-Modal Contexts for Situated Conversational Agents. Haeju Lee, Oh Joon Kwon, Yunseon Choi, Minho Park, Ran Han, Yoonhyung Kim, Jinhyeon Kim, Youngjune Lee, Haebin Shin, Kangwook Lee, Kee-Eung Kim
[Findings] MultiVerS: Improving scientific claim verification with weak supervision and full-document context. David Wadden, Kyle Lo, Lucy Lu Wang, Arman Cohan, Iz Beltagy, Hannaneh Hajishirzi
[Findings] An Item Response Theory Framework for Persuasion. Anastassia Kornilova, Vladimir Eidelman, Daniel Argyle
[Findings] Self-Supervised Contrastive Learning with Adversarial Perturbations for Defending Word Substitution-based Attacks. Zhao Meng, Yihan Dong, Mrinmaya Sachan, Roger Wattenhofer
[Findings] Towards Job-Transition-Tag Graph for a Better Job Title Representation Learning. Jun ZHU, CELINE HUDELOT
[Findings] The Limits of Word Level Differential Privacy. Justus Mattern, Benjamin Weggenmann, Florian Kerschbaum
[Findings] Denoising Neural Network for News Recommendation with Positive and Negative Implicit Feedback. Yunfan Hu, Zhaopeng Qiu, Xian Wu
Phonology, Morphology and Word Segmentation
[Findings] Restoring Hebrew Diacritics Without a Dictionary. Elazar Gershuni, Yuval Pinter
Speech
[Findings] BehancePR: A Punctuation Restoration Dataset for Livestreaming Video Transcript. Viet Dac Lai, Amir Pouran Ben Veyseh, Franck Dernoncourt, Thien Huu Nguyen
Summarization
[Findings] Masked Summarization to Generate Factually Inconsistent Summaries for Improved Factual Consistency Checking. Hwanhee Lee, Kang Min Yoo, Joonsuk Park, Hwaran Lee, Kyomin Jung
[Findings] Efficient Few-Shot Fine-Tuning for Opinion Summarization. Arthur Brazinskas, Ramesh Nallapati, Mohit Bansal, Markus Dreyer
[Findings] Make The Most of Prior Data: A Solution for Interactive Text Summarization with Preference Feedback. Duy-Hung Nguyen, Nguyen Viet Dung Nghiem, Bao-Sinh Nguyen, Tien Dung Le, Shahab Sabahi, Minh-Tien Nguyen, Hung Le
Break15:45 – 16:15Regency A & B
Plenary Invited Talk 2: Manuel Montes-y-Gómez: "NLP in Mexican Spanish: One of many stories"16:15 – 17:15Columbia C/D (Overflow: Columbia A & 302 Beckler)
Closing Session17:15 – 17:45Columbia C/D (Overflow: Columbia A & 302 Beckler)
Thursday, July 14, 2022
Registration and Breakfast7:30 – 9:00Level 3 Foyer & Level 5 Foyer
Workshops9:00 – 18:00
Friday, July 15, 2022
Registration and Breakfast7:30 – 9:00Level 3 Foyer & Level 5 Foyer
Workshops9:00 – 18:00