2024
Mapping the Podcast Ecosystem with the Structured Podcast Research Corpus. Ben Litterer, David Jurgens, and Dallas Card. preprint. paper · code (data) · code (paper) · data |
A Test of Time: Predicting the Sustainable Success of Online Collaboration in Wikipedia. Abraham Israeli, David Jurgens, and Daniel Romero. preprint. paper · code and data |
SPRIG: Improving Large Language Model Performance by System Prompt Optimization. Lechen Zhang, Tolga Ergen, Lajanugen Logeswaran, Moontae Lee, and David Jurgens. preprint. paper · code and data |
Real or Robotic? Assessing Whether LLMs Accurately Simulate Qualities of Human Responses in Dialogue. Johnathan Ivey, Shivani Kumar, Jiayu Liu, Hua Shen, Sushrita Rakshit, Rohan Raju, Haotian Zhang, Aparna Ananthasubramaniam, Junghwan Kim, Bowen Yi, Dustin Wright, Abraham Israeli, Anders Giovanni Møller, Lechen Zhang, David Jurgens. preprint. paper · code and data |
Networks and Identity Drive Geographic Properties of the Diffusion of Linguistic Innovation Aparna Ananthasubramaniam, David Jurgens, Daniel M. Romero. npj Complexity. 2024. pdf |
The Language of Trauma: Modeling Traumatic Event Descriptions Across Domains with Explainable AI Miriam Schirmer, Tobias Leemann, Gjergji Kasneci, Jürgen Pfeffer, and David Jurgens. Findings of EMNLP. 2024. pdf |
ValueScope: Unveiling Implicit Norms and Values via Return Potential Model of Social Interactions Chan Young Park, Shuyue Stella Li, Hayoung Jung, Svitlana Volkova, Tanushree Mitra, David Jurgens, and Yulia Tsvetkov. Findings of EMNLP. 2024. paper · code and data |
Tab2Text - A framework for deep learning with tabular data Tong Lin*, Jason Yan*, David Jurgens, and Sabina Tomkins. Findings of EMNLP. 2024. preprint forthcoming |
Is "A Helpful Assistant" the Best Role for Large Language Models? A Systematic Evaluation of Social Roles in System Prompts Mingqian Zheng, Jiaxin Pei, Lajanugen Logeswaran, Moontae Lee, and David Jurgens. Findings of EMNLP. 2024. paper · code and data |
Towards Bidirectional Human-AI Alignment: A Systematic Review for Clarifications, Framework, and Future Directions. Hua Shen, Tiffany Knearem, Reshmi Ghosh, Kenan Alkiek, Kundan Krishna, Yachuan Liu, Ziqiao Ma, Savvas Petridis, Yi-Hao Peng, Li Qiwei, Sushrita Rakshit, Chenglei Si, Yutong Xie, Jeffrey P. Bigham, Frank Bentley, Joyce Chai, Zachary Lipton, Qiaozhu Mei, Rada Mihalcea, Michael Terry, Diyi Yang, Meredith Ringel Morris, Paul Resnick, and David Jurgens. preprint. paper |
The Call for Socially Aware Language Technologies. Diyi Yang, Dirk Hovy, David Jurgens, and Barbara Plank. preprint. paper |
A Multilingual Similarity Dataset for News Article Frame. Xi Chen, Mattia Samory, Scott Hale, David Jurgens, Przemyslaw A Grabowicz Proceedings of the International AAAI Conference on Web and Social Media (ICWSM). paper · data |
You don't need a personality test to know these models are unreliable: Assessing the Reliability of Large Language Models on Psychometric Instruments Bangzhao Shu*, Lechen Zhang*, Minje Choi, Lavinia Dunagan, Dallas Card, and David Jurgens. Proceedings of the 2024 Annual Conference of the North American Chapter of the Association for Computational Linguistics. paper · code and data |
Social Meme-ing: Measuring Linguistic Variation in Memes Naitian Zhou, David Jurgens, and David Bamman. Proceedings of the 2024 Annual Conference of the North American Chapter of the Association for Computational Linguistics. paper · code and data |
Modeling Empathetic Alignment in Conversation Jiamin Yang and David Jurgens. Proceedings of the 2024 Annual Conference of the North American Chapter of the Association for Computational Linguistics. paper · code, models, and data · Jiamin's amazing annotation tool |
Global News Synchrony During the Start of the COVID-19 Pandemic Xi Chen, Scott A. Hale, David Jurgens, Mattia Samory, Ethan Zuckerman, Przemyslaw Adam Grabowicz. Proceedings of the 2024 Web Conference. paper · code and data |
Finding Educationally Supportive Contexts for Vocabulary Learning with Attention-Based Models Sungjin Nam, Kevyn Collins-Thompson, David Jurgens and Xin Tong. Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING). paper |
Author mentions in science news reveal widespread disparities across name-inferred ethnicities. Hao Peng, Misha Teplitskiy, David Jurgens. Journal of Quantitative Social Sciences. pdf (preprint) |
2023
Aligning with Whom? Large Language Models Have Gender and Racial Biases in Subjective NLP Tasks Huaman Sun, Jiaxin Pei, Minje Choi, and David Jurgens. preprint. paper · code and data |
Do LLMs Understand Social Knowledge? Evaluating the Sociability of Large Language Models with SocKET Benchmark Minje Choi,* Jiaxin Pei,* Sagar Kumar, Chang Shu and David Jurgens. Proceedings of the Empirical Methods in Natural Language Processing (EMNLP). 2023. paper · code and data |
When it Rains, it Pours: Modeling Media Storms and the News Ecosystem Ben Litterer, David Jurgens, and Dallas Card. Proceedings of the Empirical Methods in Natural Language Processing (EMNLP). 2023. paper (forthcoming) |
Profile Update: The Effects of Identity Disclosure on Network
Connections and Language Minje Choi, Daniel Romero, David Jurgens. preprint. paper |
RCT Rejection Sampling for Causal Estimation Evaluation Katherine A. Keith, Sergey Feldman, David Jurgens, Jonathan Bragg, Rohit Bhattacharya. preprint. 2023. paper · code and data |
Your spouse needs professional help: Determining the Contextual Appropriateness of Messages through Modeling Social Relationships David Jurgens,* Agrima Seth,* Jackson Sargent,† Athena Aghighi,† and Michael Geraci.†. Proceedings of the Annual Meeting of the Association for Computational Linguistics (ACL). 2023. paper · code and data |
When Do Annotator Demographics Matter? Measuring The Influence of Annotator Demographics with the POPQUORN Dataset Jiaxin Pei and David Jurgens. Proceedings of the 17th Linguistic Annotation Workshop (LAW-XVII) at ACL. 2023. paper · code and data |
Exploring Linguistic Style Matching in Online Communities: The Role of Social Context and Conversation Dynamics Aparna Ananthasubramaniam, Hong Chen, Jason Yan, Kenan Alkiek, Jiaxin Pei, Agrima Seth, Lavinia Dunagan, Minje Choi, Benjamin Litterer and David Jurgens. (Best Paper) Proceedings of the 1st Workshop on Social Influence in Conversations (SICon) at ACL. 2023. paper · code and data |
SemEval 2023 Task 9: Multilingual Tweet Intimacy Analysis Jiaxin Pei, Vítor Silva, Maarten Bos, Yozon Liu, Leonardo Neves, David Jurgens, and Francesco Barbieri. Proceedings of the 17th International Workshop on Semantic Evaluation (SemEval). paper · data and competition |
Analyzing the Engagement of Social Relationships During Life Event Shocks in Social Media Minje Choi, David Jurgens, and Daniel Romero. Proceedings of the International Conference on Web and Social Media (ICWSM). 2023. paper · code and data |
Bridging Nations: Quantifying the Role of Multilinguals in Communication on Social Media Julia Mendelsohn, Sayan Ghosh, David Jurgens, and Ceren Budak. (Best Methodology Paper) Proceedings of the International Conference on Web and Social Media (ICWSM). 2023. paper · code and data |
2022
Modeling Information Change in Science Communication with Semantically Matched Paraphrases Dustin Wright, Jiaxin Pei, David Jurgens, and Isabelle Augenstein. Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP). 2022. paper · code and data |
A Critical Reflection and Forward Perspective on Empathy and Natural Language Processing Allison Claire Lahnala, Charles Welch, David Jurgens, and Lucie Flek. Proceedings of the Findings of Empirical Methods in Natural Language Processing (EMNLP Findings). 2022. paper |
POTATO: The Portable Text Annotation Tool Jiaxin Pei, Aparna Kamakshi Ananthasubramaniam, Xingyao Wang, Naitian Zhou, Apostolos Dedeloudis, Jackson Sargent and David Jurgens. Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP): Systems Demonstrations. 2022. paper · code |
MultiCite: Modeling realistic citations requires moving beyond the single-sentence single-label setting Anne Lauscher, Brandon Ko, Bailey Kuhl, Sophie Johnson, Arman Cohan, David Jurgens, Kyle Lo. Proceedings of the 2022 Annual Conference of the North American Chapter of the Association for Computational Linguistics (NAACL). 2022. paper · code and data |
The subtle language of exclusion: Identifying the Toxic Speech of Trans-exclusionary Radical Feminists Christina Lu and David Jurgens. Proceedings of the Sixth Workshop on Online Abuse and Harms (WOAH). 2022. paper · code and data |
SemEval-2022 Task 8: Multilingual news article similarity Xi Chen, Ali Zeynali, Chico Camargo, Fabian Flöck, Devin Gaffney, Przemyslaw Grabowicz, Scott Hale, David Jurgens, and Mattia Samory. Proceedings of the 16th International Workshop on Semantic Evaluation (SemEval-2022). 2022. paper · data |
An Attention-Based Model for Predicting Contextual Informativeness and Curriculum Learning Applications Sungjin Nam, David Jurgens, and Kevyn Collins-Thompson. in submission. 2022. pdf |
Diversifying the Professoriate Bas Hofstra, Daniel A. McFarland, Sanne Smith, David Jurgens. Socius. 2022. pdf |
Classification without (Proper) Representation: Political Heterogeneity in Social Media and Its Implications for Classification and Behavioral Analysis Kenan Alkik, Bohan Zhang, and David Jurgens. ACL Findings. 2022. pdf · code |
ByT5 model for massively multilingual grapheme-to-phoneme conversion Jian Zhu, Cong Zhang, and David Jurgens. Interspeech 2022. pdf · code |
Language in Popular American Culture Constructs the Meaning of Healthy and Unhealthy Eating: Narratives of Craveability, Excitement, and Social Connection in Movies, Television, Social Media, Recipes, and Food Reviews Bradley P. Turnwald, Margaret A. Perry, David Jurgens, Vinodkumar Prabhakaran, Dan Jurafsky, Hazel R. Markus, Alia J. Crum. Appetitte. 2022. pdf |
Phone-to-audio alignment without text: A Semi-supervised Approach Jian Zhu, Cong Zhang, and David Jurgens. Proceedings of the 2022 IEEE International Conference on Acoustics, Speech and Signal Processing. pdf · code |
Work Expectations, Depressive Symptoms, and Passive Suicidal Ideation Among Older Adults: Evidence From the Health and Retirement Study Briana Mezuk, Linh Dang, David Jurgens, Jacqui Smith. The Gerontologist 62 (10), 1454-1465 2022. paper |
2021
Detecting Cross-Geographic Biases in Toxicity Modeling on Social Media Sayan Ghosh, Dylan Baker, David Jurgens, and Vinodkumar Prabhakaran. (Best Paper) Proceedings of the 7th Workshop on Noisy User-generated Text (W-NUT). pdf |
Using Sociolinguistic Variables to Reveal Changing Attitudes Towards Sexuality and Gender. Sky Wang and David Jurgens. Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing (EMNLP). pdf |
Idiosyncratic but not Arbitrary: Learning Idiolects in Online Registers Reveals Distinctive yet Consistent Individual Styles. Jian Zhu and David Jurgens.. Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing (EMNLP). pdf · code and data |
Measuring Sentence-Level and Aspect-Level Certainty in Science Communications Jiaxin Pei and David Jurgens. Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing (EMNLP). pdf · code and data |
Detecting Community Sensitive Norm Violations in Online Conversations. Chan Young Park, Julia Mendelsohn, Karthik Radhakrishnan, Kinjal Jain, Tushar Kanakagiri, David Jurgens and Yulia Tsvetkov. Proceedings of the Findings of the 2021 Conference on Empirical Methods in Natural Language Processing (Findings of EMNLP). pdf |
An Animated Picture Says at Least a Thousand Words: Selecting Gif-based Replies in Multimodal Dialog.. Xingyao Wang and David Jurgens. Proceedings of the Findings of the 2021 Conference on Empirical Methods in Natural Language Processing (Findings of EMNLP). pdf · code and data · Slack gif-bot App |
A Data Science Approach to Estimating the Frequency of Driving Cessation Associated Suicide in the US: Evidence From the National Violent Death Reporting System Tomohiro M. Ko,, Viktoryia A. Kalesnikava, David Jurgens, and Briana Mezuk. Frontiers in Public Health. pdf |
Learning PyTorch Through A Neural Dependency Parsing Exercise David Jurgens. Proceedings of the Fifth Workshop on Teaching NLP, 2021. pdf |
Learning about Word Vector Representations and Deep Learning through Implementing Word2vec David Jurgens. Proceedings of the Fifth Workshop on Teaching NLP, 2021. pdf |
More than meets the tie: Examining the Role of Interpersonal Relationships in Social Networks Minje Choi, Ceren Budak, Daniel Romero, and David Jurgens. International Conference on Web and Social Media (ICWSM), 2021. pdf · code |
The Structure of Online Social Networks Modulates the Rate of Lexical Change Jian Zhu and David Jurgens. Proceedings of the North American Meeting of the Association for Computational Linguistics (NAACL), 2021. pdf · code |
Modeling Framing in Immigration Discourse on Social Media Julia Mendelsohn, Ceren Budak, and David Jurgens. Proceedings of the North American Meeting of the Association for Computational Linguistics (NAACL), 2021. pdf · code |
Conversations Gone Alright: Quantifying and Predicting Prosocial Outcomes in Online Conversations
Jiajun Bao*, Junjie Wu*, Yiming Zhang*, Eshwar Chandrasekharan, and David Jurgens. Proceedings of the Web Conference (WebConf), 2021. pdf · code |
2020
Quantifying Intimacy In Language Jiaxin Pei and David Jurgens. Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP), 2020. pdf · project webpage · code · pip-installable package |
Condolence and Empathy in Online Communities Naitian Zhou and David Jurgens. Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP), 2020. pdf · data request form |
Still out there: Modeling and Identifying Russian Troll Accounts on Twitter. (Best Paper Runner-Up) Jane Im, Eshwar Chandrasekharan, Jackson Sargent, Paige Lighthammer, Taylor Denby, Ankit Bhargava, Libby Hemphill, David Jurgens, Eric Gilbert. Proceedings of Web Science, 2020. pdf |
Measuring the predictability of life outcomes with
a scientific mass collaboration. Matthew J. Salganik, Ian Lundberg, Alexander T. Kindel, Caitlin E. Ahearn, Khaled Al-Ghoneim, Abdullah Almaatouq, Drew M. Altschul, Jennie E. Brand, Nicole Bohme Carnegie, Ryan James Compton, Debanjan Datta, Thomas Davidson, Anna Filippova, Connor Gilroy, Brian J. Goode, Eaman Jahani, Ridhi Kashyap, Antje Kirchner, Stephen McKay, Allison C. Morgan, Alex “Sandy” Pentland, Kivan Polimis, Louis Raes, Daniel E. Rigobon, Claudia V. Roberts, Diana M. Stanescu, Yoshihiko Suhara, Adaner Usmani, Erik H. Wang, Muna Adem, Abdulla Alhajri, Bedoor AlShebli, Redwane Amin, Ryan B. Amos, Lisa P. Argyle, Livia Baer-Bositis, Moritz Büchi, Bo-Ryehn Chung, William Eggert, Gregory Faletto, Zhilin Fan, Jeremy Freese, Tejomay Gadgil, Josh Gagné, Yue Gaobj, Andrew Halpern-Manners, Sonia P. Hashim, Sonia A. Hausen, Guanhua He, Kimberly Higuera, Bernie Hogan, Ilana M. Horwitz, Lisa M. Hummel, Naman Jain, Kun Jin, David Jurgens, Patrick C. Kaminski, Areg Karapetyan, E. H. Kim, Ben Leizman, Naijia Liu, Malte Möser, Andrew E. Mack, Mayank Mahajan, Noah Mandell, Helge-Johannes Marahrens, Diana Mercado-Garcia, Viola Mocz, Katariina Mueller-Gastell, Ahmed Musse, Qiankun Niu, William P. Nowak, Hamidreza Omidvar, Andrew Or, Karen Ouyang, Katy M. Pinto, Ethan Porter, Kristin E. Porter, Crystal Qian, Tamkinat Rauf, Anahit Sargsyan, Thomas Schaffner, Landon Schnabel, Bryan Schonfeld, Ben Sender, Jonathan D. Tang, Emma Tsurkov, Austin van Loon, Onur Varol, Xiafei Wang, Zhi Wang, Julia Wang, Flora Wang, Samantha Weissman, Kirstie Whitaker, Maria K Wolters, Wei Lee Woon, James Wu, Catherine Wu, Kengran Yang, Jingwen Yin, Bingyu Zhao, Chenyun Zhu, Jeanne Brooks-Gunn, Barbara E. Engelhardt, Moritz Hardt, Dean Knox, Karen Levy, Arvind Narayanan, Brandon M. Stewart, Duncan J. Watts, and Sara McLanahan. Proceedings of the National Academy of Sciences. Mar 2020, 201915006; DOI: 10.1073/pnas.1915006117 pdf |
2019
Finding Microaggressions in the Wild: A
Case for Locating Elusive Phenomena in Social Media
Posts Luke Breitfeller, Emily Ahn, David Jurgens, and Yulia Tsvetkov. Proceedings of 2019 Conference on Empirical Methods in Natural Language Processing and 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP), 2019. |
Perceptions of social roles across cultures. (Nominated for Best Paper) Meixing Dong, David Jurgens, Carmen Banea and Rada Mihalcea. Proceedings of Social Informatics (SocInfo), 2019. |
Suicide Among Older Adults Living in or Transitioning to Residential Long-term Care, 2003 to 2015 Briana Mezuk, Tomohiro M. Ko, Viktoryia A. Kalesnikava, and David Jurgens. JAMA Network Open 2019;2(6):e195627 |
Wetin dey with these comments? Modeling Sociolinguistic Factors Affecting Code-switching Behavior in Nigerian Online Discussions Innocent Ndubuisi-Obi*, Sayan Ghosh*, David Jurgens. Proceedings of the Annual Meeting of the Association for Computational Linguistics (ACL), 2019 |
A Just and Comprehensive Strategy for Using NLP to Address Online Abuse David Jurgens, Libby Hemphill and Eshwar Chandrasekharan. Proceedings of the Annual Meeting of the Association for Computational Linguistics (ACL), 2019 |
Smart, Responsible, and Upper Caste Only:Measuring Caste Attitudes through Large-Scale Analysis of Matrimonial Profiles (Best Paper Award) Ashwin Rajadesingan, Ramaswami Mahalingam, David Jurgens. Proceedings of the AAAI International Conference on Web and Social Media (ICWSM), 2019 pdf · Press: Times of India, Devdiscourse, Science X, Business Standard |
Demographic Inference and Representative Population Estimates from Multilingual Social Media Data. Zijian Wang, Scott Hale, David Ifeoluwa Adelani, Przemyslaw Grabowicz, Timo Hartmann, Fabian Flöck and David Jurgens*. Proceedings of the Web Conference, 2019 *Corresponding senior author pdf · demo · code · poster (Best Poster Presentation Award) |
Are All Successful Communities Alike? Characterizing and Predicting the Success of Online Communities. Tiago Cunha, David Jurgens, Chenhao Tan and Daniel Romero. Proceedings of the Web Conference, 2019 |
2018
It's going to be okay: Measuring Access to Support in Online Communities. Zijian Wang and David Jurgens. Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP), 2018 pdf · supplementary · website and data · code |
RtGender: A Corpus of Responses to Gender for Studying Gender Bias. Rob Voigt, David Jurgens, Vinodkumar Prabhakaran, Dan Jurafsky, and Yulia Tsvetkov. Proceedings of the Eleventh International Conference on Language Resources and Evaluation (LREC), 2018 pdf · data |
Measuring the Evolution of a Scientific Field through Citation Frames. David Jurgens, Srijan Kumar, Raine Hoover, Dan McFarland, Dan Jurafsky. Transactions of the Association for Computational Linguistics (TACL). 2018. pdf · website and data · code · video |
2017
An Analysis of Individuals' Behavior Change in Online Groups. David Jurgens, James McCorriston, and Derek Ruths. Proceedings of the 9th International Conference on Social Informatics (SocInfo). 2017. pdf (preprint) |
Writer Profiling Without the Writer's Text. David Jurgens, Yulia Tsvetkov, and Dan Jurafsky. Proceedings of the 9th International Conference on Social Informatics (SocInfo). 2017. pdf (preprint) |
Language from Police Body Camera Footage Shows Racial Disparities in Officer Respect. Rob Voigt, Nicholas P. Camp, Vinod Prabhakaran, William L. Hamilton, Rebecca C. Hetey, Camilla M. Griffiths, David Jurgens, Dan Jurafsky, and Jennifer L. Eberhardt. Proceedings of the National Academy of Science (PNAS). 2017. |
Incorporating Dialectal Variability for Socially Equitable Language Identification. David Jurgens, Yulia Tsvetkov, Dan Jurafsky. Proceedings of the Annual Meeting of the Association for Computational Linguistics. 2017. pdf · code · slides |
2016
User Migration in Online Social Networks: A Case Study on Reddit During A Period of Community Unrest. Edward Newell*, David Jurgens*, Hardik Vala, Jad Sassine, Caitrin Armstrong, Derek Ruths and Haji Mohammad Saleem. Proceedings of the 10th International AAAI Conference on Web and Social Media (ICWSM). 2016 |
Annotating Characters in Literary Corpora: A Scheme, the CHARLES Tool, and an Annotated Novel. Hardik Vala, Stefan Dimitrov, David Jurgens, Andrew Piper and Derek Ruths. Proceedings of the 10th edition of the Language Resources and Evaluation Conference (LREC). 2016. |
Semi-supervised Learning with Induced Word Senses for
State of the Art Word Sense Disambiguation. Osman Baskaya and David Jurgens. Journal of Artificial Intelligence Research (JAIR). 55(1) pp. 1025-1058. |
SemEval-2016 Task 14: Semantic Taxonomy Enrichment. David Jurgens and Mohammad Taher Pilehvar. Proceedings of the 10th International Workshop on Semantic Evaluation (SemEval). 2016. pdf · website |
2015
Mr. Bennet, his coachman, and the Archbishop walk into
a bar but only one of them gets recognized: On The
Difficulty of Detecting Characters in Literary Texts. Hardik Vala, David Jurgens, Andrew Piper, and Derek Ruths. Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP). 2015. pdf · data |
Evaluating learning language representations. J. Karlgren, J. Callin, K. Collins-Thompson, A.C. Gyllensten, A. Ekgren, D. Jurgens, A. Korhonen, F. Olsson, M. Sahlgren, and H. Schütze. Proceedings of Conference and Labs of Evaluation Forum (CLEF). 2015. |
Reading Between the Lines: Overcoming Data Sparsity for Accurate Classification of Lexical Relationships. Silvia Necsulescu, Sara Mendes, David Jurgens, Núria Bel, and Roberto Navigli. Proceedings of the Fourth Joint Conference on Lexical and Computational Semantics (*SEM). 2015. |
Everyone's Invited: A New Paradigm For Evaluation on
Non-transferable Datasets. David Jurgens, Tyler Finethy, Caitrin Armstrong, and Derek Ruths. Proceedings of the ICWSM Workshop on Standards and Practices in Large-Scale Social Media Research. 2015. pdf · code · FREESR code · FREESR website · project website |
Geolocation Prediction in Twitter Using Social Networks: A Critical Analysis and Review of Current Practice. David Jurgens, Tyler Finethy, James McCorriston, Yi Tian Xu, and Derek Ruths. Proceedings of the 9th International AAAI Conference on Web and Social Media (ICWSM). 2015 pdf · poster · code · website |
An Analysis of
Exercising Behavior in Online Populations.
David Jurgens, James McCorriston, and Derek Ruths. Proceedings of the 9th International AAAI Conference on Web and Social Media (ICWSM). 2015 pdf · poster · website |
Organizations are Users Too:
Characterizing and Detecting the Presence of
Organizations on Twitter. James McCorriston, David Jurgens, and Derek Ruths. Proceedings of the 9th International AAAI Conference on Web and Social Media (ICWSM). 2015 pdf · website · code · data |
Cross Level Semantic Similarity: An Evaluation Framework for Universal Measures of Similarity. David Jurgens, Mohammad Taher Pilehvar, and Roberto Navilgi. Journal of Language Resources and Evaluation. 50(1) pp. 5-30. pdf (preprint) |
Reserating the awesometastic: An automatic extension
of the WordNet taxonomy for novel terms. David Jurgens and Mohammad Taher Pilehvar. Proceeding of the Conference of the North American Chapter of the Association for Computational Linguistics – Human Language Technologies (NAACL-HLT). 2015. pdf · poster · download · website |
2014
It's All Fun and Games until Someone Annotates: Video
Games with a Purpose for Linguistic Annotation. David Jurgens and Roberto Navigli. Transactions of the Association for Computational Linguistics (TACL) 2014. pdf · slides: pdf, pptx · games! |
Geotagging One Hundred Million Twitter Accounts with Total Variation Minimization. Ryan Compton, David Jurgens, and David Allan. Proceedings of the IEEE International Conference on Big Data. 2014. Press: Forbes, MIT Technology Review, Business Insider, Daily Caller, Schneier on Security |
Twitter users #CodeSwitch hashtags! #MoltoImportante #wow #헐. David Jurgens, Stefan Dimitrov, and Derek Ruths. Proceedings of The First Workshop on Computational Approaches to Code Switching. 2014. pdf · blog post |
SemEval-2014 Task 3: Cross-Level Semantic Similarity. David Jurgens, Mohammad Taher Pilehvar, and Roberto Navigli. Proceedings of the 8th International Workshop on Semantic Evaluation (SemEval) 2014. pdf · slides · website |
Validating and Extending Semantic Knowledge Bases using Video Games with a Purpose. Daniele Vannella, David Jurgens, Daniele Scarfini, Domenico Toscani, and Roberto Navigli. Proceedings of the Annual Meeting for the Association for Computational Linguistics (ACL) 2014. pdf · poster · games! |
An analysis of ambiguity in word sense annotations. David Jurgens. Proceedings of the 9th International Conference on Language Resources and Evaluation (LREC) 2014. |
2013
Align, Disambiguate and Walk: A
Unified Approach for Measuring Semantic Similarity. (Best paper nominee) Mohammad T. Pilehvar, David Jurgens, and Roberto Navigli. Proceedings of the Annual Meeting for the Association for Computational Linguistics (ACL) 2013. pdf · slides · code |
That's what friends are for: Inferring location in online communities based on social relationships. David Jurgens. Proceedings of the 7th International AAAI Conference on Weblogs and Social Media (ICWSM) 2013. pdf · slides · video · Press: Follow the Crowd, MIT Technology Review |
Embracing Ambiguity: A Comparison of Annotation Methodologies for Crowdsourcing Word Sense Labels. David Jurgens. Proceedings of the North American Chapter of the Association for Computational Linguistics (NAACL) 2013. pdf · poster |
Characterizing Online Discussions in Microblogs Using Network Analysis. Veronika Strnadova, David Jurgens, and Tsai-Ching Lu. Proceedings of the AAAI Spring Symposium on Analyzing Microtext, 2013. |
SemEval-2013
Task 13: Word Sense Induction for Graded and Non-Graded
Senses. David Jurgens and Ioannis Klapaftis. Proceedings of the 7th International Workshop on Semantic Evaluation (SemEval) 2013. pdf · errata · slides · website |
SemEval-2013 Task 12: Multilingual Word Sense Disambiguation. Roberto Navigli, David Jurgens, and Daniele Vanilla. Proceedings of the 7th International Workshop on Semantic Evaluation (SemEval) 2013. pdf · website |
2012
Temporal Motifs Reveal the Dynamics of Editor Interactions in Wikipedia. David Jurgens and Tsai-Ching Lu. Proceedings of the 6th International AAAI Conference on Weblogs and Social Media (ICWSM) 2012. pdf · video |
Semeval-2012 task 2: Measuring degrees of relational similarity. David Jurgens, Saif M Mohammad, Peter D Turney, and Keith J Holyoak. Proceedings of the 6th International Workshop on Semantic Evaluation (SemEval), 2012. pdf · slides |
An Evaluation of Graded Sense Disambiguation using Word Sense Induction. David Jurgens. Proceedings of the First Joint Conference on Lexical and Computational Semantics (*SEM), 2012. pdf · slides |
Friends, Enemies, and Lovers: Detecting Communities in Networks Where Relationships Matter. David Jurgens and Tsai-Ching Lu. Proceedings of Web Science, 2012. |
2011
Word sense induction by community detection. David Jurgens. Proceedings of the Workshop on Graph-based Methods for Natural Language Processing (TextGraphs), 2011. |
Measuring the impact of sense similarity on word sense induction. David Jurgens and Keith Stevens. Proceedings of the First Workshop on Unsupervised Learning in NLP, 2011. |
2010
The
S-Space Package: An Open Source Package for Word Space
Models. David Jurgens and Keith Stevens. Proceedings of the ACL 2010 System Demonstrations, 2010. pdf · website · Mailing Lists: Users, Developers |
Capturing nonlinear structure in word spaces through dimensionality reduction. David Jurgens and Keith Stevens. Proceedings of the ACL Workshop on GEometrical Models of Natural Language (GEMS), 2010. |
HERMIT: Flexible clustering for the SemEval-2 WSI task. David Jurgens and Keith Stevens. Proceedings of the 5th International Workshop on Semantic Evaluation (SemEval), 2010. |
2009
Event detection in blogs using temporal random indexing. David Jurgens and Keith Stevens. Proceedings of the Workshop on Events in Emerging Text Types, 2009. |
2004
Road extraction from motion cues in aerial video. Robert Pless and David Jurgens. Proceedings of the 12th annual ACM international workshop on Geographic information systems, 2004. |