Computation and Language
Authors and titles for June 2024
- arXiv:2406.00015 [pdf, other]
  Title: Use of natural language processing to extract and classify papillary thyroid cancer features from surgical pathology reports
  Authors: Ricardo Loor-Torres, Yuqi Wu, Esteban Cabezas, Mariana Borras, David Toro-Tobon, Mayra Duran, Misk Al Zahidy, Maria Mateo Chavez, Cristian Soto Jacome, Jungwei W. Fan, Naykky M. Singh Ospina, Yonghui Wu, Juan P. Brito
  Comments: 21 pages, 6 figures, 7 tables
  Subjects: Computation and Language (cs.CL)
- arXiv:2406.00016 [pdf, other]
  Title: Exploration of Attention Mechanism-Enhanced Deep Learning Models in the Mining of Medical Textual Data
  Comments: arXiv admin note: text overlap with arXiv:2405.11704 by other authors
  Subjects: Computation and Language (cs.CL)
- arXiv:2406.00017 [pdf, html, other]
  Title: PTA: Enhancing Multimodal Sentiment Analysis through Pipelined Prediction and Translation-based Alignment
  Authors: Shezheng Song, Shasha Li, Shan Zhao, Chengyu Wang, Xiaopeng Li, Jie Yu, Qian Wan, Jun Ma, Tianwei Yan, Wentao Ma, Xiaoguang Mao
  Comments: Code will be released upon publication
  Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Multimedia (cs.MM)
- arXiv:2406.00018 [pdf, other]
  Title: Large Language Models' Detection of Political Orientation in Newspapers
  Subjects: Computation and Language (cs.CL); Information Retrieval (cs.IR)
- arXiv:2406.00019 [pdf, html, other]
  Title: EHR-SeqSQL : A Sequential Text-to-SQL Dataset For Interactively Exploring Electronic Health Records
  Comments: ACL 2024 (Findings)
  Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Databases (cs.DB); Information Retrieval (cs.IR)
- arXiv:2406.00020 [pdf, html, other]
  Title: Harmful Speech Detection by Language Models Exhibits Gender-Queer Dialect Bias
  Subjects: Computation and Language (cs.CL); Computers and Society (cs.CY)
- arXiv:2406.00021 [pdf, html, other]
  Title: CrossVoice: Crosslingual Prosody Preserving Cascade-S2ST using Transfer Learning
  Comments: 8 pages, Accepted at ICLR 2024 - Tiny Track
  Subjects: Computation and Language (cs.CL); Sound (cs.SD); Audio and Speech Processing (eess.AS)
- arXiv:2406.00022 [pdf, html, other]
  Title: Multilingual Prosody Transfer: Comparing Supervised & Transfer Learning
  Comments: 7 pages, Accepted to ICLR 2024 - Tiny Track
  Subjects: Computation and Language (cs.CL); Sound (cs.SD); Audio and Speech Processing (eess.AS)
- arXiv:2406.00023 [pdf, html, other]
  Title: Expert-Token Resonance MoE: Bidirectional Routing with Efficiency Affinity-Driven Active Selection
  Subjects: Computation and Language (cs.CL)
- arXiv:2406.00024 [pdf, html, other]
  Title: Embedding-Aligned Language Models
  Comments: Accepted Neurips 2024
  Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Emerging Technologies (cs.ET); Machine Learning (cs.LG)
- arXiv:2406.00025 [pdf, html, other]
  Title: SCALM: Towards Semantic Caching for Automated Chat Services with Large Language Models
  Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
- arXiv:2406.00027 [pdf, html, other]
  Title: Adapting PromptORE for Modern History: Information Extraction from Hispanic Monarchy Documents of the XVIth Century
  Subjects: Computation and Language (cs.CL); Information Retrieval (cs.IR); Machine Learning (cs.LG)
- arXiv:2406.00028 [pdf, other]
  Title: Word Sense Disambiguation in Persian: Can AI Finally Get It Right?
  Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
- arXiv:2406.00029 [pdf, html, other]
  Title: Clustered Retrieved Augmented Generation (CRAG)
  Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
- arXiv:2406.00030 [pdf, html, other]
  Title: Large Language Model Pruning
  Authors: Hanjuan Huang (1) (2), Hao-Jia Song (1), Hsing-Kuo Pao (1) ((1) Dept. of Computer Science and Information Engineering National Taiwan University of Science and Technology, Taipei, Taiwan, (2) College of Mechanical and Electrical Engineering, WUYI University, Wuyishan, China)
  Comments: 17 pages, 7 figures, 2 tables
  Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- arXiv:2406.00031 [pdf, html, other]
  Title: AMGPT: a Large Language Model for Contextual Querying in Additive Manufacturing
  Comments: 54 pages, 4 figures
  Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
- arXiv:2406.00032 [pdf, html, other]
  Title: Paths of A Million People: Extracting Life Trajectories from Wikipedia
  Comments: Accepted to ICWSM 2025. 15 pages
  Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Information Retrieval (cs.IR)
- arXiv:2406.00033 [pdf, html, other]
  Title: Retrieval-Augmented Conversational Recommendation with Prompt-based Semi-Structured Natural Language State Tracking
  Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
- arXiv:2406.00034 [pdf, html, other]
  Title: Adaptive Activation Steering: A Tuning-Free LLM Truthfulness Improvement Method for Diverse Hallucinations Categories
  Authors: Tianlong Wang, Xianfeng Jiao, Yinghao Zhu, Zhongzhi Chen, Yifan He, Xu Chu, Junyi Gao, Yasha Wang, Liantao Ma
  Comments: ACM TheWebConf 2025 Conference (WWW 2025) Research Track
  Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
- arXiv:2406.00036 [pdf, html, other]
  Title: EMERGE: Enhancing Multimodal Electronic Health Records Predictive Modeling with Retrieval-Augmented Generation
  Authors: Yinghao Zhu, Changyu Ren, Zixiang Wang, Xiaochen Zheng, Shiyun Xie, Junlan Feng, Xi Zhu, Zhoujun Li, Liantao Ma, Chengwei Pan
  Comments: CIKM 2024 Full Research Paper; arXiv admin note: text overlap with arXiv:2402.07016
  Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- arXiv:2406.00037 [pdf, html, other]
  Title: Aligning LLMs through Multi-perspective User Preference Ranking-based Feedback for Programming Question Answering
  Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
- arXiv:2406.00038 [pdf, html, other]
  Title: ViSpeR: Multilingual Audio-Visual Speech Recognition
  Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
- arXiv:2406.00039 [pdf, other]
  Title: How Ready Are Generative Pre-trained Large Language Models for Explaining Bengali Grammatical Errors?
  Comments: Accepted at Educational Data Mining 2024
  Subjects: Computation and Language (cs.CL)
- arXiv:2406.00040 [pdf, html, other]
  Title: Unveiling Themes in Judicial Proceedings: A Cross-Country Study Using Topic Modeling on Legal Documents from India and the UK
  Subjects: Computation and Language (cs.CL)
- arXiv:2406.00041 [pdf, html, other]
  Title: QUB-Cirdan at "Discharge Me!": Zero shot discharge letter generation by open-source LLM
  Comments: BioNLP 2024 workshop
  Subjects: Computation and Language (cs.CL)
- arXiv:2406.00044 [pdf, html, other]
- arXiv:2406.00045 [pdf, html, other]
  Title: Personalized Steering of Large Language Models: Versatile Steering Vectors Through Bi-directional Preference Optimization
  Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
- arXiv:2406.00046 [pdf, html, other]
  Title: Hate Speech Detection with Generalizable Target-aware Fairness
  Comments: To appear in KDD 2024
  Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
- arXiv:2406.00048 [pdf, other]
  Title: Towards a theory of how the structure of language is acquired by deep neural networks
  Comments: NeurIPS 2024
  Subjects: Computation and Language (cs.CL); Disordered Systems and Neural Networks (cond-mat.dis-nn); Machine Learning (cs.LG)
- arXiv:2406.00049 [pdf, html, other]
  Title: QUEST: Quality-Aware Metropolis-Hastings Sampling for Machine Translation
  Authors: Gonçalo R. A. Faria, Sweta Agrawal, António Farinhas, Ricardo Rei, José G. C. de Souza, André F. T. Martins
  Comments: Accepted at NEURIPS Main 2024
  Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
- arXiv:2406.00050 [pdf, html, other]
  Title: An Empirical Analysis on Large Language Models in Debate Evaluation
  Comments: Accepted to ACL 2024 main
  Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
- arXiv:2406.00053 [pdf, html, other]
  Title: Dual Process Learning: Controlling Use of In-Context vs. In-Weights Strategies with Weight Forgetting
  Comments: 10 pages, 6 figures
  Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
- arXiv:2406.00057 [pdf, html, other]
  Title: Toward Conversational Agents with Context and Time Sensitive Long-term Memory
  Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
- arXiv:2406.00059 [pdf, html, other]
  Title: Conveyor: Efficient Tool-aware LLM Serving with Tool Partial Execution
  Comments: 11 pages, 8 figures
  Subjects: Computation and Language (cs.CL); Distributed, Parallel, and Cluster Computing (cs.DC); Machine Learning (cs.LG)
- arXiv:2406.00060 [pdf, html, other]
  Title: Cascade-Aware Training of Language Models
  Authors: Congchao Wang, Sean Augenstein, Keith Rush, Wittawat Jitkrittum, Harikrishna Narasimhan, Ankit Singh Rawat, Aditya Krishna Menon, Alec Go
  Comments: 22 pages, 13 figures
  Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
- arXiv:2406.00062 [pdf, html, other]
  Title: Unlocking the Potential of Large Language Models for Clinical Text Anonymization: A Comparative Study
  Authors: David Pissarra, Isabel Curioso, João Alveira, Duarte Pereira, Bruno Ribeiro, Tomás Souper, Vasco Gomes, André V. Carreiro, Vitor Rolla
  Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Cryptography and Security (cs.CR); Machine Learning (cs.LG)
- arXiv:2406.00069 [pdf, html, other]
  Title: Confidence-Aware Sub-Structure Beam Search (CABS): Mitigating Hallucination in Structured Data Generation with Large Language Models
  Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
- arXiv:2406.00159 [pdf, other]
  Title: On the referential capacity of language models: An internalist rejoinder to Mandelkern & Linzen
  Subjects: Computation and Language (cs.CL)
- arXiv:2406.00179 [pdf, other]
  Title: Long-Span Question-Answering: Automatic Question Generation and QA-System Ranking via Side-by-Side Evaluation
  Authors: Bernd Bohnet, Kevin Swersky, Rosanne Liu, Pranjal Awasthi, Azade Nova, Javier Snaider, Hanie Sedghi, Aaron T Parisi, Michael Collins, Angeliki Lazaridou, Orhan Firat, Noah Fiedel
  Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
- arXiv:2406.00197 [pdf, html, other]
  Title: Re3: A Holistic Framework and Dataset for Modeling Collaborative Document Revision
  Comments: accepted to ACL2024 main
  Journal-ref: Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics ACL 2024 (Volume 1: Long Papers)
  Subjects: Computation and Language (cs.CL)
- arXiv:2406.00222 [pdf, html, other]
  Title: Learning to Clarify: Multi-turn Conversations with Action-Based Contrastive Self-Training
  Comments: ICLR 2025; Code: this https URL
  Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- arXiv:2406.00226 [pdf, other]
  Title: Entangled Relations: Leveraging NLI and Meta-analysis to Enhance Biomedical Relation Extraction
  Comments: 17 pages, 1 figure
  Subjects: Computation and Language (cs.CL)
- arXiv:2406.00244 [pdf, html, other]
  Title: Controlling Large Language Model Agents with Entropic Activation Steering
  Subjects: Computation and Language (cs.CL)
- arXiv:2406.00257 [pdf, html, other]
  Title: Are Large Vision Language Models up to the Challenge of Chart Comprehension and Reasoning? An Extensive Investigation into the Capabilities and Limitations of LVLMs
  Authors: Mohammed Saidul Islam, Raian Rahman, Ahmed Masry, Md Tahmid Rahman Laskar, Mir Tafseer Nayeem, Enamul Hoque
  Subjects: Computation and Language (cs.CL)
- arXiv:2406.00284 [pdf, other]
  Title: A Closer Look at Logical Reasoning with LLMs: The Choice of Tool Matters
  Comments: Code and data are publicly available at: this https URL
  Subjects: Computation and Language (cs.CL)
- arXiv:2406.00303 [pdf, html, other]
  Title: Multi-Dimensional Optimization for Text Summarization via Reinforcement Learning
  Comments: ACL 2024
  Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
- arXiv:2406.00314 [pdf, html, other]
  Title: CASE: Efficient Curricular Data Pre-training for Building Assistive Psychology Expert Models
  Authors: Sarthak Harne, Monjoy Narayan Choudhury, Madhav Rao, TK Srikanth, Seema Mehrotra, Apoorva Vashisht, Aarushi Basu, Manjit Sodhi
  Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
- arXiv:2406.00343 [pdf, html, other]
  Title: Beyond Metrics: Evaluating LLMs' Effectiveness in Culturally Nuanced, Low-Resource Real-World Scenarios
  Authors: Millicent Ochieng, Varun Gumma, Sunayana Sitaram, Jindong Wang, Vishrav Chaudhary, Keshet Ronen, Kalika Bali, Jacki O'Neill
  Subjects: Computation and Language (cs.CL)
- arXiv:2406.00367 [pdf, other]
  Title: RoBERTa-BiLSTM: A Context-Aware Hybrid Model for Sentiment Analysis
  Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Computational Engineering, Finance, and Science (cs.CE)
- arXiv:2406.00380 [pdf, html, other]
  Title: HonestLLM: Toward an Honest and Helpful Large Language Model
  Authors: Chujie Gao, Siyuan Wu, Yue Huang, Dongping Chen, Qihui Zhang, Zhengyan Fu, Yao Wan, Lichao Sun, Xiangliang Zhang
  Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
