{"id":1333,"date":"2025-10-15T10:35:00","date_gmt":"2025-10-15T09:35:00","guid":{"rendered":"https:\/\/dariah.ie\/wordpress\/?p=1333"},"modified":"2025-10-02T17:40:32","modified_gmt":"2025-10-02T16:40:32","slug":"cleaning-and-reconciling-literary-historical-data-with-ai-reflections-from-the-stemma-project","status":"publish","type":"post","link":"https:\/\/dariah.ie\/wordpress\/2025\/10\/cleaning-and-reconciling-literary-historical-data-with-ai-reflections-from-the-stemma-project\/","title":{"rendered":"Cleaning and Reconciling Literary Historical Data with AI: Reflections from the STEMMA Project"},"content":{"rendered":"\n<p class=\"has-medium-font-size\">Cleaning and Reconciling Literary Historical Data with AI: Reflections from the STEMMA Project<\/p>\n\n\n\n<p><strong>Date: 21 October 2025 (Tuesday)<\/strong><br><strong>Time: 4:00 pm (HKT)<\/strong><br><strong>Via Zoom<\/strong><\/p>\n\n\n\n<p><strong>Speaker: Prof. Erin McCarthy,&nbsp;<\/strong>Professor of English Literature and Computational Humanities and the Principal Investigator of the STEMMA Project, University of Galway<\/p>\n\n\n\n<p>Click&nbsp;<a href=\"https:\/\/cloud.itsc.cuhk.edu.hk\/webform\/view.php?id=13716188\">here<\/a>&nbsp;to register.<\/p>\n\n\n\n<p><strong>About the talk<\/strong><\/p>\n\n\n\n<p>The European Research Council-funded project \u201cSTEMMA: Systems of Transmitting Early Modern Manuscript Verse, 1475\u20131700\u201d aims to build the first large-scale computational model of the circulation of English-language poetry. To do so, the STEMMA team has reconciled five of the most comprehensive sources of data about early modern poetic manuscripts.&nbsp;In this talk, Prof. McCarthy will describe the use of computational methods such as locality-sensitive hashing, cosine similarity, and LLM agents to assist with the cleaning and reconciliation of historical data. 
These methods allow us to strike a balance between working with \u201cdirty\u201d data and retaining evidence of the untidy state in which it was found. However, they still require significant computational effort and literary historical supervision. The talk will therefore reflect on the opportunities and challenges presented by such work and offer ideas about future directions.<\/p>\n\n\n\n<p><strong>About the speaker<\/strong><\/p>\n\n\n\n<p>Erin A. McCarthy is Established Professor of English Literature and Computational Humanities and the Principal Investigator of the European Research Council-funded project \u201cSTEMMA: Systems of Transmitting Early Modern Manuscript Verse, 1475\u20131700\u201d at the University of Galway. She is the author of Doubtful Readers: Print, Poetry, and the Reading Public (Oxford University Press, 2020), which was named an Outstanding Academic Title by CHOICE and won the 2020 John Donne Society Award for Distinguished Publication. She is currently completing two monographs: a jointly authored monograph about the findings of the RECIRC project, \u201cThe Reception and Circulation of Early Modern Women\u2019s Writing in Manuscript Miscellanies, 1550\u20131700,\u201d with Marie-Louise Coolahan and Sajed Chowdhury, and a sole-authored monograph called \u201cInterpreting Early Modern Manuscripts: Towards a New Methodology.\u201d Her scholarship has also appeared in the John Donne Journal, SEL: Studies in English Literature 1500\u20131900, the Review of English Studies, Criticism, and Reformation.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>Cleaning and Reconciling Literary Historical Data with AI: Reflections from the STEMMA Project Date: 21 October 2025 (Tuesday) Time: 4:00 pm (HKT) Via Zoom Speaker: Prof. Erin McCarthy,&nbsp;Professor of English Literature and Computational Humanities and the Principal Investigator of the STEMMA Project, University of Galway Click&nbsp;here&nbsp;to register. 
About the talk The European Research Council-funded project \u201cSTEMMA: Systems &#8230; <span class=\"more\"><a class=\"more-link\" href=\"https:\/\/dariah.ie\/wordpress\/2025\/10\/cleaning-and-reconciling-literary-historical-data-with-ai-reflections-from-the-stemma-project\/\">[Read more&#8230;]<\/a><\/span><\/p>\n","protected":false},"author":8,"featured_media":0,"comment_status":"closed","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"jetpack_post_was_ever_published":false,"_jetpack_newsletter_access":"","_jetpack_dont_email_post_to_subs":false,"_jetpack_newsletter_tier_id":0,"_jetpack_memberships_contains_paywalled_content":false,"_jetpack_memberships_contains_paid_content":false,"footnotes":"","jetpack_publicize_message":"","jetpack_publicize_feature_enabled":true,"jetpack_social_post_already_shared":false,"jetpack_social_options":{"image_generator_settings":{"template":"highway","default_image_id":0,"font":"","enabled":false},"version":2}},"categories":[103,9,113],"tags":[],"class_list":{"0":"entry","1":"post","2":"publish","3":"author-joan-y-murphytcd-ie","4":"post-1333","6":"format-standard","7":"category-digital-humanities","8":"category-events","9":"category-literature"},"jetpack_publicize_connections":[],"jetpack_featured_media_url":"","jetpack_sharing_enabled":true,"jetpack_shortlink":"https:\/\/wp.me\/p9zaVM-lv","_links":{"self":[{"href":"https:\/\/dariah.ie\/wordpress\/wp-json\/wp\/v2\/posts\/1333","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/dariah.ie\/wordpress\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/dariah.ie\/wordpress\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/dariah.ie\/wordpress\/wp-json\/wp\/v2\/users\/8"}],"replies":[{"embeddable":true,"href":"https:\/\/dariah.ie\/wordpress\/wp-json\/wp\/v2\/comments?post=1333"}],"version-history":[{"count":2,"href":"https:\/\/dariah.ie\/wordpress\/wp-json\/wp\/v2\/posts\/1333\/revisions"}],"predecessor
-version":[{"id":1335,"href":"https:\/\/dariah.ie\/wordpress\/wp-json\/wp\/v2\/posts\/1333\/revisions\/1335"}],"wp:attachment":[{"href":"https:\/\/dariah.ie\/wordpress\/wp-json\/wp\/v2\/media?parent=1333"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/dariah.ie\/wordpress\/wp-json\/wp\/v2\/categories?post=1333"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/dariah.ie\/wordpress\/wp-json\/wp\/v2\/tags?post=1333"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}