Joanna Byszuk
Department of Methodology
365

mgr Joanna Byszuk

Personal website – for workshop materials etc.

Education

  • 2017-2022 – doctoral programme in linguistics at the Institute of Polish Language Polish Academy of Sciences
  • 2017 – Master of Arts in English philology (translation studies), Jagiellonian University
  • 2017 – Bachelor of Arts and Science in Electronic Information Processing, Jagiellonian University
  • 2015 – Bachelor of Arts, English philology with German, Jagiellonian University

Publications

Papers

Šeļa, A., Nagy, B., Byszuk, J., Hernández-Lorenzo, L., Szemes, B. and Eder, M. (forthcoming). From stage to page: language independent bootstrap measures of distinctiveness in fictional speech, [pre-print].

Hernández-Lorenzo, L., and Byszuk J. (2023). Challenging stylometry: the authorship of the baroque play La Segunda Celestina, Digital Scholarship in the Humanities, advance access: https://doi.org/10.1093/llc/fqac063.

Byszuk, J. (2023). On Computers in text analysis. In: J. O’Sullivan (ed.), The Bloomsbury Handbook to the Digital Humanities, 159–68. London: Bloomsbury.

Byszuk, J. and Dombrowski, Q. (2022). Stylometric investigations into translationese: The Baby-Sitters Club across languages. In Misuraca, M., Scepi, G. and Spano, M. (eds), Proceedings of the 16th International Conference on Statistical Analysis of Textual Data, vol. 1. Naples, pp. 188–96, http://lexicometrica.univ-paris3.fr/jadt/JADT2022/VOL1.pdf.

Škorić, M., Stanković R., Ikonić Nešić, M., Byszuk, J. and Eder M. (2022) ‘Parallel Stylometric Document Embeddings with Deep Learning Based Language Models in Literary Authorship Attribution’, Mathematics, 10.5 (2022), 838 <https://doi.org/10.3390/math10050838>

Idziak, I., Šeļa, A., Woźniak, M., Leśniak, A., Byszuk J. and Eder M. (2021). Scalable handwritten text recognition system for lexicographic sources of under-resourced languages and alphabets. International Conference on Computational Science Proceedings.

Byszuk, J. (2020) The Voices of Doctor Who – how stylometry can be useful in revealing new information about TV series. Digital Humanities Quarterly, 14(4) link.

Byszuk, J., Woźniak, M., Kestemont, M., Leśniak, A., Łukasik, W., Šeļa, A. and Eder, M. (2020). Detecting direct speech in multilingual collection of 19th century novels. In LREC 2020available here pp. 100-104

Franzini, G., Kestemont, M., Rotari, G., Jander, M., Ochab, J. K., Franzini, E., Byszuk, J., Rybicki, J. Attributing Authorship in the Noisy Digitized Correspondence of Jacob and Wilhelm Grimm. Frontiers in Digital Humanities, 5. 2018, link.

Other

Byszuk, J. (2023) What is Authorship Attribution? In: Schöch, C., Dudar, J., & Fileva, E. (2023). CLS INFRA D3.2: Series of Five Short Survey Papers on Methodological Issues (= Survey of Methods in Computational Literary Studies) (v1.1.0). URL: https://methods.clsinfra.io, DOI: 10.5281/zenodo.7892112.

Byszuk, J. (2023) Analysis in Authorship Attribution. In: Schöch, C., Dudar, J., & Fileva, E. (2023). CLS INFRA D3.2: Series of Five Short Survey Papers on Methodological Issues (= Survey of Methods in Computational Literary Studies) (v1.1.0). URL: https://methods.clsinfra.io, DOI: 10.5281/zenodo.7892112.

Byszuk, J. (2023) Evaluation in Authorship Attribution. In: Schöch, C., Dudar, J., & Fileva, E. (2023). CLS INFRA D3.2: Series of Five Short Survey Papers on Methodological Issues (= Survey of Methods in Computational Literary Studies) (v1.1.0). URL: https://methods.clsinfra.io, DOI: 10.5281/zenodo.7892112.

Four articles promoting new digital humanities publications for “Nowości Badawcze NCK” 1/2021.

Mischke, D. Choiński, M., Byszuk, J, Göbel, M. “Network Analysis and Spatial Stylometry in American Drama Studies” DH2020 Book of Abstracts, Ottawa, ADHO, 2020

Byszuk, J. “Stylometry Expertise Conclusion: Assessing authorship of an anonymous Persian qasida” in: Khismatulin, A. “Amir Mu‘izzi Nishapuri. The Siyasat-nama/Siyar al-muluk: A Fabrication Ascribed to Nizam al-Mulk”. The series: THE PERSIAN MIRRORS FOR PRINCES WRITTEN IN THE SALJUQ PERIOD: ORIGINALS AND FABRICATIONS (I). St. Petersburg: Peterburgskoe Vostokovedenie; Moscow: Sadra, 2020. pp. 176-178 (online access: Stylometry Expertise Conclusion on Assessing authorship of an anonymous Persian qasida)

Frontini, F., Brando C., Byszuk J., Galleron I., Santos D. & Stanković R. “Named Entity Recognition for Distant Reading in ELTeC”, CLARIN Annual Conference 2020 Proceedings, p. 37-41, (online).

Byszuk, J. A review of the book Reading beyond the female: The relationship between perception of author gender and literary quality. Socjolingwistyka, 33. 2019. (online)

Hernandez Lorenzo, L., Byszuk, J. Challenging Stylometry: The Authorship of the Baroque Play La Segunda Celestina. In Digital Humanities 2019: Book of Abstracts. University of Utrecht.

Eder M., Byszuk, J. Feature Selection in Authorship Attribution: Ordering the Wordlist. In Digital Humanities 2019: Book of Abstracts. University of Utrecht.

Ochab, J.K., Byszuk, J., Pielström, S. and Eder, M. Identifying Similarities in Text Analysis: Hierarchical Clustering (Linkage) versus Network Clustering (Community Detection). In Digital Humanities 2019: Book of Abstracts. University of Utrecht. Utrecht.

Byszuk, J. Tracing Showrunners’ Impact. Book of Abstracts AIUCD 2018. 190-192.

Selection of talks

Invited talks

  • Towards multimodal stylometry – possibilities and challenges of new approach to film and TV series analysis” w ramach Brave New Humanities? A Novel Perceptions Symposium on Computational Literary Studies, 8 IV 2022, online (link).
  • “Towards multimodal stylometry – possibilities and challenges of new approach to film and TV series analysis” for Natural Language Processing Seminar 2021–2022 at the Institute of Computer Science, Polish Academy of Sciences, 6 XII 2021, online.
  • “What can be measured with stylometry? On language, creativity and numbers.” for the Linguistics seminar at the University of Bielefeld, 12 X 2021, online and in person.
  • “The Voices of Doctor Who – How Stylometry Can be Useful in Revealing New Information About TV Series”, for the seminars of the Centre for Digital Humanities at the University of Groningen, 9 III 2021, online.
  • “Direct speech for multilingual corpora some problems and one possible solution”, SIG_DLS Workshop: Tool Criticism 3.0. Present, past, and future methods in Digital Literary Stylistics, ADHO Special Interest Group for Digital Literary Studies, 20 VII 2020, online.
  • “Literary Studies: Textual analysis and stylometry with WebSty”, CLARIN Café III CLARIN for Researchers, 8 VII 2020, online.
  • “Stylometry in textual analysis and beyond”, Colloquium in Digital Cultural Heritage, 22 I 2020, Köln.
  • “AI in Computational Linguistics and Humanities”, Giersch Symposium “AI for Science”, 18-22 XI 2019, Frankfurt am Main.

Peer-reviewed conferences

  • Herrmann, J. B., Byszuk, J. and Grisot, G. (2022). Using word embeddings for validation and enhancement of spatial entity lists. Digital Humanities 2022: Conference Abstracts. Tokyo: University of Tokyo, pp. 239–41, https://dh2022.dhii.asia/dh2022bookofabsts.pdf
  • 19-21 X 2022, Byszuk, J., Kunda, B. (2022) Coping Strategies Used by Male Young Adults in Contemporary TV Series, Discourses of Fictional (Digital) TV Series

    19-21 X 2022, Byszuk, J. (2022) What Makes a Captain: Quantitative Analysis of Discourses of Power across Star Trek Series, Discourses of Fictional (Digital) TV Series

  • 14-15 IX 2022, Šeļa, Artjoms, Ben Nagy, Joanna Byszuk, Laura Hernández-Lorenzo, Botond Szemes, and Maciej Eder (2022) From stage to page: language independent bootstrap measures of distinctiveness in fictional speech, Workshop on Computational Drama Analysis: Achievements and Opportunities
  • 22 IV 2022, Closing conference of the COST Action Distant Reading for European Literary History, Artjoms Šeļa, Joanna Byszuk, Bartlomiej Kunda, Laura Hernández-Lorenzo, Botond Szemes, Maciej Eder. (2022). Imagined differences: approaches to variation in fictional character voices in literary history. 

  • 21 IV 2022 Closing conference of the COST Action Distant Reading for European Literary History, Ranka Stanković, Diana Santos, Carmen Brando, Gábor Palkó, Joanna Byszuk. (2022). Distant Reading of ELTeC text collection through Named Entities. 

  • “Challenging Stylometry: The Authorship of the Baroque Play La Segunda Celestina” with Laura Hernandez Lorenzo (Universidad de Sevilla), DH 2019, 9-12 VII 2019, Utrecht.
  • “Feature Selection in Authorship Attribution: Ordering the Wordlist” with Maciej Eder, DH 2019, 9-12 VII 2019, Utrecht.
  • “Identifying Similarities in Text Analysis: Hierarchical Clustering (Linkage) versus Network Clustering (Community Detection)” with Jeremi K. Ochab, Maciej Eder and Steffen Pielström (University of Würzburg), DH 2019, 9-12 VII 2019, Utrecht.
  • 8 VI 2019, “Attribution of Authorship for Medieval Persian Quasidas with Stylometry” with Alexey Khismatulin (Institute of Oriental Manuscripts, Russian Academy of Sciences), #Right2Left Workshop, Victoria BC. 
  • 28 II – 2 III 2019, “Older language learner: a comparative corpus study of FL performance and learning materials”, poster with Karolina Czopek (Uniwersytet Warszawski), 4th CLARe Conference, Helsinki.
  • 7-9 XII 2018, EADH Conference, long paper “Enhanced digital editions: retrieving POS tags from pre-digital word indexes” with Maciej Eder and Magdalena Król. 
  • 7-9 XII 2018, EADH Conference, long paper “Who is the author? Modeling creative relations in television writing”.
  • 6 VII 2018, Qualico International Quantitative Linguistics Conference, poster “Analysis of cross-lingual semantic change in professional discourse with quantitative methods”.
  • 5 VII 2018, Qualico International Quantitative Linguistics Conference, long paper “Zipf’s law and subsets of lexis” with Maciej Eder and Rafał L. Górski.
  • 31 I – 2 II 2018., 7th AIUCD Conference, long paper “Tracing Showrunner’s Impact”, ITC Conference Grant (COST Action Distant Reading COST CA16204).
  • 20-22 IV 2017, April Conference Fourteen, paper “The Voices of Doctor Who”.
  • 9-11 V 2017, 11. Studenckie Warsztaty Tłumaczeniowe, paper “Jak (po angielsku) pisze polski programista?”

Participation in grant projects

Conducted workshops

  • “Examining Language Variation with Stylometry”  –  a course at the  IQLA-GIAT Summer School (4th – 8th September 2023 in Padua)
  • “Drafting Standards for Stylometry”  – a pre-conference workshop accompanying DH 2023 ADHO Conference in Graz, co-organized with Patrick Juola
  • “DH2023 Workshop – SIG-DLS Seven Years on (Program)”  –  a pre-conference workshop accompanying DH 2023 ADHO Conference in Graz, co-organized with Simone Rebora, Pablo Ruiz Fabio, J. Berenike Herrmann and Suzanne Mpouli
  • “Stylometry with R: Computer-Assisted Analysis of Literary Texts” – a course at the Digital Humanities Summer Institute, co-taught with Maciej Eder and Artjoms Šeļa, 6-10 VI 2022.
  • Advanced Stylometry Workshop at the ELTE Department of Digital Humanities, with Maciej Eder and Artjoms Šeļa, 6-7 V 2022.
  • “Stylometry with R” at Distant Reading Training School: Exploring ELTeC: Use-Cases for Information Extraction and Analysis within the COST Action 16204: Distant Reading for European Literary History, with Maciej Eder and Artjoms Šeļa, 22-24 III 2022.
  • “Stylometry with R: Computer-Assisted Analysis of Literary Texts” – a course at the Digital Humanities Summer Institute, co-taught with Maciej Eder and Artjoms Šeļa, 14-18 VI 2021.
  • “Stylometry with R: Computer-Assisted Analysis of Literary Texts” – a course at the Digital Humanities Summer Institute, co-taught with Maciej Eder – Victoria BC (Kanada), 10-14 VI 2019.
  • “Stylometry with R”– a training session at the Distant Reading Training School 2018, Workshop 1: Methods and Tools of Distant Reading Adapted to Multiple European Languages in Galway, 5-7 XII 2018.
  • “Stylometry with R”– a course at the Третья Московско-тартуская школа по цифровым гуманитарным исследованиям in Moscow, 4-7 X 2018.
  • “Stylometry with R: Computer-Assisted Analysis of Literary Texts” – a course at the Digital Humanities Summer Institute, co-taught with dr Jan Rybicki (Uniwersytet Jagielloński) – Victoria BC (Kanada), 11-15 VI 2018.

Other activities

Ikona z ludzikiem do otwierania panelu kontrolnego WCAG
Aa+
Aa-
Ikona kontrastu
Ikona linku
Ikona skali szarości
Ikona zmiany na czytelne czcionki
Ikona resetu ustawień WCAG