Joanna Byszuk
Department of Methodology
365

dr Joanna Byszuk

Personal website – for workshop materials etc.

Education

  • 2017-2022 – doctoral programme in linguistics at the Institute of Polish Language Polish Academy of Sciences
  • 2017 – Master of Arts in English philology (translation studies), Jagiellonian University
  • 2017 – Bachelor of Arts and Science in Electronic Information Processing, Jagiellonian University
  • 2015 – Bachelor of Arts, English philology with German, Jagiellonian University

Publications

Papers

Šeļa, A., Nagy, B., Byszuk, J., Hernández-Lorenzo, L., Szemes, B. and Eder, M. (2024). From stage to page: language independent bootstrap measures of distinctiveness in fictional speech, In: M. Andersen and N Reiter (ed.) Computational Drama Analysis. Reflecting on Methods and Interpretations, pp. 149-166, De Gruyter[pre-print].

Hernández-Lorenzo, L., and Byszuk J. (2023). Challenging stylometry: the authorship of the baroque play La Segunda Celestina, Digital Scholarship in the Humanities, Volume 38, Issue 2, pp. 544–558: https://doi.org/10.1093/llc/fqac063.

Byszuk, J. (2023). On Computers in Text Analysis. In: J. O’Sullivan (ed.), The Bloomsbury Handbook to the Digital Humanities, 159–68. London: Bloomsbury.

Byszuk, J. and Dombrowski, Q. (2022). Stylometric investigations into translationese: The Baby-Sitters Club across languages. In: Misuraca, M., Scepi, G. and Spano, M. (ed.), Proceedings of the 16th International Conference on Statistical Analysis of Textual Data, vol. 1. Naples, pp. 188–96, http://lexicometrica.univ-paris3.fr/jadt/JADT2022/VOL1.pdf.

Škorić, M., Stanković R., Ikonić Nešić, M., Byszuk, J. and Eder M. (2022). Parallel Stylometric Document Embeddings with Deep Learning Based Language Models in Literary Authorship Attribution, Mathematics, 10.5, 838 https://doi.org/10.3390/math10050838.

Idziak, I., Šeļa, A., Woźniak, M., Leśniak, A., Byszuk J. and Eder M. (2021). Scalable handwritten text recognition system for lexicographic sources of under-resourced languages and alphabets. International Conference on Computational Science Proceedings.

Byszuk, J. (2020) The Voices of Doctor Who – how stylometry can be useful in revealing new information about TV series. Digital Humanities Quarterly, 14(4) link.

Byszuk, J., Woźniak, M., Kestemont, M., Leśniak, A., Łukasik, W., Šeļa, A. and Eder, M. (2020). Detecting direct speech in multilingual collection of 19th century novels. In LREC 2020available here pp. 100-104

Franzini, G., Kestemont, M., Rotari, G., Jander, M., Ochab, J. K., Franzini, E., Byszuk, J., Rybicki, J. Attributing Authorship in the Noisy Digitized Correspondence of Jacob and Wilhelm Grimm. Frontiers in Digital Humanities, 5. 2018, link.

Other

Byszuk, J. (2023) What is Authorship Attribution? In: Schöch, C., Dudar, J., & Fileva, E. (2023). CLS INFRA D3.2: Series of Five Short Survey Papers on Methodological Issues (= Survey of Methods in Computational Literary Studies) (v1.1.0). URL: https://methods.clsinfra.io, DOI: 10.5281/zenodo.7892112.

Byszuk, J. (2023) Analysis in Authorship Attribution. In: Schöch, C., Dudar, J., & Fileva, E. (2023). CLS INFRA D3.2: Series of Five Short Survey Papers on Methodological Issues (= Survey of Methods in Computational Literary Studies) (v1.1.0). URL: https://methods.clsinfra.io, DOI: 10.5281/zenodo.7892112.

Byszuk, J. (2023) Evaluation in Authorship Attribution. In: Schöch, C., Dudar, J., & Fileva, E. (2023). CLS INFRA D3.2: Series of Five Short Survey Papers on Methodological Issues (= Survey of Methods in Computational Literary Studies) (v1.1.0). URL: https://methods.clsinfra.io, DOI: 10.5281/zenodo.7892112.

Byszuk J. (2021) Four articles promoting new digital humanities publications for “Nowości Badawcze NCK” 1/2021.

Mischke, D. Choiński, M., Byszuk, J, Göbel, M. (2020) Network Analysis and Spatial Stylometry in American Drama Studies” DH2020 Book of Abstracts, ADHO Conference.

Byszuk, J. (2020) Stylometry Expertise Conclusion: Assessing authorship of an anonymous Persian qasida, In: Khismatulin, A. Amir Mu‘izzi Nishapuri. The Siyasat-nama/Siyar al-muluk: A Fabrication Ascribed to Nizam al-Mulk. The series: THE PERSIAN MIRRORS FOR PRINCES WRITTEN IN THE SALJUQ PERIOD: ORIGINALS AND FABRICATIONS (I). St. Petersburg: Peterburgskoe Vostokovedenie; Moscow: Sadra, 2020. pp. 176-178 (online access: Stylometry Expertise Conclusion on Assessing authorship of an anonymous Persian qasida)

Frontini, F., Brando C., Byszuk J., Galleron I., Santos D. & Stanković R. (2020) Named Entity Recognition for Distant Reading in ELTeC, CLARIN Annual Conference 2020 Proceedings, pp. 37-41, (online).

Byszuk, J. (2019) A review of the book Reading beyond the female: The relationship between perception of author gender and literary quality. Socjolingwistyka, 33. 2019. (online)

Hernandez Lorenzo, L., Byszuk, J. (2019) Challenging Stylometry: The Authorship of the Baroque Play La Segunda Celestina. In Digital Humanities 2019: Book of Abstracts. University of Utrecht.

Eder M., Byszuk, J. (2019) Feature Selection in Authorship Attribution: Ordering the Wordlist. In Digital Humanities 2019: Book of Abstracts. University of Utrecht.

Ochab, J.K., Byszuk, J., Pielström, S. and Eder, M. (2019) Identifying Similarities in Text Analysis: Hierarchical Clustering (Linkage) versus Network Clustering (Community Detection). In Digital Humanities 2019: Book of Abstracts. University of Utrecht. Utrecht.

Byszuk, J. (2018) Tracing Showrunners’ Impact. Book of Abstracts AIUCD 2018. 190-192.

Selection of talks

Invited talks

  • Towards multimodal stylometry – possibilities and challenges of new approach to film and TV series analysis” for mini-conference Brave New Humanities? A Novel Perceptions Symposium on Computational Literary Studies, 8 IV 2022, online (link).
  • “Towards multimodal stylometry – possibilities and challenges of new approach to film and TV series analysis” for Natural Language Processing Seminar 2021–2022 at the Institute of Computer Science, Polish Academy of Sciences, 6 XII 2021, online.
  • “What can be measured with stylometry? On language, creativity and numbers.” for the Linguistics seminar at the University of Bielefeld, 12 X 2021, online and in person.
  • “The Voices of Doctor Who – How Stylometry Can be Useful in Revealing New Information About TV Series”, for the seminars of the Centre for Digital Humanities at the University of Groningen, 9 III 2021, online.
  • “Direct speech for multilingual corpora some problems and one possible solution”, SIG_DLS Workshop: Tool Criticism 3.0. Present, past, and future methods in Digital Literary Stylistics, ADHO Special Interest Group for Digital Literary Studies, 20 VII 2020, online.
  • “Literary Studies: Textual analysis and stylometry with WebSty”, CLARIN Café III CLARIN for Researchers, 8 VII 2020, online.
  • “Stylometry in textual analysis and beyond”, Colloquium in Digital Cultural Heritage, 22 I 2020, Köln.
  • “AI in Computational Linguistics and Humanities”, Giersch Symposium “AI for Science”, 18-22 XI 2019, Frankfurt am Main.

Peer-reviewed conferences

  • 19-21.10.2022, Byszuk, J., Kunda, B., Coping Strategies Used by Male Young Adults in Contemporary TV Series, Discourses of Fictional (Digital) TV Series.
  • 19-2.10.2022, Byszuk, J., What Makes a Captain: Quantitative Analysis of Discourses of Power across Star Trek Series, Discourses of Fictional (Digital) TV Series.
  • 14-15.09.2022, Šeļa, A., Nagy, B., Byszuk, J., Hernández-Lorenzo, L., Szemes, B., and Eder M., From stage to page: language independent bootstrap measures of distinctiveness in fictional speech, Workshop on Computational Drama Analysis: Achievements and Opportunities.
  • 25-29.07.2022, Herrmann, J. B., Byszuk, J. and Grisot, G. Using word embeddings for validation and enhancement of spatial entity lists. Digital Humanities 2022: Conference Abstracts. Tokyo: University of Tokyo, pp. 239–41, https://dh2022.dhii.asia/dh2022bookofabsts.pdf
  • 22.04.2022, Šeļa, A., Byszuk, J., Kunda, B., Hernández-Lorenzo, L., Szemes, B., Eder, M., Imagined differences: approaches to variation in fictional character voices in literary history, Closing conference of the COST Action Distant Reading for European Literary History.
  • 21.04.2022, Stanković, R., Santos, D., Brando, C., Palkó, G., Byszuk, J., Distant Reading of ELTeC text collection through Named Entities, Closing conference of the COST Action Distant Reading for European Literary History.
  • 9-12.07.2019, Hernández-Lorenzo, L. and Byszuk, J., Challenging Stylometry: The Authorship of the Baroque Play La Segunda Celestina, DH 2019, Utrecht.
  • 9-12.07.2019, Eder, M. and Byszuk, J., Feature Selection in Authorship Attribution: Ordering the Wordlist, DH 2019,  Utrecht.
  • 9-12.07.2019, Ochab, J.K., Byszuk, J., Pielström, S. and Eder, M., Identifying Similarities in Text Analysis: Hierarchical Clustering (Linkage) versus Network Clustering (Community Detection), DH 2019, Utrecht.
  • 8.06.2019, Byszuk, J., Khismatulin A., Attribution of Authorship for Medieval Persian Quasidas with Stylometry, #Right2Left Workshop, Victoria BC.
  • 28.02 – 2.03.2019, Czopek, K., Byszuk, J., Older language learner: a comparative corpus study of FL performance and learning materials, (poster), 4th CLARe Conference, Helsinki.
  • 7-9.12.2018, Byszuk, J., Król, M., Eder, M., Enhanced digital editions: retrieving POS tags from pre-digital word indexes, EADH Conference, Galway.
  • 7-9.12.2018, Byszuk, J., Who is the author? Modeling creative relations in television writing, EADH Conference, Galway.
  • 6.07.2018, Byszuk. J., Analysis of cross-lingual semantic change in professional discourse with quantitative methods, (poster), Qualico International Quantitative Linguistics Conference, Wrocław.
  • 5.07.2018, Eder M., Byszuk, J., Górski, R.L., Zipf’s law and subsets of lexis, Qualico International Quantitative Linguistics Conference, Wrocław.
  • 31.01 – 2.02.2018., Byszuk, J., Tracing Showrunner’s Impact, 7th AIUCD Conference, ITC Conference Grant (COST Action Distant Reading COST CA16204), Bari.
  • 20-22.04.2017, Byszuk, J., The Voices of Doctor Who, April Conference Fourteen, Kraków.
  • 9-1.05.2017, Byszuk, J., Jak (po angielsku) pisze polski programista?, 11. Studenckie Warsztaty Tłumaczeniowe, Kraków.

Participation in grant projects

Conducted workshops

  • “DH2024 Workshop – Computational Literary Studies: How To Do Research Responsibly”  –  a pre-conference workshop accompanying DH 2024 ADHO Conference in Washington DC, co-organized with Simone Rebora, Maciej Eder, Pablo Ruiz Fabio, J. Berenike Herrmann and Suzanne Mpouli, 5th August 2024.
  • “DIY Computational Text Analysis with R”  –  a course at the Digital Humanities Summer Institute, co-taught with Jeremi K. Ochab, 10-14 June 2024.
  • “Examining Language Variation with Stylometry”  –  a course at the  IQLA-GIAT Summer School (4th – 8th September 2023 in Padua).
  • “Drafting Standards for Stylometry”  – a pre-conference workshop accompanying DH 2023 ADHO Conference in Graz, co-organized with Patrick Juola.
  • “DH2023 Workshop – SIG-DLS Seven Years on (Program)”  –  a pre-conference workshop accompanying DH 2023 ADHO Conference in Graz, co-organized with Simone Rebora, Pablo Ruiz Fabio, J. Berenike Herrmann and Suzanne Mpouli.
  • “Finding Gold” – Stylometry part of the Digging For Gold – Knowledge Extraction From Text Summer School organized by CLS Infra in Madrid, co-taught with Artjoms Šeļa, 9-11 May 2023.
  • “Stylometry with R: Computer-Assisted Analysis of Literary Texts” – a course at the Digital Humanities Summer Institute, co-taught with Maciej Eder and Artjoms Šeļa, 6-10 June 2022.
  • “Advanced Stylometry Workshop at the ELTE Department of Digital Humanities”, with Maciej Eder and Artjoms Šeļa, 6-7 May 2022.
  • “Stylometry with R” at Distant Reading Training School: Exploring ELTeC: Use-Cases for Information Extraction and Analysis within the COST Action 16204: Distant Reading for European Literary History, with Maciej Eder and Artjoms Šeļa, 22-24 III 2022.
  • “Stylometry with R: Computer-Assisted Analysis of Literary Texts” – a course at the Digital Humanities Summer Institute, co-taught with Maciej Eder and Artjoms Šeļa, 14-18 June 2021.
  • “Stylometry with R: Computer-Assisted Analysis of Literary Texts” – a course at the Digital Humanities Summer Institute, co-taught with Maciej Eder – Victoria BC (Kanada), 10-14 June 2019.
  • “Stylometry with R”– a training session at the Distant Reading Training School 2018, Workshop 1: Methods and Tools of Distant Reading Adapted to Multiple European Languages in Galway, 5-7 December 2018.
  • “Stylometry with R”– a course at the Третья Московско-тартуская школа по цифровым гуманитарным исследованиям in Moscow, 4-7 October 2018.
  • “Stylometry with R: Computer-Assisted Analysis of Literary Texts” – a course at the Digital Humanities Summer Institute, co-taught with dr Jan Rybicki (Uniwersytet Jagielloński) – Victoria BC (Kanada), 11-15 June 2018.

Other activities

Ikona z ludzikiem do otwierania panelu kontrolnego WCAG
Aa+
Aa-
Ikona kontrastu
Ikona linku
Ikona skali szarości
Ikona zmiany na czytelne czcionki
Ikona resetu ustawień WCAG