Materials from the Workshop on regional markedness in text available

The ma­te­ri­als from Re­gion­al vari­a­tion in gen­der mark­ing: a hands-on tu­to­r­i­al on ex­tract­ing data from cor­po­ra are now avail­able for down­load from These ma­te­ri­als pro­vide an in­tro­duc­tion to the process of us­ing cor­po­ra to study a lin­guis­tic (and not only lin­guis­tic) prob­lem, with in­for­ma­tion on:

  • how to find (com­pa­ra­ble) South Slav­ic cor­po­ra in the CLARIN.SI repos­i­to­ry
  • how to ex­plore cor­po­ra through the noS­ketchEngine and Kon­Text con­cor­dancers
  • how to study gen­der mark­ing look­ing at fre­quen­cies of fem­i­nine and mas­cu­line nouns de­scrib­ing oc­cu­pa­tions, and at the dis­tri­b­u­tion of fem­i­nine and mas­cu­line forms of dif­fer­ent verbs
  • how to draw con­clu­sions about gen­der bias in so­ci­ety based on cor­pus re­sults

The ma­te­ri­als were pre­pared by Mir­jana Starović and Tan­ja Samardžić as part of the on­line work­shop held on 6 and 7 No­vem­ber 2021, or­gan­ised by the Uni­ver­si­ty of Zurich — URPP “Lan­guage and Space”, the CLARIN knowl­edge cen­tre for South Slav­ic lan­guages — CLASSLA and the ReLDI cen­tre. The pro­gramme also in­clud­ed a keynote talk by Yves Scher­rer from the Uni­ver­si­ty of Helsin­ki, Dar­ja Fišer’s pre­sen­ta­tion of op­por­tu­ni­ties for stu­dent pre­sen­ta­tions at the JTDH Lan­guage Tech­nolo­gies and Dig­i­tal Hu­man­i­ties Con­fer­ence, and an In­ter­ac­tive work­shop on re­gion­al vari­a­tion in text led by Sara Košu­tar, Laris­sa Schmidt and Ley­la Fein­er.

The work­shop saw the par­tic­i­pa­tion of around 30 stu­dents and col­leagues di­vid­ed be­tween Gath­er­Town and Zoom, with live­ly and fun in­ter­ac­tive ses­sions, and some sur­pris­ing find­ings. A fol­low-up men­tor­ing ses­sion for stu­dents took place on 16 De­cem­ber 2021.

For CLASSLA ac­counts of the work­shop, see here:


Work­shop on re­gion­al marked­ness in text