<w>

<w> (word) represents a grammatical (not necessarily orthographic) word. [17.1. Linguistic Segment Categories 17.4.2. Lightweight Linguistic Annotation]

Module	analysis
Attributes	att.global xml:id n xml:lang lim.att.global.fragmentable part att.global.rendition style rendition att.global.linking corresp copyOf next prev att.global.source source
Member of	model.linePart model.segLike
Contained by	analysis: w core: abbr add address author bibl corr date del editor emph expan foreign gloss head hi item l label measure mentioned name note num orig p pubPlace publisher q quote ref reg rs sic soCalled speaker term time title unclear figures: cell header: change extent handNote lim: compliment duration enclosure request linking: ab seg msdescription: origin summary surrogates namesdates: addName affiliation birth death faith floruit forename genName nameLink nationality objectName occupation orgName persName placeName residence roleName settlement sex surname textstructure: byline dateline docAuthor docDate docEdition docImprint imprimatur salute signed titlePart trailer transcr: damage fw supplied surplus zone
May contain	analysis: pc w core: abbr add choice del ellipsis gap hi lb milestone note noteGrp orig pb q reg sic unclear figures: figure gaiji: g lim: noteMarker rule linking: linkGrp seg transcr: am damage fw handShift metamark subst supplied surplus character data
Examples	This example is adapted from the Folger Library’s Early Modern English Drama version of The Wits: a Comedy by William Davenant. <l> <w lemma="it" pos="pn" xml:id="A19883-003-a-0100">IT</w> <w lemma="have" pos="vvz" xml:id="A19883-003-a-0110">hath</w> <w lemma="be" pos="vvn" xml:id="A19883-003-a-0120">been</w> <w lemma="say" pos="vvn" xml:id="A19883-003-a-0130">said</w> <w lemma="of" pos="acp-p" xml:id="A19883-003-a-0140">of</w> <w lemma="old" pos="j" xml:id="A19883-003-a-0150">old</w> <pc xml:id="A19883-003-a-0160">,</pc> <w lemma="that" pos="cs" xml:id="A19883-003-a-0170">that</w> <w lemma="play" pos="vvz" xml:id="A19883-003-a-0180"> <choice> <orig>Playes</orig> <reg>Plays</reg> </choice> </w> <w lemma="be" pos="vvb" xml:id="A19883-003-a-0190">are</w> <w lemma="feast" pos="n2" xml:id="A19883-003-a-0200">Feasts</w> <pc xml:id="A19883-003-a-0210">,</pc> </l> <l xml:id="A19883-e100220"> <w lemma="poet" pos="n2" xml:id="A19883-003-a-0220">Poets</w> <w lemma="the" pos="d" xml:id="A19883-003-a-0230">the</w> <w lemma="cook" pos="n2" xml:id="A19883-003-a-0240"> <choice> <orig>Cookes</orig> <reg>Cooks</reg> </choice> </w> <pc xml:id="A19883-003-a-0250">,</pc> <w lemma="and" pos="cc" xml:id="A19883-003-a-0260">and</w> <w lemma="the" pos="d" xml:id="A19883-003-a-0270">the</w> <w lemma="spectator" pos="n2" xml:id="A19883-003-a-0280">Spectators</w> <w lemma="guest" pos="n2" xml:id="A19883-003-a-0290">Guests</w> <pc xml:id="A19883-003-a-0300">,</pc> </l> <l xml:id="A19883-e100230"> <w lemma="the" pos="d" xml:id="A19883-003-a-0310">The</w> <w lemma="actor" pos="n2" xml:id="A19883-003-a-0320">Actors</w> <w lemma="waiter" pos="n2" xml:id="A19883-003-a-0330">Waiters</w> <pc xml:id="A19883-003-a-0340">:</pc> <!-- ... --> </l>
Schematron	Words should always contain a pc <sch:rule context="tei:w[not(@part)]"> <sch:assert test="descendant::tei:pc[@force='weak']">ERROR: Word elements must include a pc since the element should only be used to flag words separated across line boundaries</sch:assert> </sch:rule> Words should contain either an lb or an add <sch:rule context="tei:w[not(@part)]"> <sch:assert test="descendant::tei:lb or descendant::tei:add">ERROR: Word elements should usually include an lb since the element is used to connect words across lines. In special cases, an add can be used instead of an lb if the last part of the word has been appended to line.</sch:assert> </sch:rule> Words should always contain a pc <sch:rule context="tei:w"> <sch:assert test="not(descendant::tei:fw or descendant::tei:pb)">ERROR: Do not include formeworks or page beginnings in a word element; if a word is split across pages, then use the @part attribute to signal the initial and final parts of the word.</sch:assert> </sch:rule> Words should only contain a single line-beginning <sch:rule context="tei:w"> <sch:assert test="not(count(descendant::tei:lb) gt 1)">ERROR: Words should only go across a single line. This word has multiple line beginnings within it, which means it's probably more than one word.</sch:assert> </sch:rule> Words should always contain a pc <sch:rule context="tei:w"> <sch:let name="text" value="string-join(descendant::text(),'')"/> <sch:assert test="not(matches($text,'^\s\|\s$'))">ERROR:<sch:name/>should begin or end with spaces.</sch:assert> </sch:rule> <sch:rule context="tei:w"> <sch:assert test="not(text()[matches(.,'[a-zA-Z]\s[a-zA-Z]')])">ERROR: w elements should only contain a single word.</sch:assert> </sch:rule> <sch:rule context="tei:text[descendant::tei:w[@part]]"> <sch:assert test="count(descendant::tei:w[@part]) mod 2 = 0">ERROR: Odd number of split words found in this transcription.</sch:assert> </sch:rule> <sch:rule context="tei:w[@part = 'F']"> <sch:assert test="preceding-sibling::tei:w[1][@part = 'I']">ERROR: Initial word part missing following final part.</sch:assert> </sch:rule> <sch:rule context="tei:w[@part = 'I']"> <sch:assert test="following-sibling::tei:w[1][@part = 'F']">ERROR: Final word part missing preceding initial part.</sch:assert> </sch:rule>
Source	Github <elementSpec rend="change" module="analysis" ident="w"> <gloss>word</gloss> <desc>represents a grammatical (not necessarily orthographic) word.</desc> <classes> <memberOf key="att.global"/> <memberOf key="model.segLike"/> <memberOf key="model.linePart"/> </classes> <content> <alternate minOccurs="0" maxOccurs="unbounded"> <textNode/> <classRef key="model.gLike"/> <elementRef key="seg"/> <elementRef key="w"/> <elementRef key="m"/> <elementRef key="c"/> <elementRef key="pc"/> <classRef key="model.global"/> <classRef key="model.lPart"/> <classRef key="model.hiLike"/> <classRef key="model.pPart.edit"/> </alternate> </content> </elementSpec>