<w>

<w> (word) represents a grammatical (not necessarily orthographic) word. [17.1. Linguistic Segment Categories 17.4.2. Lightweight Linguistic Annotation]
Module analysis
Attributes
att.global
xml:id
n
xml:lang
att.global.rendition
style
rendition
att.global.linking
corresp
copyOf
next
prev
Member of
Contained by
May contain
analysis: pc w
figures: figure
gaiji: g
linking: linkGrp seg
character data
Examples This example is adapted from the Folger Library’s Early Modern English Drama version of The Wits: a Comedy by William Davenant.

<l>
   
<w lemma="it" pos="pn" xml:id="A19883-003-a-0100">IT</w>
   
<w lemma="have" pos="vvz" xml:id="A19883-003-a-0110">hath</w>
   
<w lemma="be" pos="vvn" xml:id="A19883-003-a-0120">been</w>
   
<w lemma="say" pos="vvn" xml:id="A19883-003-a-0130">said</w>
   
<w lemma="of" pos="acp-p" xml:id="A19883-003-a-0140">of</w>
   
<w lemma="old" pos="j" xml:id="A19883-003-a-0150">old</w>
   
<pc xml:id="A19883-003-a-0160">,</pc>
   
<w lemma="that" pos="cs" xml:id="A19883-003-a-0170">that</w>
   
<w lemma="play" pos="vvz" xml:id="A19883-003-a-0180">
      
<choice>
         
<orig>Playes</orig>
         
<reg>Plays</reg>
      
</choice>
   
</w>
   
<w lemma="be" pos="vvb" xml:id="A19883-003-a-0190">are</w>
   
<w lemma="feast" pos="n2" xml:id="A19883-003-a-0200">Feasts</w>
   
<pc xml:id="A19883-003-a-0210">,</pc>
</l>
<l xml:id="A19883-e100220">
   
<w lemma="poet" pos="n2" xml:id="A19883-003-a-0220">Poets</w>
   
<w lemma="the" pos="d" xml:id="A19883-003-a-0230">the</w>
   
<w lemma="cook" pos="n2" xml:id="A19883-003-a-0240">
      
<choice>
         
<orig>Cookes</orig>
         
<reg>Cooks</reg>
      
</choice>
   
</w>
   
<pc xml:id="A19883-003-a-0250">,</pc>
   
<w lemma="and" pos="cc" xml:id="A19883-003-a-0260">and</w>
   
<w lemma="the" pos="d" xml:id="A19883-003-a-0270">the</w>
   
<w lemma="spectator" pos="n2" xml:id="A19883-003-a-0280">Spectators</w>
   
<w lemma="guest" pos="n2" xml:id="A19883-003-a-0290">Guests</w>
   
<pc xml:id="A19883-003-a-0300">,</pc>
</l>
<l xml:id="A19883-e100230">
   
<w lemma="the" pos="d" xml:id="A19883-003-a-0310">The</w>
   
<w lemma="actor" pos="n2" xml:id="A19883-003-a-0320">Actors</w>
   
<w lemma="waiter" pos="n2" xml:id="A19883-003-a-0330">Waiters</w>
   
<pc xml:id="A19883-003-a-0340">:</pc>
   
<!-- ... -->
</l>
Schematron
Words should always contain a pc

<sch:rule context="tei:w[not(@part)]">
   
<sch:assert test="descendant::tei:pc[@force='weak']">ERROR: Word elements must include a pc since the element should only be used to flag words separated across line boundaries</sch:assert>
</sch:rule>
Words should contain either an lb or an add

<sch:rule context="tei:w[not(@part)]">
   
<sch:assert test="descendant::tei:lb or descendant::tei:add">ERROR: Word elements should usually include an lb since the element is used to connect words across lines. In special cases, an add can be used instead of an lb if the last part of the word has been appended to line.</sch:assert>
</sch:rule>
Words should always contain a pc

<sch:rule context="tei:w">
   
<sch:assert test="not(descendant::tei:fw or descendant::tei:pb)">ERROR: Do not include formeworks or page beginnings in a word element; if a word is split across pages, then use the @part attribute to signal the initial and final parts of the word.</sch:assert>
</sch:rule>
Words should only contain a single line-beginning

<sch:rule context="tei:w">
   
<sch:assert test="not(count(descendant::tei:lb) gt 1)">ERROR: Words should only go across a single line. This word has multiple line beginnings within it, which means it's probably more than one word.</sch:assert>
</sch:rule>
Words should always contain a pc

<sch:rule context="tei:w">
   
<sch:let name="text" value="string-join(descendant::text(),'')"/>
   
<sch:assert test="not(matches($text,'^\s|\s$'))">ERROR:<sch:name/>should begin or end with spaces.</sch:assert>
</sch:rule>
<sch:rule context="tei:w">
   
<sch:assert test="not(text()[matches(.,'[a-zA-Z]\s[a-zA-Z]')])">ERROR: w elements should only contain a single word.</sch:assert>
</sch:rule>
<sch:rule context="tei:text[descendant::tei:w[@part]]">
   
<sch:assert test="count(descendant::tei:w[@part]) mod 2 = 0">ERROR: Odd number of split words found in this transcription.</sch:assert>
</sch:rule>
<sch:rule context="tei:w[@part = 'F']">
   
<sch:assert test="preceding-sibling::tei:w[1][@part = 'I']">ERROR: Initial word part missing following final part.</sch:assert>
</sch:rule>
<sch:rule context="tei:w[@part = 'I']">
   
<sch:assert test="following-sibling::tei:w[1][@part = 'F']">ERROR: Final word part missing preceding initial part.</sch:assert>
</sch:rule>
Source Github

<elementSpec rend="change" module="analysis" ident="w">
   
<gloss>word</gloss>
   
<desc>represents a grammatical (not necessarily orthographic) word.</desc>
   
<classes>
      
<memberOf key="att.global"/>
      
<memberOf key="model.segLike"/>
      
<memberOf key="model.linePart"/>
   
</classes>
   
<content>
      
<alternate minOccurs="0" maxOccurs="unbounded">
         
<textNode/>
         
<classRef key="model.gLike"/>
         
<elementRef key="seg"/>
         
<elementRef key="w"/>
         
<elementRef key="m"/>
         
<elementRef key="c"/>
         
<elementRef key="pc"/>
         
<classRef key="model.global"/>
         
<classRef key="model.lPart"/>
         
<classRef key="model.hiLike"/>
         
<classRef key="model.pPart.edit"/>
      
</alternate>
   
</content>
</elementSpec>