<w>
<w> (word) represents a grammatical (not necessarily orthographic) word. [17.1. Linguistic Segment Categories 17.4.2. Lightweight Linguistic Annotation]
| Module | analysis |
| Attributes |
att.global
xml:id
n
xml:lang
att.global.rendition
style
rendition
att.global.linking
corresp
copyOf
next
prev
att.global.source
source
|
| Member of | |
| Contained by |
analysis: w
core: abbr add address author bibl corr date del editor emph expan foreign gloss head hi item l label measure mentioned name note num orig p pubPlace publisher q quote ref reg rs sic soCalled speaker term time title unclear
figures: cell
namesdates: addName affiliation birth death faith floruit forename genName nameLink nationality objectName occupation orgName persName placeName residence roleName settlement sex surname
textstructure: byline dateline docAuthor docDate docEdition docImprint imprimatur salute signed titlePart trailer
|
| May contain | |
| Examples | This example is adapted from the Folger Library’s Early Modern English Drama version
of The Wits: a Comedy by William Davenant.
<l> <w lemma="it" pos="pn" xml:id="A19883-003-a-0100">IT</w> <w lemma="have" pos="vvz" xml:id="A19883-003-a-0110">hath</w> <w lemma="be" pos="vvn" xml:id="A19883-003-a-0120">been</w> <w lemma="say" pos="vvn" xml:id="A19883-003-a-0130">said</w> <w lemma="of" pos="acp-p" xml:id="A19883-003-a-0140">of</w> <w lemma="old" pos="j" xml:id="A19883-003-a-0150">old</w> <pc xml:id="A19883-003-a-0160">,</pc> <w lemma="that" pos="cs" xml:id="A19883-003-a-0170">that</w> <w lemma="play" pos="vvz" xml:id="A19883-003-a-0180"> <choice> <orig>Playes</orig> <reg>Plays</reg> </choice> </w> <w lemma="be" pos="vvb" xml:id="A19883-003-a-0190">are</w> <w lemma="feast" pos="n2" xml:id="A19883-003-a-0200">Feasts</w> <pc xml:id="A19883-003-a-0210">,</pc> </l> <l xml:id="A19883-e100220"> <w lemma="poet" pos="n2" xml:id="A19883-003-a-0220">Poets</w> <w lemma="the" pos="d" xml:id="A19883-003-a-0230">the</w> <w lemma="cook" pos="n2" xml:id="A19883-003-a-0240"> <choice> <orig>Cookes</orig> <reg>Cooks</reg> </choice> </w> <pc xml:id="A19883-003-a-0250">,</pc> <w lemma="and" pos="cc" xml:id="A19883-003-a-0260">and</w> <w lemma="the" pos="d" xml:id="A19883-003-a-0270">the</w> <w lemma="spectator" pos="n2" xml:id="A19883-003-a-0280">Spectators</w> <w lemma="guest" pos="n2" xml:id="A19883-003-a-0290">Guests</w> <pc xml:id="A19883-003-a-0300">,</pc> </l> <l xml:id="A19883-e100230"> <w lemma="the" pos="d" xml:id="A19883-003-a-0310">The</w> <w lemma="actor" pos="n2" xml:id="A19883-003-a-0320">Actors</w> <w lemma="waiter" pos="n2" xml:id="A19883-003-a-0330">Waiters</w> <pc xml:id="A19883-003-a-0340">:</pc> <!-- ... --> </l> |
| Schematron |
Words should always contain a pc
<sch:rule context="tei:w[not(@part)]"> <sch:assert test="descendant::tei:pc[@force='weak']">ERROR: Word elements must include a pc since the element should only be used to flag words separated across line boundaries</sch:assert> </sch:rule> Words should contain either an lb or an add
<sch:rule context="tei:w[not(@part)]"> <sch:assert test="descendant::tei:lb or descendant::tei:add">ERROR: Word elements should usually include an lb since the element is used to connect words across lines. In special cases, an add can be used instead of an lb if the last part of the word has been appended to line.</sch:assert> </sch:rule> Words should always contain a pc
<sch:rule context="tei:w"> <sch:assert test="not(descendant::tei:fw or descendant::tei:pb)">ERROR: Do not include formeworks or page beginnings in a word element; if a word is split across pages, then use the @part attribute to signal the initial and final parts of the word.</sch:assert> </sch:rule> Words should only contain a single line-beginning
<sch:rule context="tei:w"> <sch:assert test="not(count(descendant::tei:lb) gt 1)">ERROR: Words should only go across a single line. This word has multiple line beginnings within it, which means it's probably more than one word.</sch:assert> </sch:rule> Words should always contain a pc
<sch:rule context="tei:w"> <sch:let name="text" value="string-join(descendant::text(),'')"/> <sch:assert test="not(matches($text,'^\s|\s$'))">ERROR:<sch:name/>should begin or end with spaces.</sch:assert> </sch:rule> <sch:rule context="tei:w"> <sch:assert test="not(text()[matches(.,'[a-zA-Z]\s[a-zA-Z]')])">ERROR: w elements should only contain a single word.</sch:assert> </sch:rule> <sch:rule context="tei:text[descendant::tei:w[@part]]"> <sch:assert test="count(descendant::tei:w[@part]) mod 2 = 0">ERROR: Odd number of split words found in this transcription.</sch:assert> </sch:rule> <sch:rule context="tei:w[@part = 'F']"> <sch:assert test="preceding-sibling::tei:w[1][@part = 'I']">ERROR: Initial word part missing following final part.</sch:assert> </sch:rule> <sch:rule context="tei:w[@part = 'I']"> <sch:assert test="following-sibling::tei:w[1][@part = 'F']">ERROR: Final word part missing preceding initial part.</sch:assert> </sch:rule> |
| Source | Github
<elementSpec rend="change" module="analysis" ident="w"> <gloss>word</gloss> <desc>represents a grammatical (not necessarily orthographic) word.</desc> <classes> <memberOf key="att.global"/> <memberOf key="model.segLike"/> <memberOf key="model.linePart"/> </classes> <content> <alternate minOccurs="0" maxOccurs="unbounded"> <textNode/> <classRef key="model.gLike"/> <elementRef key="seg"/> <elementRef key="w"/> <elementRef key="m"/> <elementRef key="c"/> <elementRef key="pc"/> <classRef key="model.global"/> <classRef key="model.lPart"/> <classRef key="model.hiLike"/> <classRef key="model.pPart.edit"/> </alternate> </content> </elementSpec> |