INTERNATIONAL STANDARD ISO 24611
First edition 2012-11-01

Language resource management - Morpho-syntactic annotation framework (MAF)
Gestion des ressources langagières - Cadre d'annotation morphosyntaxique (MAF)

Reference number ISO 24611:2012(E)
© ISO 2012

COPYRIGHT PROTECTED DOCUMENT

© ISO 2012
All rights reserved. Unless otherwise specified, no part of this publication may be reproduced or utilized in any form or by any means, electronic or mechanical, including photocopying and microfilm, without permission in writing from either ISO at the address below or ISO's member body in the country of the requester.

ISO copyright office
Case postale 56, CH-1211 Geneva 20
Tel. + 41 22 749 01 11
Fax + 41 22 749 09 47
E-mail [email protected]
Web www.iso.org
Published in Switzerland

Contents

Foreword
Introduction
1 Scope
2 Normative references
3 Terms and definitions
4 The MAF meta-model
4.1 Overview
4.2 MAF meta-model
5 Segmenting with tokens
5.1
5.2 Formal description: <token>
5.3 Embedding notation
5.4 Alternate representation for TEI-based documents
5.5
5.6 Informative attributes
5.7 Completing the inline token notation
5.7.1
5.7.2 Overlapping tokens
6 Word-forms as linguistic units
6.1 Formal description: <wordForm>
6.2 Token attachment
6.2.1 One token; one word-form
6.2.2 Several contiguous tokens; one word-form
6.2.3 Several discontinuous tokens; one word-form
6.2.4 Zero token; one word-form
6.2.5 One token; several word-forms
6.3 Referring to lexical entries
6.4 Compound word-forms
6.5 Identification of word-forms within a TEI-compliant document
7 Morpho-syntactic content
7.1 General
7.2 Using feature structures
7.3 Compact morpho-syntactic tags
7.4 FSR libraries
7.5 Designing tagsets
7.6 Formal description: <tagset>
8 Handling ambiguities
8.1 Word-form content ambiguities
8.2 Lexical ambiguities
8.3 Structural ambiguities
8.3.1 Structural ambiguities with word-forms
8.3.2 Structural ambiguities with tokens
8.4 Simplified structuring variants
8.4.1 Non-ambiguous linear representation
8.4.2 Mixed linear and lattice representation
8.5 Expanding the simplified variants
8.5.1 Separating tokens and word-forms
8.5.2 Wrapping into local lattices