ISO INTERNATIONAL STANDARD 24611 First edition 2012-11-01 Language resource management - Morpho-syntactic annotation framework (MAF) Gestion des ressources langagieres - Cadre d'annotation morphosyntaxique (MAF) Reference number ISO 24611:2012(E) @ISO 2012 by IHS under lic Not for Resale ISO 24611:2012(E) COPYRIGHTPROTECTEDDOCUMENT @ISO2012 All rights reserved. Unless otherwise specified, no part of this publication may be reproduced or utilized in any form or by any means, electronic or mechanical, including photocopying and microfilm, without permission in writing from either isO at the address below or IsO's memberbody in the country of the requester. ISO copyright office Case postale 56. CH-1211 Geneva 20 Tel. + 4122749 01 11 Fax + 41 22 749 09 47 E-mail [email protected] Web www.iso.org Published in Switzerland @ ISO 2012 - All rights reserved py IHS unde permitted without license from IHS Not for Resale ISO 24611:2012(E) Contents Page Foreword Introduction. 1 Scope 2 Normative references.. 3 Terms and definitions. 4 The MAF meta-model. 4.1 Overview... 4.2 MAF Meta-model 5 Segmenting with tokens .. 5.1 .6 5.2 Formaldescription:<token> 5.3 Embedding notation.... 5.4 Alternate representation for TEl based documents .8 5.5 9 5.6 Informative attributes.. 5.7 Completing the inline token notation ... 10 5.7.1 5.7.2 Overlapping tokens .. 11 6 Word-forms as linguistic units... 11 6.1 Formal description: <wordForm> 12 6.2 Token attachment....... 12 6.2.1 One token; one word-form ... 6.2.2 Several contiguous tokens; one word-form ... 12 6.2.3 Several discontinuous tokens; one word-form.... 13 6.2.4 Zero token,; one word.form.... 13 6.2.5 One token; several word-forms ... 14 6.3 Referring to lexical entries ... 14 6.4 Compoundword-forms. 15 6.5 Identification of word.forms within a TEl.comp.liant document...... 7 Morpho-syntactic content...... 7.1 General.. ..18 7.2 Using feature structures. 18 7.3 Compact morpho-syntactic tags 18 7.4 FSRlibraries.. 7.5 Designing tagsets.. 20 7.6 Formal description: <tagset> 22 8 Handling ambiguities .. 22 8.1 Word-form content ambiguities. 8.2 LexicalAmbiguities... 23 8.3 Structural ambiguities... 23 8.3.1 Structural ambiguities with word-forms 23 8.3.2 Structural ambiguities with tokens..... 8.4 Simplified structuring variants .. .24 8.4.1 Non-ambiguous linear representation, 24 8.4.2 Mixed linear and lattice representation. .25 8.5 Expandingthesimplifiedvariants. 26 8.5.1 Separating tokens and word-forms. 26 8.5.2 Wrapping into local lattices... 26 Copyrght International OrganizaionStandardizalionghts reserved ili ted without license from IHS Not for Resale

.pdf文档 ISO 24611 2012 Language resource management — Morpho-syntactic annotation framewo

文档预览
中文文档 68 页 50 下载 1000 浏览 0 评论 309 收藏 3.0分
温馨提示:本文档共68页,可预览 3 页,如浏览全部内容或当前文档出现乱码,可开通会员下载原始文档
ISO 24611 2012 Language resource management — Morpho-syntactic annotation framewo 第 1 页 ISO 24611 2012 Language resource management — Morpho-syntactic annotation framewo 第 2 页 ISO 24611 2012 Language resource management — Morpho-syntactic annotation framewo 第 3 页
下载文档到电脑,方便使用
本文档由 人生无常 于 2024-08-31 13:18:52上传分享
友情链接
站内资源均来自网友分享或网络收集整理,若无意中侵犯到您的权利,敬请联系我们微信(点击查看客服),我们将及时删除相关资源。