default search action
6th LREC 2008: Marrakech, Morocco
- Proceedings of the International Conference on Language Resources and Evaluation, LREC 2008, 26 May - 1 June 2008, Marrakech, Morocco. European Language Resources Association 2008
Session O1 - Information Extraction and Question Answering
- Kathrin Eichler, Holmer Hemsen, Günter Neumann:
Unsupervised Relation Extraction From Web Documents. - Muath Alzghool, Diana Inkpen:
Combining Multiple Models for Speech Information Retrieval. - Chun-Yuan Teng, Hsin-Hsi Chen:
Event Detection and Summarization in Weblogs with Temporal Collocations. - Cvetana Krstev, Ranka Stankovic, Dusko Vitas, Ivan Obradovic:
The Usage of Various Lexical Resources and Tools to Improve the Performance of Web Search Engines.
Session O2 - LRs: Infrastructures, Projects, Centers
- Steven Bird, Robert Dale, Bonnie J. Dorr, Bryan R. Gibson, Mark Thomas Joseph, Min-Yen Kan, Dongwon Lee, Brett Powley, Dragomir R. Radev, Yee Fan Tan:
The ACL Anthology Reference Corpus: A Reference Dataset for Bibliographic Research in Computational Linguistics. - Marian Reed, Denise DiPersio, Christopher Cieri:
The Linguistic Data Consortium Member Survey: Purpose, Execution and Results. - Dieter Van Uytvanck, Alex Dukers, Jacquelijn Ringersma, Paul Trilsbeek:
Language-Sites: Accessing and Presenting Language Resources via Geographic Information Systems. - Tamás Váradi, Steven Krauwer, Peter Wittenburg, Martin Wynne, Kimmo Koskenniemi:
CLARIN: Common Language Resources and Technology Infrastructure.
Session O3 - Corpus, Lexicon and Evaluation
- Jeroen Geertzen, Volha Petukhova, Harry Bunt:
Evaluating Dialogue Act Tagging with Naive and Expert Annotators. - Drahomíra "johanka" Spoustová, Pavel Pecina, Jan Hajic, Miroslav Spousta:
Validating the Quality of Full Morphological Annotation. - Kremena Ivanova, Ulrich Heid, Sabine Schulte im Walde, Adam Kilgarriff, Jan Pomikálek:
Evaluating a German Sketch Grammar: A Case Study on Noun Phrase Case. - Mark McConville, Myroslava O. Dzikovska:
Evaluating Complement-Modifier Distinctions in a Semantically Annotated Corpus.
Session O4 - Multiparty and non-Verbal Communication
- Petra-Maria Strauß, Holger Hoffmann, Wolfgang Minker, Heiko Neumann, Günther Palm, Stefan Scherer, Harald C. Traue, Ulrich Weidenbacher:
The PIT Corpus of German Multi-Party Dialogues. - Martine Adda-Decker, Claude Barras, Gilles Adda, Patrick Paroubek, Philippe Boula de Mareüil, Benoit Habert:
Annotation and analysis of overlapping speech in political interviews. - Nicolas Moreau, Djamel Mostefa, Rainer Stiefelhagen, Susanne Burger, Khalid Choukri:
Data Collection for the CHIL CLEAR 2007 Evaluation Campaign. - Susanne Burger, Kornel Laskowski, Matthias Wölfel:
A Comparative Cross-Domain Study of the Occurrence of Laughter in Meeting and Seminar Corpora.
Session O5 - Spatio-Temporal Annotation
- Inderjeet Mani, Janet Hitzeman, Justin Richer, Dave Harris, Rob Quimby, Ben Wellner:
SpatialML: Annotation Scheme, Corpora, and Tools. - Steven Bethard, William J. Corvey, Sara Klingenstein, James H. Martin:
Building a Corpus of Temporal-Causal Structure. - Alessandra Zarcone, Alessandro Lenci:
Computational Models for Event Type Classification in Context. - Corina Forascu:
GMT to +2 or how can TimeML be used in Romanian. - Nianwen Xue, Hua Zhong, Kai-Yun Chen:
Annotating "tense" in a Tense-less Language.
Session O6 - Syntax and Parsing
- Barbara Plank, Khalil Sima'an:
Subdomain Sensitive Statistical Parsing using Raw Corpora. - Laura Kallmeyer, Timm Lichte, Wolfgang Maier, Yannick Parmentier, Johannes Dellert:
Developing a TT-MCTAG for German with an RCG-based Parser. - Peter Adolphs, Stephan Oepen, Ulrich Callmeier, Berthold Crysmann, Dan Flickinger, Bernd Kiefer:
Some Fine Points of Hybrid Natural Language Parsing. - Jeremy Nicholson, Valia Kordoni, Yi Zhang, Timothy Baldwin, Rebecca Dridan:
Evaluating and Extending the Coverage of HPSG Grammars: A Case Study for German. - Yi Zhang, Valia Kordoni:
Robust Parsing with a Large HPSG Grammar.
Session O7 - Document Classification
- Jahna Otterbacher, Dragomir R. Radev:
Modeling Document Dynamics: an Evolutionary Approach. - Dominic Widdows, Kathleen Ferraro:
Semantic Vectors: a Scalable Open Source Package and Online Technology Management Application. - Magnus Rosell, Sumithra Velupillai:
Revealing Relations between Open and Closed Answers in Questionnaires through Text Clustering Evaluation. - Kim Luyckx, Walter Daelemans:
Personae: a Corpus for Author and Personality Prediction from Text. - Leanne Spracklin, Diana Inkpen, Amiya Nayak:
Using the Complexity of the Distribution of Lexical Elements as a Feature in Authorship Attribution.
Session O8 - Multimodal Annotation Tools
- Thomas Schmidt, Susan Duncan, Oliver Ehmer, Jeffrey Hoyt, Michael Kipp, Dan Loehr, Magnus Magnusson, R. Travis Rose, Han Sloetjes:
An Exchange Format for Multimodal Annotations. - Laura Stoia, Darla Magdalena Shockley, Donna K. Byron, Eric Fosler-Lussier:
SCARE: a Situated Corpus with Annotated Referring Expressions. - Han Sloetjes, Peter Wittenburg:
Annotation by Category: ELAN and ISO DCR. - Hennie Brugman, Véronique Malaisé, Laura Hollink:
A Common Multimedia Annotation Framework for Cross Linking Cultural Heritage Digital Collections. - Philippe Blache, Roxane Bertrand, Gaëlle Ferré:
Creating and Exploiting Multimodal Annotated Corpora.
Session O9 - Lexicon, Corpus and Semantics
- Annie Zaenen, Daniel G. Bobrow, Cleo Condoravdi:
The Encoding of lexical implications in VerbNet Predicates of change of locations. - Aljoscha Burchardt, Marco Pennacchiotti:
FATE: a FrameNet-Annotated Corpus for Textual Entailment. - Stephen A. Boxwell, Michael White:
Projecting Propbank Roles onto the CCGbank. - Piek Vossen, Isa Maks, Roxane Segers, Hennie VanderVliet:
Integrating Lexical Units, Synsets and Ontology in the Cornetto Database. - Javier Álvez, Jordi Atserias, Jordi Carrera, Salvador Climent, Egoitz Laparra, Antoni Oliver, German Rigau:
Complete and Consistent Annotation of WordNet using the Top Concept Ontology.
Session O10 - Multimodal and Speech Data over the Web
- Adrian Popescu, Gregory Grefenstette:
A Conceptual Approach to Web Image Retrieval. - Gwénolé Lecorvé, Guillaume Gravier, Pascale Sébillot:
On the Use of Web Resources and Natural Language Processing Techniques to Improve Automatic Speech Recognition Systems. - Stanislas Oger, Georges Linarès, Frédéric Béchet:
Local Methods for On-Demand Out-of-Vocabulary Word Retrieval. - Marc Kemps-Snijders, Alexander Klassmann, Claus Zinn, Peter Berck, Albert Russel, Peter Wittenburg:
Exploring and Enriching a Language Resource Archive via the Web. - Florian Schiel, Hannes Mögele:
Talking and Looking: the SmartWeb Multimodal Interaction Corpus.
Session O11 - Coreference and Discourse
- Erhard W. Hinrichs, Monica Lau:
In Contrast - A Complex Discourse Connective. - Georg Rehm, Marina Santini, Alexander Mehler, Pavel Braslavski, Rüdiger Gleim, Andrea Stubbe, Svetlana Symonenko, Mirko Tavosanis, Vedrana Vidulin:
Towards a Reference Corpus of Web Genres for the Evaluation of Genre Identification Systems. - Olga Uryupina:
Error Analysis for Learning-based Coreference Resolution. - Lucie Mladová, Sárka Zikánová, Eva Hajicová:
From Sentence to Discourse: Building an Annotation Scheme for Discourse Based on Prague Dependency Treebank. - David Day, Janet Hitzeman, Michael L. Wick, Keith Crouch, Massimo Poesio:
A Corpus for Cross-Document Co-reference.
Session O12 - Named Entity Recognition
- Antonio Toral, Rafael Muñoz, Monica Monachini:
Named Entity WordNet. - Cristina Mota, Ralph Grishman:
Is this NE tagger getting old? - Benjamin Farber, Dayne Freitag, Nizar Habash, Owen Rambow:
Improving NER in Arabic Using a Morphological Tagger. - Stephan Busemann, Yajing Zhang:
Identifying Foreign Person Names in Chinese Text. - Marius Pasca:
Low-Complexity Heuristics for Deriving Fine-Grained Classes of Named Entities from Web Textual Data.
Session O13 - Parallel and Multilingual Resources
- Jinji Li, Dong-Il Kim, Jong-Hyeok Lee:
Annotation Guidelines for Chinese-Korean Word Alignment. - Ondrej Bojar, Miroslav Janícek, Zdenek Zabokrtský, Pavel Ceska, Peter Bena:
CzEng 0.7: Parallel Corpus with Community-Supplied Translations. - Jonathan Clark, Robert E. Frederking, Lori S. Levin:
Toward Active Learning in Data Selection: Automatic Discovery of Language Features During Elicitation. - Michael Mohler, Rada Mihalcea:
Babylon Parallel Text Builder: Gathering Parallel Texts for Low-Density Languages.
Session O14 - Evaluation Tools and Methodologies
- Cong-Phap Huynh, Christian Boitet, Hervé Blanchon:
SECTra_w.1: an Online Collaborative System for Evaluating, Post-editing and Presenting MT Translation Corpora. - Mark Arehart, Chris Wolf, Keith J. Miller:
Adjudicator Agreement and System Rankings for Person Name Search. - Paulo C. F. de Oliveira, Edson Wilson Torrens, Alexandre Cidral, Sidney Schossland, Evandro Bittencourt:
Evaluating Summaries Automatically - A system Proposal. - Thierry Poibeau, Cédric Messiant:
Do we Still Need Gold Standards for Evaluation?
Session O15 - LRs: Large Programs, Policies, Strategies
- Peter Spyns, Elisabeth D'Halleweyn, Catia Cucchiarini:
The Dutch-Flemish Comprehensive Approach to HLT Stimulation and Innovation: STEVIN, HLT Agency and beyond. - Christopher Cieri, Mark Liberman:
15 Years of Language Resource Creation and Sharing: a Progress Report on LDC Activities. - Anil Kumar Singh, Kiran Pala, Harshit Surana:
Estimating the Resource Adaption Cost from a Resource Rich Language to a Similar Resource Poor Language. - Valérie Mapelli, Victoria Arranz, Hélène Mazo, Khalid Choukri:
Latest Developments in ELRA's Services. - Carol Peters, Martin Braschler, Giorgio Maria Di Nunzio, Nicola Ferro, Julio Gonzalo, Mark Sanderson:
From Research to Application in Multilingual Information Access: the Contribution of Evaluation.
Session O16 - Biomedical Resources
- Scott Piao, John McNaught, Sophia Ananiadou:
Clustering Related Terms with Definitions. - Ngan L. T. Nguyen, Jin-Dong Kim, Jun'ichi Tsujii:
Challenges in Pronoun Resolution System for Biomedical Text. - Barry Haddow, Beatrice Alex:
Exploiting Multiply Annotated Corpora in Biomedical Information Extraction Tasks. - Yuka Tateisi, Yusuke Miyao, Kenji Sagae, Jun'ichi Tsujii:
GENIA-GR: a Grammatical Relation Corpus for Parser Evaluation in the Biomedical Domain. - Xinglong Wang, Claire Grover:
Learning the Species of Biomedical Named Entities from Annotated Corpora.
Session O17 - Semantics in Lexicons and Corpora
- Tony Veale, Yanfen Hao:
Acquiring Naturalistic Concept Descriptions from the Web. - Ulrich Heid, Marion Weller:
Tools for Collocation Extraction: Preferences for Active vs. Passive. - Francis Bond, Hitoshi Isahara, Kyoko Kanzaki, Kiyotaka Uchimoto:
Boot-Strapping a WordNet Using Multiple Existing WordNets. - Bartosz Broda, Magdalena Derwojedowa, Maciej Piasecki, Stan Szpakowicz:
Corpus-based Semantic Relatedness for the Construction of Polish WordNet. - Rafiya Begum, Samar Husain, Lakshmi Bai, Dipti Misra Sharma:
Developing Verb Frames for Hindi.
Session O18 - Affect and Emotion in Speech
- Katherine Forbes-Riley, Diane J. Litman, Scott Silliman, Amruta Purandare:
Uncertainty Corpus: Resource to Study User Affect in Complex Spoken Dialogue Systems. - Milan Gnjatovic, Dietmar F. Rösner:
On the Role of the NIMITEK Corpus in Developing an Emotion Adaptive Spoken Dialogue System. - Stefan Scherer, Hansjörg Hofmann, Malte Lampmann, Martin Pfeil, Steffen Rhinow, Friedhelm Schwenker, Günther Palm:
Emotion Recognition from Speech: Stress Experiment. - Laure Charonnat, Gaëlle Vidal, Olivier Boëffard:
Automatic Phone Segmentation of Expressive Speech. - Márk Fék, Nicolas Audibert, János Szabó, Albert Rilliard, Géza Németh, Véronique Aubergé:
Multimodal Spontaneous Expressive Speech Corpus for Hungarian.
Session O19 - Opinion Mining and Summarization
- Wei-Hao Lin, Alexander G. Hauptmann:
Vox Populi Annotation: Measuring Intensity of Ideological Perspectives by Aggregating Group Judgments. - Carmen Banea, Rada Mihalcea, Janyce Wiebe:
A Bootstrapping Method for Building Subjectivity Lexicons for Languages with Scarce Resources. - Josef Ruppenhofer, Swapna Somasundaran, Janyce Wiebe:
Finding the Sources and Targets of Subjective Expressions. - Veselin Stoyanov, Claire Cardie:
Annotating Topics of Opinions. - Zhuli Xie, Barbara Di Eugenio, Peter C. Nelson:
From Extracting to Abstracting: Generating Quasi-abstractive Summaries.
Session O20 - Coreference and Discourse
- Jette Viethen, Robert Dale, Emiel Krahmer, Mariët Theune, Pascal Touset:
Controlling Redundancy in Referring Expressions. - Massimo Poesio, Ron Artstein:
Anaphoric Annotation in the ARRAU Corpus. - Mark-Christoph Müller, Margot Mieskes, Michael Strube:
Knowledge Sources for Bridging Resolution in Multi-Party Dialog. - Rashmi Prasad, Nikhil Dinesh, Alan Lee, Eleni Miltsakaki, Livio Robaldo, Aravind K. Joshi, Bonnie L. Webber:
The Penn Discourse TreeBank 2.0. - Iris Hendrickx, Gosse Bouma, Frederik Coppens, Walter Daelemans, Véronique Hoste, Geert Kloosterman, Anne-Marie Mineur, Joeri Van Der Vloet, Jean-Luc Verschelde:
A Coreference Corpus and Resolution System for Dutch.
Session O21 - Semantic Resources and Acquisition
- Kirk Baker, Chris Brew:
Statistical Identification of English Loanwords in Korean Using Automatically Generated Training Data. - Diana Trandabat, Maria Husarciuc:
Romanian Semantic Role Resource. - Alessandro Lenci, Barbara McGillivray, Simonetta Montemagni, Vito Pirrelli:
Unsupervised Acquisition of Verb Subcategorization Frames from Shallow-Parsed Corpora. - Daisuke Kawahara, Kiyotaka Uchimoto:
A Method for Automatically Constructing Case Frames for English. - Núria Bel, Sergio Espeja, Montserrat Marimon:
Automatic Acquisition for low frequency lexical items.
Session O22 - Speaker and Dialect Identification
- Doroteo T. Toledano, Daniel Hernández López, Cristina Esteve-Elizalde, Julian Fiérrez, Javier Ortega-Garcia, Daniel Ramos, Joaquin Gonzalez-Rodriguez:
BioSec Multimodal Biometric Database in Text-Dependent Speaker Recognition. - Iker Luengo, Eva Navas, Iñaki Sainz, Ibon Saratxaga, Jon Sánchez, Igor Odriozola, Inma Hernáez:
Text Independent Speaker Identification in Multilingual Environments. - Udhyakumar Nallasamy, Alan W. Black, Tanja Schultz, Robert E. Frederking:
NineOneOne: Recognizing and Classifying Speech for Handling Minority Language Emergency Calls. - Christopher Cieri, Stephanie M. Strassel, Meghan Lammie Glenn, Reva Schwartz, Wade Shen, Joseph P. Campbell:
Bridging the Gap between Linguists and Technology Developers: Large-Scale, Sociolinguistic Annotation for Dialect and Speaker Recognition. - Linda Brandschain, Christopher Cieri, David Graff, Abby Neely, Kevin Walker:
Speaker Recognition: Building the Mixer 4 and 5 Corpora.
Session O23 - Corpus Annotation and Classification
- Nancy Ide, Collin F. Baker, Christiane Fellbaum, Charles J. Fillmore, Rebecca J. Passonneau:
MASC: the Manually Annotated Sub-Corpus of American English. - Chu-Ren Huang, Lung-Hao Lee, Jia-Fei Hong, Weiguang Qu, Shiwen Yu:
Quality Assurance of Automatic Annotation of Very Large Corpora: a Study based on heterogeneous Tagging System. - Claire Cardie, Cynthia Farina, Matt Rawding, Adil Aijaz:
An eRulemaking Corpus: Identifying Substantive Issues in Public Comments. - Branimir Boguraev, Mary S. Neff:
Navigating through Dense Annotation Spaces. - David Guthrie, Louise Guthrie, Yorick Wilks:
An Unsupervised Probabilistic Approach for the Detection of Outliers in Corpora.
Session O24 - Machine Translation and Multilinguality
- Michael Carl:
Using Log-linear Models for Tuning Machine Translation Output. - Bogdan Babych, Serge Sharoff, Anthony Hartley:
Generalising Lexical Translation Strategies for MT Using Comparable Corpora. - Masaki Itagaki, Takako Aikawa:
Post-MT Term Swapper: Supplementing a Statistical Machine Translation System with a User Dictionary. - Germán Sanchis-Trilles, Joan-Andreu Sánchez:
Using Parsed Corpora for Estimating Stochastic Inversion Transduction Grammars. - Mark Fishel, Heiki-Jaan Kaalep:
Experiments on Processing Overlapping Parallel Corpora.
Session O25 - Evaluation
- Jennifer Foster, Josef van Genabith:
Parser Evaluation and the BNC: Evaluating 4 constituency parsers with 3 metrics. - Patrick Paroubek, Isabelle Robba, Anne Vilnat, Christelle Ayache:
EASY, Evaluation of Parsers of French: what are the Results? - Xavier Tannier, Philippe Muller:
Evaluation Metrics for Automatic Temporal Annotation of Texts. - Lena Grothe, Ernesto William De Luca, Andreas Nürnberger:
A Comparative Study on Language Identification Methods. - Éric Villemonte de la Clergerie, Olivier Hamon, Djamel Mostefa, Christelle Ayache, Patrick Paroubek, Anne Vilnat:
PASSAGE: from French Parser Evaluation to Large Sized Treebank.
Session O26 - Broadcast News Processing
- Jáchym Kolár, Jan Svec:
Structural Metadata Annotation of Speech Corpora: Comparing Broadcast News and Broadcast Conversations. - Markpong Jongtaveesataporn, Chai Wutiwiwatchai, Koji Iwano, Sadaoki Furui:
Thai Broadcast News Corpus Construction and Evaluation. - Ingunn Amdal, Ole Morten Strand, Jørn Almberg, Torbjørn Svendsen:
RUNDKAST: an Annotated Norwegian Broadcast News Speech Corpus. - Sopheap Seng, Sethserey Sam, Laurent Besacier, Brigitte Bigi, Eric Castelli:
First Broadcast News Transcription System for Khmer Language. - Chomicha Bendahman, Meghan Lammie Glenn, Djamel Mostefa, Niklas Paulsson, Stephanie M. Strassel:
Quick Rich Transcriptions of Arabic Broadcast News Speech Data.
Session O27 - Ontologies
- Dennis Spohr:
A General Methodology for Mapping EuroWordNets to the Suggested Upper Merged Ontology. - Satoshi Sekine:
Extended Named Entity Ontology with Attribute Information. - Mari Carmen Suárez-Figueroa, Asunción Gómez-Pérez:
Towards a Glossary of Activities in the Ontology Engineering Field. - Yi-Rong Chen, Qin Lu, Wenjie Li, Gaoying Cui:
Chinese Core Ontology Construction from a Bilingual Term Bank. - Michael Kluck, Axel Huckstorf:
The European Thesaurus on International Relations and Area Studies - a Multilingual Resource for Indexing, Retrieval, and Translation.
Session O28 - Machine Translation and Multilinguality
- Takashi Tsunakawa, Naoaki Okazaki, Jun'ichi Tsujii:
Building Bilingual Lexicons using Lexical Translation Probabilities via Pivot Languages. - Yu Chen, Andreas Eisele, Martin Kay:
Improving Statistical Machine Translation Efficiency by Triangulation. - Caroline Lavecchia, David Langlois, Kamel Smaïli:
Phrase-Based Machine Translation based on Simulated Annealing. - Marine Carpuat, Dekai Wu:
Evaluation of Context-Dependent Phrasal Translation Lexicons for Statistical Machine Translation. - Sasa Hasan, Hermann Ney:
A Multi-Genre SMT System for Arabic to French.
Session O29 - Information Extraction and Question Answering
- Estelle Delpech, Patrick Saint-Dizier:
Investigating the Structure of Procedural Texts for Answering How-to Questions. - Igor Leturia, Antton Gurrutxaga, Nerea Areta, Eli Pociello:
Analysis and Performance of Morphological Query Expansion and Language-Filtering Words on Basque Web Searching. - Kirk Roberts, Andrew Hickl:
Scaling Answer Type Detection to Large Hierarchies. - Majid Razmara, Leila Kosseim:
Answering List Questions using Co-occurrence and Clustering. - Torsten Zesch, Christof Müller, Iryna Gurevych:
Extracting Lexical Semantic Knowledge from Wikipedia and Wiktionary.
Session O30 - Evaluation in Speech Processing
- Gregory A. Sanders, Sebastien Bronsart, Sherri L. Condon, Craig Schlenoff:
Odds of Successful Transfer of Low-Level Concepts: a Key Metric for Bidirectional Speech-to-Speech Machine Translation in DARPA's TRANSTAC Program. - Lori Lamel, Sophie Rosset, Christelle Ayache, Djamel Mostefa, Jordi Turmo, Pere Comas:
Question Answering on Speech Transcriptions: the QAST evaluation in CLEF. - Willemijn Heeren, Franciska de Jong, Laurens van der Werff, Marijn Huijbregts, Roeland Ordelman:
Evaluation of Spoken Document Retrieval for Historic Speech Collections. - Sherri L. Condon, Jon Phillips, Christy Doran, John S. Aberdeen, Dan Parvaz, Beatrice T. Oshika, Gregory A. Sanders, Craig Schlenoff:
Applying Automated Metrics to Speech Translation Dialogs. - Margot Mieskes, Michael Strube:
A Three-stage Disfluency Classifier for Multi Party Dialogues.
Session O31 - Evaluation and Machine Translation
- Jesús Giménez, Lluís Màrquez:
Towards Heterogeneous Automatic MT Error Analysis. - Bogdan Babych, Anthony Hartley:
Sensitivity of Automated MT Evaluation Metrics on Higher Quality MT Output: BLEU vs Task-Based Evaluation Methods. - Mark A. Przybocki, Kay Peterson, Sebastien Bronsart:
Translation Adequacy and Preference Evaluation Tool (TAP-ET). - Constantin Orasan, Oana Andreea Chiorean:
Evaluation of a Cross-lingual Romanian-English Multi-document Summariser.
Session O32 - Syntactically Annotated Corpora
- Øistein E. Andersen, Julien Nioche, Ted Briscoe, John Carroll:
The BNC Parsed with RASP4UIMA. - Kiyotaka Uchimoto, Yasuharu Den:
Word-level Dependency-structure Annotation to Corpus of Spontaneous Japanese and its Application. - Tejaswini Deoskar, Mats Rooth:
Induction of Treebank-Aligned Lexical Resources. - Olga Pustylnikov, Alexander Mehler, Rüdiger Gleim:
A Unified Database of Dependency Treebanks: Integrating, Quantifying & Evaluating Dependency Data.
Session O33 - Terminology
- Aïcha Bouhjar:
Amazigh Language Terminology in Morocco or Management of a "Multidimensional" Variation. - Yuhang Yang, Qin Lu, Tiejun Zhao:
Chinese Term Extraction Based on Delimiters. - Siham Boulaknadel, Béatrice Daille, Driss Aboutajdine:
A Multi-Word Term Extraction Program for Arabic Language. - Jonathan Butters, Fabio Ciravegna:
Using Similarity Metrics For Terminology Recognition.
Session O34 - Emotions
- Marco Guerini, Carlo Strapparava, Oliviero Stock:
Resources for Persuasion. - Guillaume Pitel, Gregory Grefenstette:
Semi-automatic Building Method for a Multidimensional Affect Dictionary for a New Language. - Laurence Devillers, Jean-Claude Martin:
Coding Emotional Events in Audiovisual Corpora. - Andrea Esuli, Fabrizio Sebastiani, Ilaria Urciuoli:
Annotating Expressions of Opinion and Emotion in the Italian Content Annotation Bank. - Isa Maks, Piek Vossen, Roxane Segers, Hennie van der Vliet:
Adjectives in the Dutch Semantic Lexical Database CORNETTO.
Session 035 - Semantics and Semantic Annotation
- Markus Dickinson, Chong Min Lee:
Detecting Errors in Semantic Annotation. - Michael Roth, Sabine Schulte im Walde:
Corpus Co-Occurrence, Dictionary and Wikipedia Entries as Resources for Semantic Relatedness Information. - Emiliano Giovannetti, Simone Marchi, Simonetta Montemagni, Roberto Bartolini:
Ontology Learning and Semantic Annotation: a Necessary Symbiosis. - Jordi Atserias, Hugo Zaragoza, Massimiliano Ciaramita, Giuseppe Attardi:
Semantically Annotated Snapshot of the English Wikipedia. - Rodney D. Nielsen, Wayne H. Ward, James H. Martin, Martha Palmer:
Annotating Students' Understanding of Science Concepts.
Session O36 - Evaluation Methodologies
- Rebecca J. Passonneau, Tom Lippincott, Tae Yano, Judith L. Klavans:
Relation between Agreement Measures on Human Labeling and Machine Learning Performance: Results from an Art History Domain. - Yves Peirsman, Simon De Deyne, Kris Heylen, Dirk Geeraerts:
The Construction and Evaluation of Word Space Models. - Olga Babko-Malaya:
Annotation of Nuggets and Relevance in GALE Distillation Evaluation. - James V. White, Daniel Hunter, Jacob D. Goldstein:
Statistical Evaluation of Information Distillation Systems. - Verena Rieser, Oliver Lemon:
Automatic Learning and Evaluation of User-Centered Objective Functions for Dialogue System Optimisation.
Session O37 - Lexicons, Corpora and Acquisition
- Viktor Bielický, Otakar Smrz:
Building the Valency Lexicon of Arabic Verbs. - Angus Roberts, Robert J. Gaizauskas, Mark Hepple, Yikun Guo:
Combining Terminology Resources and Statistical Methods for Entity Recognition: an Evaluation. - Rogelio Nazar, Jorge Vivaldi, M. Teresa Cabré:
A Suite to Compile and Analyze an LSP Corpus. - Eduardo Blanco, Núria Castell, Dan I. Moldovan:
Causal Relation Extraction. - Grzegorz Chrupala, Georgiana Dinu, Josef van Genabith:
Learning Morphology with Morfette.
Session O38 - Ontologies
- Gaoying Cui, Qin Lu, Wenjie Li, Yi-Rong Chen:
Corpus Exploitation from Wikipedia for Ontology Construction. - Shiyan Ou, Viktor Pekar, Constantin Orasan, Christian Spurk, Matteo Negri:
Development and Alignment of a Domain-Specific Ontology for Question Answering. - David Manzano-Macho, Asunción Gómez-Pérez, Daniel Borrajo:
Unsupervised and Domain Independent Ontology Learning: Combining Heterogeneous Sources of Evidence. - Alessandra Potrich, Emanuele Pianta:
L-ISA: Learning Domain Specific Isa-Relations from the Web. - Arno Hartholt, Thomas A. Russ, David R. Traum, Eduard H. Hovy, Susan Robinson:
A Common Ground for Virtual Humans: Using an Ontology in a Natural Language Oriented Virtual Human Architecture.
Session O39 - Multilingual Resources
- Eneko Agirre, Aitor Soroa:
Using the Multilingual Central Repository for Graph-Based Word Sense Disambiguation. - Fredric C. Gey, David Kirk Evans, Noriko Kando:
A Japanese-English Technical Lexicon for Translation and Language Research. - Le An Ha, Gabriela Fernandez, Ruslan Mitkov, Gloria Corpas Pastor:
Mutual Bilingual Terminology Extraction. - João Graça, Joana Paulo Pardal, Luísa Coheur, Diamantino Caseiro:
Building a Golden Collection of Parallel Multi-Language Word Alignment. - Elena Cabrio, Milen Kouylekov, Bernardo Magnini, Matteo Negri, Laura Hasler, Constantin Orasan, David Tomás, José Luis Vicedo González, Guenter Neumann, Corinna Weber:
The QALL-ME Benchmark: a Multilingual Resource of Annotated Spoken Requests for Question Answering.
Session O40 - Tools for Corpus Construction and Annotation
- Nick Campbell:
Tools & Resources for Visualising Conversational-Speech Interaction. - Maria Teresa Pazienza, Marco Pennacchiotti, Armando Stellato:
A Web Browser Extension for Growing-up Ontological Knowledge from Traditional Web Content. - Youssef Drissi, Branimir Boguraev, David A. Ferrucci, Paul T. Keyser, Anthony Levas:
A Development Environment for Configurable Meta-Annotators in a Pipelined NLP Architecture. - Georg Rehm, Richard Eckart, Christian Chiarcos, Johannes Dellert:
Ontology-Based XQuery'ing of XML-Encoded Language Resources on Multiple Annotation Layers. - Stefan Evert:
A Lightweight and Efficient Tool for Cleaning Web Pages.
Session O41 - Speech Varieties
- Lynette Melnar, Chen Liu:
Borrowing Language Resources for Development of Automatic Speech Recognition for Low- and Middle-Density Languages. - Sebastian Möller, Florian Gödde, Maria K. Wolters:
Corpus Analysis of Spoken Smart-Home Interactions with Older Users. - Kallirroi Georgila, Maria Klara Wolters, Vasilis Karaiskos, Melissa Kronenthal, Robert H. Logie, Neil Mayo, Johanna D. Moore, Matthew Watson:
A Fully Annotated Corpus for Studying the Effect of Cognitive Ageing on Users' Interactions with Spoken Dialogue Systems. - Catia Cucchiarini, Joris Driesen, Hugo Van hamme, Eric Sanders:
Recording Speech of Children, Non-Natives and Elderly People for HLT Applications: the JASMIN-CGN Corpus. - Christoph Draxler, Florian Schiel, Tania Ellbogen:
F0 of Adolescent Speakers - First Results for the German Ph@ttSessionz Database.
Session O42 - Multimodal Dialogue
- Yorick Wilks, David Benyon, Christopher Brewster, Pavel Ircing, Oli H. Mival:
Dialogue, Speech and Images: the Companions Project Data Set. - Jade Goldstein-Stewart, Kerri A. Goodwin, Roberta Evans Sabin, Ransom K. Winder:
Creating and Using a Correlated Corpus to Glean Communicative Commonalities. - Roberta Catizone, Alexiei Dingli, Hugo Pinto, Yorick Wilks:
Information Extraction Tools and Methods for Understanding Dialogue in a Companion. - Carlos Gómez Gallo, T. Florian Jaeger, James F. Allen, Mary D. Swift:
Production in a Multimodal Corpus: how Speakers Communicate Complex Actions.
Session O43 - Semantic Resources
- Harry Bunt, Chwhynny Overbeeke:
Towards Formal Interpretation of Semantic Annotation. - Marco Pennacchiotti, Diego De Cao, Paolo Marocco, Roberto Basili:
Towards a Vector Space Model for FrameNet-like Resources. - Pavel Smrz:
KnoFusius: a New Knowledge Fusion System for Interpretation of Gene Expression Data. - Kris Heylen, Yves Peirsman, Dirk Geeraerts, Dirk Speelman:
Modelling Word Similarity: an Evaluation of Automatic Synonymy Extraction Algorithms.
Session O44 - Corpora and Evaluation Resources
- Leen Cleuren, Jacques Duchateau, Pol Ghesquière, Hugo Van hamme:
Children's Oral Reading Corpus (CHOREC): Description and Assessment of Annotator Agreement. - Tommaso Caselli, Nancy Ide, Roberto Bartolini:
A Bilingual Corpus of Inter-linked Events. - Stephanie M. Strassel, Lauren Friedman, Safa Ismael, Linda Brandschain:
New Resources for Document Classification, Analysis and Translation Technologies. - Katrin Tomanek, Udo Hahn:
Approximating Learning Curves for Active-Learning-Driven Annotation.
Session O45 - Lexicons
- Thorsten Trippel, Michael Maxwell, Greville Corbett, Cambell Prince, Christopher D. Manning, Stephen Grimes, Steven Moran:
Lexicon Schemas and Related Data Models: when Standards Meet Users. - Cédric Messiant, Thierry Poibeau, Anna Korhonen:
LexSchem: a Large Subcategorization Lexicon for French Verbs. - Horacio Rodríguez, David Farwell, Javi Ferreres, Manuel Bertrán, Musa Alkhalifa, Maria Antònia Martí:
Arabic WordNet: Semi-automatic Extensions using Bayesian Inference.
Session O46 - Evaluation Methodologies
- Iñaki Sainz, Ibon Saratxaga, Eva Navas, Inmaculada Hernáez, Jon Sánchez, Iker Luengo, Igor Odriozola:
Subjective Evaluation of an Emotional Speech Database for Basque. - Sandra Kübler, Wolfgang Maier, Ines Rehbein, Yannick Versley:
How to Compare Treebanks. - Romaric Besançon, Stéphane Chaudiron, Djamel Mostefa, Ismaïl Timimi, Khalid Choukri:
The INFILE Project: a Crosslingual Filtering Systems Evaluation Campaign.
Session O47 - Authoring Tools and Corpora
- Dan Tufis, Alexandru Ceausu:
DIAC+: a Professional Diacritics Recovering System. - Ghazi Abuhakema, Reem Faraj, Anna Feldman, Eileen Fitzpatrick:
Annotating an Arabic Learner Corpus for Error. - Martin Reynaert:
All, and only, the Errors: more Complete and Consistent Spelling and OCR-Error Correction Evaluation.
Session O48 - TV and Video Processing
- Einav Itamar, Alon Itai:
Using Movie Subtitles for Creating a Large-Scale Bilingual Corpora. - Rob van Son, Wieneke Wesseling, Eric Sanders, Henk van den Heuvel:
The IFADV Corpus: a Free Dialog Video Corpus. - Alessio Brutti, Luca Cristoforetti, Walter Kellermann, Lutz Marquardt, Maurizio Omologo:
WOZ Acoustic Data Collection for Interactive TV.
Session P1 - Corpus Construction and Annotation
- Mikko Lounela:
Process Model for Composing High-quality Text Corpora. - Mariona Taulé, Maria Antònia Martí, Marta Recasens:
AnCora: Multilevel Annotated Corpora for Catalan and Spanish. - Stephen Purpura, John Wilkerson, Dustin Hillard:
The U.S. Policy Agenda Legislation Corpus Volume 1 - a Language Resource from 1947 - 1998. - Jeremy Bensley, Andrew Hickl:
Unsupervised Resource Creation for Textual Inference Applications. - Markus Dickinson, Charles Jochim:
A Simple Method for Tagset Comparision. - Nelleke Oostdijk, Martin Reynaert, Paola Monachesi, Gertjan van Noord, Roeland Ordelman, Ineke Schuurman, Vincent Vandeghinste:
From D-Coi to SoNaR: a reference corpus for Dutch. - Hiromi Itoh Ozaku, Akinori Abe, Kaoru Sagara, Kiyoshi Kogure:
Relationships between Nursing Converstaions and Activities. - Meghan Lammie Glenn, Stephanie M. Strassel, Lauren Friedman, Haejoong Lee, Shawn Medero:
Management of Large Annotation Projects Involving Multiple Human Judges: a Case Study of GALE Machine Translation Post-editing. - Harald Hammarström, Christina Thornell, Malin Petzell, Torbjörn Westerlund:
Bootstrapping Language Description: the case of Mpiemo (Bantu A, Central African Republic). - Satoshi Sato, Suguru Matsuyoshi, Yohsuke Kondoh:
Automatic Assessment of Japanese Text Readability Based on a Textbook Corpus.
Session P2 - LRs for Specific Domains: Bio-Medicine and Chemistry
- Paul Thompson, Philip Cotter, John McNaught, Sophia Ananiadou, Simonetta Montemagni, Andrea Trabucco, Giulia Venturi:
Building a Bio-Event Annotated Corpus for the Acquisition of Semantic Frames from Biomedical Corpora. - C. J. Rupp, Ann A. Copestake, Peter T. Corbett, Peter Murray-Rust, Advaith Siddharthan, Simone Teufel, Benjamin Waldron:
Language Resources and Chemical Informatics. - Udo Hahn, Elena Beisswanger, Ekaterina Buyko, Michael Poprat, Katrin Tomanek, Joachim Wermter:
Semantic Annotations for Biology: a Corpus Development Initiative at the Jena University Language & Information Engineering (JULIE) Lab. - Valeria Quochi, Monica Monachini, Riccardo Del Gratta, Nicoletta Calzolari:
A lexicon for biology and bioinformatics: the BOOTStrep experience. - Fabio Rinaldi, Gerold Schneider, Kaarel Kaljurand, Michael Hess:
Dependency-Based Relation Mining for Biomedical Literature. - Dimitrios Kokkinakis:
MeSH(c): from a Controlled Vocabulary to a Processable Resource. - Dimitrios Kokkinakis:
A Semantically Annotated Swedish Medical Corpus. - Mehdi Embarek, Olivier Ferret:
Learning Patterns for Building Resources about Semantic Relations in the Medical Domain.
Session P3 - Syntactically Annotated Resources and Related Tools
- Dino Ienco, Serena Villata, Cristina Bosco:
Automatic extraction of subcategorization frames for Italian. - Jerid Francom, Mans Hulden:
Parallel Multi-Theory Annotations of Syntactic Structure. - Meni Adler, Yael Dahan Netzer, Yoav Goldberg, David Gabay, Michael Elhadad:
Tagging a Hebrew Corpus: the Case of Participles. - Joy Deep Nath, Monojit Choudhury, Animesh Mukherjee, Christian Biemann, Niloy Ganguly:
Unsupervised Parts-of-Speech Induction for Bengali. - Guadalupe Aguado de Cea, Javier Puche, José Ángel Ramos Gargantilla:
Tagging Spanish Texts: the Problem of Problem of "SE". - Jirí Mírovský:
Does Netgraph Fit Prague Dependency Treebank? - Tomas By:
The Kalshnikov 691 Dependency Bank. - Natalie Schluter, Josef van Genabith:
Treebank-Based Acquisition of LFG Parsing Resources for French. - Svetla Koeva, Borislav Rizov, Svetlozara Leseva:
Chooser: a Multi-Task Annotation Tool. - Pavlina Fragkou, Georgios Petasis, Aris Theodorakos, Vangelis Karkaletsis, Constantine D. Spyropoulos:
BOEMIE Ontology-Based Text Annotation Tool. - Ralf Krestel, Sabine Bergler, René Witte:
Minding the Source: Automatic Tagging of Reported Speech in Newspaper Articles.
Session P4 - Named Entity Recognition, Information Extraction and Document Classification
- Piek Vossen, Eneko Agirre, Nicoletta Calzolari, Christiane Fellbaum, Shu-Kai Hsieh, Chu-Ren Huang, Hitoshi Isahara, Kyoko Kanzaki, Andrea Marchetti, Monica Monachini, Federico Neri, Remo Raffaelli, German Rigau, Maurizio Tesconi, Joop VanGent:
KYOTO: a System for Mining, Structuring and Distributing Knowledge across Languages and Cultures. - Ulrich Schäfer, Hans Uszkoreit, Christian Federmann, Torsten Marek, Yajing Zhang:
Extracting and Querying Relations in Scientific Papers on Language Technology. - Adrian Iftene, Alexandra Balahur-Dobrescu:
Named Entity Relation Mining using Wikipedia. - Claire Grover, Sharon Givon, Richard Tobin, Julian Ball:
Named Entity Recognition for Digitised Historical Texts. - Zhiyi Song, Stephanie M. Strassel:
Entity Translation and Alignment in the ACE-07 ET Task. - Yoji Kiyota, Noriyuki Tamura, Satoshi Sakai, Hiroshi Nakagawa, Hidetaka Masuda:
Automated Subject Induction from Query Keywords through Wikipedia Categories and Subject Headings. - Linus Sellberg, Arne Jönsson:
Using Random Indexing to improve Singular Value Decomposition for Latent Semantic Analysis.
Session P5 - Multi-Word Expressions
- Spela Vintar, Darja Fiser:
Harvesting Multi-Word Expressions from Parallel Corpora. - Andrea Agili, Marco Fabbri, Alessandro Panunzi, Manuel Zini:
Integration of a Multilingual Keyword Extractor in a Document Management System. - Daiga Deksne, Raivis Skadins, Inguna Skadina:
Dictionary of Multiword Expressions for Translation into highly Inflected Languages. - Grazyna Vetulani, Zygmunt Vetulani, Tomasz Obrêbski:
Verb-Noun Collocation SyntLex Dictionary: Corpus-Based Approach. - Weiruo Qu, Christoph Ringlstetter, Randy Goebel:
Targeting Chinese Nominal Compounds in Corpora. - Margarita Alonso Ramos, Owen Rambow, Leo Wanner:
Using Semantically Annotated Corpora to Build Collocation Resources.
Session P6 - Ontologies and Knowledge
- Katia Kermanidis, Aristomenis Thanopoulos, Manolis Maragoudakis, Nikos Fakotakis:
Eksairesis: A Domain-Adaptable System for Ontology Building from Unstructured Text. - Francisco José Álvarez Montero, Antonio Ramón Vaquero Sánchez, Fernando Sáenz-Pérez:
Conceptual Modeling of Ontology-based Linguistic Resources with a Focus on Semantic Relations. - Paul Buitelaar, Thomas Eigner:
Ontology Search with the OntoSelect Ontology Library. - Cássia Trojahn dos Santos, Paulo Quaresma, Renata Vieira:
A Framework for Multilingual Ontology Mapping. - Laura Kassner, Vivi Nastase, Michael Strube:
Acquiring a Taxonomy from the German Wikipedia. - Davide Picca, Alfio Massimiliano Gliozzo, Aldo Gangemi:
LMM: an OWL-DL MetaModel to Represent Heterogeneous Lexical Knowledge. - Hitoshi Isahara, Francis Bond, Kiyotaka Uchimoto, Masao Utiyama, Kyoko Kanzaki:
Development of the Japanese WordNet. - Neil Newbold, Bogdan Vrusias, Lee Gillam:
Lexical Ontology Extraction using Terminology Analysis: Automating Video Annotation. - Mukda Suktarachan, Dussadee Thamvijit, Sachit Rajbhandari, Daoyos Noikongka, Puwarat Pavaputanont Na Mahasarakham, Panita Yongyuth, Asanee Kawtrakul, Margherita Sini:
Workbench with Authoring Tools for Collaborative Multi-lingual Ontological Knowledge Construction and Maintenance. - Mehrnoush Shamsfard:
Towards Semi Automatic Construction of a Lexical Ontology for Persian. - Gerard de Melo, Gerhard Weikum:
Mapping Roget's Thesaurus and WordNet to French. - Christophe Jouis, Julien Bourdaillet:
Representation of Atypical Entities in Ontologies. - Siaw-Fong Chung, Laurent Prévot, Mingwei Xu, Kathleen Ahrens, Shu-Kai Hsieh, Chu-Ren Huang:
Extracting Concrete Senses of Lexicon through Measurement of Conceptual Similarity in Ontologies. - Jun Okamoto, Kiyoko Uchiyama, Shun Ishizaki:
A Contextual Dynamic Network Model for WSD Using Associative Concept Dictionary. - Berenike Loos, Lasse Schwarten:
A Semantic Memory for Incremental Ontology Population.
Session P7 - Term Identification/Extraction and Terminological Databases
- Jorge Vivaldi, Anna Joan, Mercè Lorente:
Turning a Term Extractor into a new Domain: first Experiences. - Peter G. Anick, Vijay Murthi, Shaji Sebastian:
Similar Term Discovery using Web Search. - Junko Kubo, Keita Tsuji, Shigeo Sugimoto:
Temporal Aspects of Terminology for Automatic Term Recognition: Case Study on Women's Studies Terms. - Ziqi Zhang, José Iria, Christopher Brewster, Fabio Ciravegna:
A Comparative Evaluation of Term Recognition Algorithms. - Véronique Hoste, Els Lefever, Klaar Vanopstal, Isabelle Delaere:
Learning-based Detection of Scientific Terms in Patient Information. - Eli Pociello, Antton Gurrutxaga, Eneko Agirre, Izaskun Aldezabal, German Rigau:
WNTERM: Enriching the MCR with a Terminological Dictionary. - Rita Marinelli, Melissa Tiberi, Remo Bindi:
Encoding Terms from a Scientific Domain in a Terminological Database: Methodology and Criteria.
Session P8 - Information Extraction, Question Answering and Document Classification
- Thomas Mandl, Fredric C. Gey, Giorgio Maria Di Nunzio, Nicola Ferro, Mark Sanderson, Diana Santos, Christa Womser-Hacker:
An Evaluation Resource for Geographic Information Retrieval. - Jorge Civera, Alfons Juan-Císcar:
Bilingual Text Classification using the IBM 1 Translation Model. - Hiroyuki Shinnou, Minoru Sasaki:
Ping-pong Document Clustering using NMF and Linkage-Based Refinement. - Hiroyuki Shinnou, Minoru Sasaki:
Spectral Clustering for a Large Data Set by Reducing the Similarity Matrix Size. - Danica Damljanovic, Valentin Tablan, Kalina Bontcheva:
A Text-based Query Interface to OWL Ontologies. - Han Ren, Dong-Hong Ji, Lei Han:
A Research on Automatic Chinese Catchword Extraction. - Isaac G. Councill, C. Lee Giles, Min-Yen Kan:
ParsCit: an Open-source CRF Reference String Parsing Package. - Shunsuke Kozawa, Hitomi Tohyama, Kiyotaka Uchimoto, Shigeki Matsubara:
Automatic Acquisition of Usage Information for Language Resources. - Michael Wiegand, Jochen L. Leidner, Dietrich Klakow:
Cost-Sensitive Learning in Answer Extraction. - Lukasz Degórski, Michal Marcinczuk, Adam Przepiórkowski:
Definition Extraction Using a Sequential Combination of Baseline Grammars and Machine Learning Classifiers. - Francesca Fallucchi, Fabio Massimo Zanzotto:
Yet another Platform for Extracting Knowledge from Corpora. - Milena Yankova, Horacio Saggion, Hamish Cunningham:
A Framework for Identity Resolution and Merging for Multi-source Information Extraction. - Jussi Karlgren, Hercules Dalianis, Bart Jongejan:
Experiments to Investigate the Connection between Case Distribution and Topical Relevance of Search Terms in an Information Retrieval Setting. - Fidelia Ibekwe-Sanjuan, Chaomei Chen, Roberto Pinho:
Identifying Strategic Information from Scientific Articles through Sentence Classification. - Susana Azeredo, Silvia Moraes, Vera Lúcia Strube de Lima:
Keywords, k-NN and Neural Networks: a Support for Hierarchical Categorization of Texts in Brazilian Portuguese. - Hossam Ibrahim, Kareem Darwish, Abdel-Rahim Madany:
Automatic Extraction of Textual Elements from News Web Pages. - Eiko Yamamoto, Hitoshi Isahara, Akira Terada, Yasunori Abe:
Extraction of Informative Expressions from Domain-specific Documents. - Rune Sætre, Brian Kemper, Kanae Oda, Naoaki Okazaki, Yukiko Matsuoka, Norihiro Kikuchi, Hiroaki Kitano, Yoshimasa Tsuruoka, Sophia Ananiadou, Jun'ichi Tsujii:
Connecting Text Mining and Pathways using the PathText Resource. - Jan Pomikálek, Pavel Rychlý:
Detecting Co-Derivative Documents in Large Text Collections. - Lothar Lemnitzer, Paola Monachesi:
Extraction and Evaluation of Keywords from Learning Objects: a Multilingual Approach. - Peng Zhang, Wenjie Li, Furu Wei, Qin Lu, Yuexian Hou:
Exploiting the Role of Position Feature in Chinese Relation Extraction. - Ben Allison, Louise Guthrie:
Authorship Attribution of E-Mail: Comparing Classifiers over a New Corpus for Evaluation. - Michael Kaisser, John Lowe:
Creating a Research Collection of Question Answer Sentence Pairs with Amazon's Mechanical Turk. - Feiyu Xu, Hans Uszkoreit, Hong Li, Niko Felger:
Adaptation of Relation Extraction Rules to New Domains. - Asuka Sumida, Naoki Yoshinaga, Kentaro Torisawa:
Boosting Precision and Recall of Hyponymy Relation Acquisition from Hierarchical Layouts in Wikipedia. - Margot Mieskes, Michael Strube:
Parameters for Topic Boundary Detection in Multi-Party Dialogues. - Eugenio Picchi, Eva Sassolini, Sebastiana Cucurullo, Francesca Bertagna, Paola Baroni:
Semantic Press. - Lei Xia, José Iria:
An Approach to Modeling Heterogeneous Resources for Information Extraction. - Anca Dinu:
On Classifying Coherent/Incoherent Romanian Short Texts. - Lorraine Goeuriot, Natalia Grabar, Béatrice Daille:
Characterization of Scientific and Popular Science Discourse in French, Japanese and Russian. - Jalal Maleki, Lars Ahrenberg:
Converting Romanized Persian to the Arabic Writing Systems. - Nasser Abouzakhar, Ben Allison, Louise Guthrie:
Unsupervised Learning-based Anomalous Arabic Text Detection. - Prokopis Prokopidis, Vassia Karra, Aggeliki Papagianopoulou, Stelios Piperidis:
Condensing Sentences for Subtitle Generation. - Simon Mille, Leo Wanner:
Making Text Resources Accessible to the Reader: the Case of Patent Claims.
Session P9 - Authoring Tools and Related Resources
- Jack Halpern:
Exploiting Lexical Resources for Disambiguating CJK and Arabic Orthographic Variants. - Neil Newbold, Lee Gillam:
Automatic Document Quality Control. - Thepchai Supnithi, Suchinder Singh, Taneth Ruangrajitpakorn, Prachya Boonkwan, Monthika Boriboon:
OpenCCG Workbench and Visualization Tool. - Matthieu Hermet, Alain Désilets, Stan Szpakowicz:
Using the Web as a Linguistic Resource to Automatically Correct Lexico-Syntactic Errors. - Davide Fossati, Barbara Di Eugenio:
I saw TREE trees in the park: How to Correct Real-Word Spelling Mistakes. - Martí Quixal, Toni Badia, Francesc Benavent, José Roberto de Freitas Boullosa, Judith Domingo, Bernat Grau, Guillem Massó, Oriol Valentín:
User-Centred Design of Error Correction Tools. - Wei Liu, Ben Allison, Louise Guthrie:
Professor or Screaming Beast? Detecting Anomalous Words in Chinese. - Iñaki Alegria, Klara Ceberio, Nerea Ezeiza, Aitor Soroa, Gregorio Hernández:
Spelling Correction: from Two-Level Morphology to Open Source. - Catalina Hallett, David Hardcastle:
Automatic Rewriting of Patient Record Narratives.
Session P10 - Coreference and Discourse
- Yannick Versley, Simone Paolo Ponzetto, Massimo Poesio, Vladimir Eidelman, Alan Jern, Jason Smith, Xiaofeng Yang, Alessandro Moschitti:
BART: A modular toolkit for coreference resolution. - Massimo Poesio, Udo Kruschwitz, Jon Chamberlain:
ANAWIKI: Creating Anaphorically Annotated Resources through Web Cooperation. - Daniela Goecke, Maik Stührenberg, Andreas Witt:
Influence of Text Type and Text Length on Anaphoric Annotation. - Sandra Williams, Richard Power:
Deriving Rhetorical Complexity Data from the RST-DT Corpus. - Márton Miháltz:
Knowledge-based Coreference Resolution for Hungarian. - Malvina Nissim, Sara Perboni:
The Italian Particle "ne": Corpus Construction and Analysis.
Session P11 - Tools, Systems, Applications
- Dawn Knight, Paul Tennent:
Introducing DRS (The Digital Replay System): a Tool for the Future of Corpus Linguistic Research and Analysis. - Michaela Atterer, Hinrich Schütze:
An Inverted Index for Storing and Retrieving Grammatical Dependencies. - Jens Nilsson, Joakim Nivre:
MaltEval: an Evaluation and Visualization Tool for Dependency Parsing. - Hiroaki Sato:
New Functions of FrameSQL for Multilingual FrameNets. - Hiroyuki Shinnou, Minoru Sasaki:
Division of Example Sentences Based on the Meaning of a Target Word Using Semi-Supervised Clustering. - Hiroaki Saito, Shunta Kuboya, Takaaki Sone, Hayato Tagami, Kyoko Ohara:
The Japanese FrameNet Software Tools. - Maria Teresa Pazienza, Armando Stellato, Alexandra Tudorache:
JMWNL: an Extensible Multilingual Library for Accessing Wordnets in Different Languages. - Diana Maynard:
Benchmarking Textual Annotation Tools for the Semantic Web. - Liviu Petrisor Dinu, Marius Popescu, Anca Dinu:
Authorship Identification of Romanian Texts with Controversial Paternity.
Session P12 - Lexical Resources and Tools
- Marc Kemps-Snijders, Claus Zinn, Jacquelijn Ringersma, Menzo Windhouwer:
Ensuring Semantic Interoperability on Lexical Resources. - Marc Finthammer, Irene M. Cramer:
Exploring and Navigating: Tools for GermaNet. - Marianne Santaholma, Nikos Chatzichrisafis:
A Knowledge-Modeling Approach for Multilingual Regulus Lexica. - Michael Rosner:
ODL: an Object Description Language for Lexical Information. - Dan Cristea, Corina Forascu, Marius Raschip, Michael Zock:
How to Evaluate and Raise the Quality in a Collaborative Lexicographic Approach. - Bolette Sandford Pedersen, Anna Braasch, Lina Henriksen, Sussi Olsen, Claus Povlsen:
Merging a Syntactic Resource with a WordNet: a Feasibility Study of a Merge between STO and DanNet. - Borislav Rizov:
Hydra: a Modal Logic Tool for Wordnet Development, Validation and Exploration.
Session P13 - Evaluation: Resources, Tools, Methodologies, Campaigns
- Míriam Luján-Mares, Carlos D. Martínez-Hinarejos, Vicent Alabau Gonzalvo:
Evaluation of several Maximum Likelihood Linear Regression Variants for Language Adaptation. - Laurianne Sitbon, Patrice Bellot, Philippe Blache:
Evaluation of Lexical Resources and Semantic Networks on a Corpus of Mental Associations. - Heike Bieler, Stefanie Dipper:
Measures for Term and Sentence Relevances: an Evaluation for German. - Julia Ritz, Stefanie Dipper, Michael Götze:
Annotation of Information Structure: an Evaluation across different Types of Texts. - Quang Thang Dinh, Hong Phuong Le, Nguyên Thi Minh Huyên, Cam-Tu Nguyen, Mathias Rossignol, Xuân Luong Vu:
Word Segmentation of Vietnamese Texts: a Comparison of Approaches. - Cristina Bosco, Alessandro Mazzei, Vincenzo Lombardo, Giuseppe Attardi, Anna Corazza, Alberto Lavelli, Leonardo Lesmo, Giorgio Satta, Maria Simi:
Comparing Italian parsers on a common Treebank: the EVALITA experience. - Bernardo Magnini, Amedeo Cappelli, Fabio Tamburini, Cristina Bosco, Alessandro Mazzei, Vincenzo Lombardo, Francesca Bertagna, Nicoletta Calzolari, Antonio Toral, Valentina Bartalesi Lenzi, Rachele Sprugnoli, Manuela Speranza:
Evaluation of Natural Language Tools for Italian: EVALITA 2007. - Maria Teresa Pazienza, Armando Stellato, Alexandra Tudorache:
A Bottom-up Comparative Study of EuroWordNet and WordNet 3.0 Lexical and Semantic Relations. - Simon Scerri, Myriam Mencke, Brian Davis, Siegfried Handschuh:
Evaluating the Ontology underlying sMail - the Conceptual Framework for Semantic Email Communication. - Václav Novák, Keith B. Hall:
Inter-sentential Coreferences in Semantic Networks: An Evaluation of Manual Annotation. - Mohamed Maamouri, Seth Kulick, Ann Bies:
Diacritic Annotation in the Arabic Treebank and its Impact on Parser Evaluation. - Chantal Enguehard, Harouna Naroua:
Evaluation of Virtual Keyboards for West-African Languages. - Constantin Orasan, Dan Cristea, Ruslan Mitkov, António Horta Branco:
Anaphora Resolution Exercise: an Overview. - Diana Santos, Alberto Simões:
Portuguese-English Word Alignment: some Experiments. - Karin Schuler, Vinod Kaggal, James J. Masanz, Philip V. Ogren, Guergana K. Savova:
System Evaluation on a Named Entity Corpus from Clinical Notes. - Philip V. Ogren, Guergana K. Savova, Christopher G. Chute:
Constructing Evaluation Corpora for Automated Clinical Named Entity Recognition. - Eric K. Ringger, Marc Carmen, Robbie Haertel, Kevin D. Seppi, Deryle Lonsdale, Peter McClanahan, James L. Carroll, Noel Ellison:
Assessing the Costs of Machine-Assisted Corpus Annotation through a User Study. - Alexandre Allauzen, Hélène Bonneau-Maynard:
Training and Evaluation of POS Taggers on the French MULTITAG Corpus. - Marco Baroni, Francis Chantree, Adam Kilgarriff, Serge Sharoff:
Cleaneval: a Competition for Cleaning Web Pages. - Mark Arehart, Keith J. Miller:
A Ground Truth Dataset for Matching Culturally Diverse Romanized Person Names. - Atsushi Fujii, Masao Utiyama, Mikio Yamamoto, Takehito Utsuro:
Producing a Test Collection for Patent Machine Translation in the Seventh NTCIR Workshop. - Marilisa Amoia, Claire Gardent:
A Test Suite for Inference Involving Adjectives. - Takanobu Nishiura, Masato Nakayama, Yuki Denda, Norihide Kitaoka, Kazumasa Yamamoto, Takeshi Yamada, Satoru Tsuge, Chiyomi Miyajima, Masakiyo Fujimoto, Tetsuya Takiguchi, Satoshi Tamura, Shingo Kuroiwa, Kazuya Takeda, Satoshi Nakamura:
Evaluation Framework for Distant-talking Speech Recognition under Reverberant Environments: newest Part of the CENSREC Series -. - Olivier Hamon, Djamel Mostefa:
An Experimental Methodology for an End-to-End Evaluation in Speech-to-Speech Translation.
Session P14 - Evaluation: Resources, Tools, Systems, Methodologies
- Carlos D. Martínez-Hinarejos, Vicent Tamarit:
Evaluation of Different Segmentation Techniques for Dialogue Turns. - David Griol, Lluís F. Hurtado, Encarna Segarra, Emilio Sanchis:
Acquisition and Evaluation of a Dialog Corpus through WOz and Dialog Simulation Techniques. - Susan Robinson, David R. Traum, Midhun Ittycheriah, Joe Henderer:
What would you Ask a conversational Agent? Observations of Human-Agent Dialogues in a Museum Setting. - Dave Toney, Sophie Rosset, Aurélien Max, Olivier Galibert, Éric Bilinski:
An Evaluation of Spoken and Textual Interaction in the RITEL Interactive Question Answering System. - Muriel Amar, Sophie David, Rachel Panckhurst, Lisa Whistlecroft:
Classification Procedures for Software Evaluation. - Sylwia Ozdowska:
Cross-Corpus Evaluation of Word Alignment. - Diana Maynard, Wim Peters, Yaoyong Li:
Evaluating Evaluation Metrics for Ontology-Based Applications: Infinite Reflection. - Diana McCarthy:
Lexical Substitution as a Framework for Multiword Evaluation. - Martin Emms:
Tree Distance and Some Other Variants of Evalb. - A. Cüneyd Tantug, Kemal Oflazer, Ilknur Durgar El-Kahlout:
BLEU+: a Tool for Fine-Grained BLEU Computation. - C. Ray Graham, Deryle Lonsdale, Casey Kennington, Aaron Johnson, Jeremiah McGhee:
Elicited Imitation as an Oral Proficiency Measure with ASR Scoring. - Pedro Concejero Cerezo, Daniel Tapias Merino, Juan José Rodríguez Soler, Juan Carlos Luengo, Sebastián Sánchez:
Methodology for Evaluating the Usability of User Interfaces in Mobile Services. - Edouard Geoffrois:
An Economic View on Human Language Technology Evaluation. - Beatrice Alex:
Comparing Corpus-based to Web-based Lookup Techniques for Automatic English Inclusion Detection. - Laura Hasler:
Centering Theory for Evaluation of Coherence in Computer-Aided Summaries. - Stephanie M. Strassel, Mark A. Przybocki, Kay Peterson, Zhiyi Song, Kazuaki Maeda:
Linguistic Resources and Evaluation Techniques for Evaluation of Cross-Document Automatic Content Extraction. - Johan Bos:
Let's not Argue about Semantics. - David Hardcastle, Donia Scott:
Can we Evaluate the Quality of Generated Text? - Keith J. Miller, Mark Arehart, Catherine Ball, John Polk, Alan Rubenstein, Kenneth Samuel, Elizabeth Schroeder, Eva Vecchi, Chris Wolf:
An Infrastructure, Tools and Methodology for Evaluation of Multicultural Name Matching Systems. - Laurianne Sitbon, Patrice Bellot, Philippe Blache:
Evaluating Robustness Of A QA System Through A Corpus Of Real-Life Questions. - Ann Devitt, Khurshid Ahmad:
Sentiment Analysis and the Use of Extrinsic Datasets in Evaluation. - Cyril Grouin:
Certification and Cleaning up of a Text Corpus: Towards an Evaluation of the "Grammatical" Quality of a Corpus. - Laurent Blin, Olivier Boëffard, Vincent Barreaud:
WEB-Based Listening Test System for Speech Synthesis and Speech Conversion Evaluation. - Renata Queiroz Dividino, Massimo Romanelli, Daniel Sonntag:
Semiotic-based Ontology Evaluation Tool (S-OntoEval). - George Demetriou, Robert J. Gaizauskas, Haotian Sun, Angus Roberts:
ANNALIST - ANNotation ALIgnment and Scoring Tool. - Andrei Popescu-Belis, Mike Flynn, Pierre Wellner, Philippe Baudrion:
Task-Based Evaluation of Meeting Browsers: from Task Elicitation to User Behavior Analysis. - Paula Estrella, Andrei Popescu-Belis, Maghi King:
Improving Contextual Quality Models for MT Evaluation Based on Evaluators' Feedback. - Brian A. Weiss, Craig Schlenoff, Gregory A. Sanders, Michelle Potts Steves, Sherri L. Condon, Jon Phillips, Dan Parvaz:
Performance Evaluation of Speech Translation Systems. - Arne Mauser, Sasa Hasan, Hermann Ney:
Automatic Evaluation Measures for Statistical Machine Translation System Optimization.
Session P15 - LR Infrastructures and Architectures
- Dan Tufis, Radu Ion, Alexandru Ceausu, Dan Stefanescu:
RACAI's Linguistic Web Services. - Hanno Biber, Evelyn Breiteneder, Karlheinz Mörth:
Words in Contexts: Digital Editions of Literary Journals in the "AAC - Austrian Academy Corpus". - Chris Biemann, Uwe Quasthoff, Gerhard Heyer, Florian Holz:
ASV Toolbox: a Modular Collection of Language Exploration Tools. - António Branco, Francisco Costa, Pedro Martins, Filipe Nunes, João Ricardo Silva, Sara Silveira:
LX-Service: Web Services of Language Technology for Portuguese. - Emanuele Pianta, Christian Girardi, Roberto Zanoli:
The TextPro Tool Suite. - Bayan Abu Shawar, Eric Atwell:
An AI-inspired intelligent agent/student architecture to combine Language Resources research and teaching. - Kjell Elenius, Eva Forsbom, Beáta Megyesi:
Language Resources and Tools for Swedish: A Survey. - Lars Nygaard, Joel Priestley, Anders Nøklestad, Janne Bondi Johannessen:
Glossa: a Multilingual, Multimodal, Configurable User Interface. - Ekaterina Buyko, Christian Chiarcos, Antonio Pareja-Lora:
Ontology-Based Interface Specifications for a NLP Pipeline Architecture. - Daan Broeder, Thierry Declerck, Erhard W. Hinrichs, Stelios Piperidis, Laurent Romary, Nicoletta Calzolari, Peter Wittenburg:
Foundation of a Component-based Flexible Registry for Language Resources and Technology. - Daan Broeder, David Nathan, Sven Strömqvist, Remco van Veenendaal:
Building a Federation of Language Resource Repositories: the DAM-LR Project and its Continuation within CLARIN. - Paul Trilsbeek, Daan Broeder, Tobias Valkenhoef, Peter Wittenburg:
A Grid of Regional Language Archives. - Takenobu Tokunaga, Dain Kaplan, Chu-Ren Huang, Shu-Kai Hsieh, Nicoletta Calzolari, Monica Monachini, Claudia Soria, Kiyoaki Shirai, Virach Sornlertlamvanich, Thatsanee Charoenporn, Yingju Xia:
Adapting International Standard for Asian Language Technologies. - Keiji Shinzato, Daisuke Kawahara, Chikara Hashimoto, Sadao Kurohashi:
A Large-Scale Web Data Collection as a Natural Language Processing Infrastructure. - Riccardo Del Gratta, Roberto Bartolini, Tommaso Caselli, Monica Monachini, Claudia Soria, Nicoletta Calzolari:
UFRA: a UIMA-based Approach to Federated Language Resource Architecture. - Georg Rehm, Oliver Schonefeld, Andreas Witt, Timm Lehmberg, Christian Chiarcos, Hanan Bechara, Florian Eishold, Kilian Evang, Magdalena Leshtanska, Aleksandar Savkov, Matthias Stark:
The Metadata-Database of a Next Generation Sustainability Web-Platform for Language Resources. - Piroska Lendvai, Steve Hunt:
From Field Notes towards a Knowledge Base. - Hitomi Tohyama, Shunsuke Kozawa, Kiyotaka Uchimoto, Shigeki Matsubara, Hitoshi Isahara:
Construction of a Metadata Database for Efficient Development and Use of Language Resources. - Bodil Nistrup Madsen, Hanne Erdman Thomsen:
A Taxonomy of Lexical Metadata Categories.
Session P16 - LR National/International Projects, Organizational/Policy Issues
- Shuichi Itahashi, Chiu-yu Tseng:
The 2008 Oriental COCOSDA Book Project: in Commemoration of the First Decade of Sustained Activities in Asia. - Adam Przepiórkowski, Rafal L. Górski, Barbara Lewandowska-Tomaszyk, Marek Lazinski:
Towards the National Corpus of Polish. - Einar Meister, Jaak Vilo:
Strengthening the Estonian Language Technology. - Bente Maegaard, Mohammed Atiyya, Khalid Choukri, Steven Krauwer, Chafic Mokbel, Mustafa Yaseen:
MEDAR: Collaboration between European and Mediterranean Arabic Partners to Support the Development of Language Technology for Arabic. - Simon Krek, Vojko Gorjanc, Spela Arhar:
Slovene Terminology Web Portal and the TBX-Compatible Simplified DTD/schema.
Session P17 - Standards and Best Practices for LRs
- Volha Petukhova, Harry Bunt:
LIRICS Semantic Role Annotation: Design and Evaluation of a Set of Data Categories. - Daniel Zeman:
Reusable Tagset Conversion Using Tagset Drivers. - Marie-Jeanne Derouin, André Le Meur:
Presentation of the New ISO-Standard for the Representation of Entries in Dictionaries: ISO 1951. - Marc Kemps-Snijders, Menzo Windhouwer, Peter Wittenburg, Sue Ellen Wright:
ISOcat: Corralling Data Categories in the Wild. - Isa Maks, Carole Tiberius, Remco van Veenendaal:
Standardising Bilingual Lexical Resources According to the Lexicon Markup Framework. - Thierry Declerck:
A Framework for Standardized Syntactic Annotation. - Victoria Arranz, Franck Gandcher, Valérie Mapelli, Khalid Choukri:
A Guide for the Production of Reusable Language Resources.
Session P18 - Lexical Resources and Tools
- Denis Maurel:
Prolexbase: a Multilingual Relational Lexical Database of Proper Names. - Yoshihiko Hayashi, Chiharu Narawa, Monica Monachini, Claudia Soria, Nicoletta Calzolari:
Ontologizing Lexicon Access Functions based on an LMF-based Lexicon Taxonomy. - Ana-Maria Barbu:
Romanian Lexical Data Bases: Inflected and Syllabic Forms Dictionaries. - Atsushi Fujii:
Producing an Encyclopedic Dictionary using Patent Documents. - Folkert de Vriend, Jan Pieter Kunst, Louis ten Bosch, Charlotte Giesbers, Roeland van Hout:
Evaluating the Relationship between Linguistic and Geographic Distances using a 3D Visualization. - Piotr Banski, Radoslaw Moszczynski:
Enhancing an English-Polish Electronic Dictionary for Multiword Expression Research. - Claire Brierley, Eric Atwell:
ProPOSEL: A Prosody and POS English Lexicon for Language Engineering. - Eline Westerhout, Paola Monachesi:
Creating Glossaries Using Pattern-Based and Machine Learning Techniques. - Lynne J. Cahill:
Using Similarity Measures to Extend the LinGO Lexicon. - Peter Adolphs:
Acquiring a Poor Man's Inflectional Lexicon for German. - Núria Bel, Sergio Espeja, Montserrat Marimon, Marta Villegas:
COLDIC, a Lexicographic Platform for LMF compliant lexica.
Session P19 - Morphology, Syntax and Tools
- David Bamman, Marco Passarotti, Roberto Busa, Gregory R. Crane:
The Annotation Guidelines of the Latin Dependency Treebank and Index Thomisticus Treebank: the Treatment of some specific Syntactic Constructions in Latin. - Dan Tufis, Elena Irimia, Radu Ion, Alexandru Ceausu:
Unsupervised Lexical Acquisition for Part of Speech Tagging. - Amalia Todirascu, Dan Tufis, Ulrich Heid, Christopher Gledhill, Dan Stefanescu, Marion Weller, François Rousselot:
A Hybrid Approach to Extracting and Classifying Verb+Noun Constructions. - Ekaterina Lapshinova-Koltunski, Ulrich Heid:
Head or Non-head? Semi-automatic Procedures for Extracting and Classifying Subcategorisation Properties of Compounds. - Manuel Kountz, Ulrich Heid, Kerstin Eckart:
A LAF/GrAF based Encoding Scheme for underspecified Representations of syntactic Annotations. - Tomaz Erjavec, Simon Krek:
The JOS Morphosyntactically Tagged Corpus of Slovene. - Aleksander Buczynski, Adam Przepiórkowski:
spade Demo: An Open Source Tool for Partial Parsing and Morphosyntactic Disambiguation. - Silke Scheible:
Annotating Superlatives. - Steliana Ivanova, Sandra Kübler:
POS Tagging for German: how important is the Right Context? - Christian Hänig, Stefan Bordag, Uwe Quasthoff:
UnsuParse: unsupervised Parsing with unsupervised Part of Speech Tagging. - Sara Tonelli, Rodolfo Delmonte, Antonella Bristot:
Enriching the Venice Italian Treebank with Dependency and Grammatical Relations. - Kristina Vuckovic, Marko Tadic, Zdravko Dovedan:
Rule-Based Chunker for Croatian. - Valeria Quochi, Basilio Calderone:
Learning properties of Noun Phrases: from data to functions. - Eva Banik, Alan Lee:
A Study of Parentheticals in Discourse Corpora - Implications for NLG Systems. - Mohamed Maamouri, Ann Bies, Seth Kulick:
Enhancing the Arabic Treebank: a Collaborative Effort toward New Annotation Guidelines. - Martha Palmer, Olga Babko-Malaya, Ann Bies, Mona T. Diab, Mohamed Maamouri, Aous Mansouri, Wajdi Zaghouani:
A Pilot Arabic Propbank.
Session P20 - Multimodal, Multimedia and Subjective Corpus
- Mark A. Greenwood, José Iria, Fabio Ciravegna:
Saxon: an Extensible Multimedia Annotator. - Michael Kipp:
Spatiotemporal Coding in ANVIL. - Michelina Savino, Laura Scivetti, Mario Refice:
Integrating Audio and Visual Information for Modelling Communicative Behaviours Perceived as Different. - Kazuaki Maeda, Haejoong Lee, Shawn Medero, Julie Medero, Robert Parker, Stephanie M. Strassel:
Annotation Tool Development for Large-Scale Corpus Creation Projects at the Linguistic Data Consortium. - Jana Trojanová, Marek Hrúz, Pavel Campr, Milos Zelezný:
Design and Recording of Czech Audio-Visual Database with Impaired Conditions for Continuous Speech Recognition. - David Llorens, Federico Prat, Andrés Marzal, Juan Miguel Vilar, María José Castro, Juan-Carlos Amengual, Sergio Barrachina, Antonio Castellanos, Salvador España Boquera, J. A. Gómez, Jorge Gorbe-Moya, Albert Gordo, Vicente Palazón, Guillermo Peris, Rafael Ramos-Garijo, Francisco Zamora-Martínez:
The UJIpenchars Database: a Pen-Based Database of Isolated Handwritten Characters. - Emilie Chételat-Pelé, Annelies Braffort:
Sign Language Corpus Annotation: toward a new Methodology. - Philippe Dreuw, Carol Neidle, Vassilis Athitsos, Stan Sclaroff, Hermann Ney:
Benchmark Databases for Video-Based Automatic Sign Language Recognition. - Jan Bungeroth, Daniel Stein, Philippe Dreuw, Hermann Ney, Sara Morrissey, Andy Way, Lynette van Zijl:
The ATIS Sign Language Corpus. - Pavel Campr, Marek Hrúz, Jana Trojanová:
Collection and Preprocessing of Czech Sign Language Corpus for Sign Language Recognition. - Shigeyoshi Kitazawa, Shinya Kiriyama, Tomohiko Kasami, Shogo Ishikawa, Naofumi Otani, Hiroaki Horiuchi, Yoichi Takebayashi:
A Multimodal Infant Behavior Annotation for Developmental Analysis of Demonstrative Expressions. - Yoshiko Arimoto, Sumio Ohno, Hitoshi Iida:
Automatic Emotional Degree Labeling for Speakers' Anger Utterance during Natural Japanese Dialog. - Theodoros Kostoulas, Todor Ganchev, Iosif Mporas, Nikos Fakotakis:
A Real-World Emotional Speech Corpus for Modern Greek. - Theresa Wilson:
Annotating Subjective Content in Meetings.
Session P21 - Tools and Data for Speech Systems Development
- Henk van den Heuvel, Jean-Pierre Martens, Bart D'hoore, Kristof D'hanens, Nanneke Konings:
The AUTONOMATA Spoken Names Corpus. - Briony Williams, Rhys James Jones:
Acquiring Pronunciation Data for a Placenames Lexicon in a Less-Resourced Language. - Reiko Kaji, Hajime Mochizuki:
Constructing a Database of Non-Japanese Pronunciations of Different Japanese Romanizations. - Antoine Laurent, Téva Merlin, Sylvain Meignier, Yannick Estève, Paul Deléglise:
Combined Systems for Automatic Phonetic Transcription of Proper Nouns. - Harald Höge, Zdravko Kacic, Bojan Kotnik, Matej Rojc, Nicolas Moreau, Horst-Udo Hain:
Evaluation of Modules and Tools for Speech Synthesis: the ECESS Framework. - Dafydd Gibbon, Jolanta Bachan:
An Automatic Close Copy Speech Synthesis Tool for Large-Scale Speech Corpus Evaluation. - Stefan Scherer, Petra-Maria Strauß:
A Flexible Wizard of Oz Environment for Rapid Prototyping. - Jindrich Matousek, Daniel Tihelka, Jan Romportl:
Building of a Speech Corpus Optimised for Unit Selection TTS Synthesis. - Luís C. Oliveira, Sérgio Paulo, Luís Figueira, Carlos Mendes, Ana Nunes, Joaquim Godinho:
Methodologies for Designing and Recording Speech Databases for Corpus Based Synthesis. - Alexandre Patry, Philippe Langlais:
MISTRAL: a Statistical Machine Translation Decoder for Speech Recognition Lattices. - Ute Ziegenhain, Hanne Fersøe, Henk van den Heuvel, Asunción Moreno:
LC-STAR II: Starring more Lexica. - Matthias Eck, Stephan Vogel, Alex Waibel:
Communicating Unknown Words in Machine Translation. - Pierrette Bouillon, Sonia Halimi, Yukie Nakao, Kyoko Kanzaki, Hitoshi Isahara, Nikos Tsourakis, Marianne Starlander, Beth Ann Hockey, Manny Rayner:
Developing Non-European Translation Pairs in a Medium-Vocabulary Medical Speech Translation System. - Nadine Perera, Michael Pitz, Manfred Pinkal:
CLIoS: Cross-lingual Induction of Speech Recognition Grammars. - Takahiro Ono, Hitomi Tohyama, Shigeki Matsubara:
Construction and Analysis of Word-level Time-aligned Simultaneous Interpretation Corpus. - Marie-Jean Meurs, Frédéric Duvert, Frédéric Béchet, Fabrice Lefèvre, Renato de Mori:
Semantic Frame Annotation on the French MEDIA corpus. - Nick Webb, Ting Liu, Mark Hepple, Yorick Wilks:
Cross-Domain Dialogue Act Tagging. - Nikos Tsourakis, Maria Georgescul, Pierrette Bouillon, Manny Rayner:
Building Mobile Spoken Dialogue Applications Using Regulus. - Christian Raymond, Kepa Joseba Rodríguez, Giuseppe Riccardi:
Active Annotation in the LUNA Italian Corpus of Spontaneous Dialogues. - Stefan Hahn, Patrick Lehnen, Christian Raymond, Hermann Ney:
A Comparison of Various Methods for Concept Tagging for Spoken Language Understanding. - Stéphane Huet, Guillaume Gravier, Pascale Sébillot:
Morphosyntactic Resources for Automatic Speech Recognition.
Session P22 - Speech Corpus in Various Environments
- Nicolás Morales, Javier Tejedor, Javier Garrido Salas, José Colás, Doroteo T. Toledano:
rre STC-TIMIT: Generation of a Single-channel Telephone Corpus. - Eric Sanders, Asunción Moreno, Herbert S. Tropf, Lynette Melnar, Nurit Dekel, Breanna Gillies, Niklas Paulsson:
LILA: Cellular Telephone Speech Databases from Asia. - Grazyna Demenko, Stefan Grocholewski, Katarzyna Klessa, Jerzy Ogórkiewicz, Agnieszka Wagner, Marek Lange, Daniel Sledzinski, Natalia Cylwik:
JURISDIC: Polish Speech Database for Taking Dictation of Legal Texts. - Tomas Dekens, Yorgos Patsis, Werner Verhelst, Frédéric Beaugendre, François Capman:
A Multi-sensor Speech Database with Applications towards Robust Speech Processing in hostile Environments. - Isabel Trancoso, Rui Martins, Helena Moniz, Ana Isabel Mata, Céu Viana:
The LECTRA Corpus - Classroom Lecture Transcriptions in European Portuguese. - Florian Schiel, Christian Heinrich, Sabine Barfüßer, Thomas Gilg:
ALC: Alcohol Language Corpus. - Rubén Fernández Pozo, Luis A. Hernández Gómez, Eduardo López Gonzalo, José Alcázar Ramírez, Guillermo Portillo, Doroteo T. Toledano:
Design of a Multimodal Database for Research on Automatic Detection of Severe Apnoea Cases. - Tomoyosi Akiba, Kiyoaki Aikawa, Yoshiaki Itoh, Tatsuya Kawahara, Hiroaki Nanjo, Hiromitsu Nishizaki, Norihito Yasuda, Yoichi Yamashita, Katunobu Itou:
Test Collections for Spoken Document Retrieval from Lecture Audio Data. - Akira Ozaki, Sunao Hara, Takashi Kusakawa, Chiyomi Miyajima, Takanori Nishino, Norihide Kitaoka, Katunobu Itou, Kazuya Takeda:
In-car Speech Data Collection along with Various Multimodal Signals. - Masatoshi Tsuchiya, Satoru Kogure, Hiromitsu Nishizaki, Kengo Ohta, Seiichi Nakagawa:
Developing Corpus of Japanese Classroom Lecture Speech Contents. - Konrad Hofbauer, Stefan Petrik, Horst Hering:
The ATCOSIM Corpus of Non-Prompted Clean Air Traffic Control Speech. - Thomas Winkler, Theodoros Kostoulas, Richard Adderley, Christian Bonkowski, Todor Ganchev, Joachim Köhler, Nikos Fakotakis:
The MoveOn Motorcycle Speech Corpus. - Stavros Ntalampiras, Ilyas Potamitis, Todor Ganchev, Nikos Fakotakis:
Audio Database in Support of Potentiel Threat and Crisis Situation Management. - Martine Garnier-Rizet, Gilles Adda, Frédérik Cailliau, Jean-Luc Gauvain, Sylvie Guillemin-Lanne, Lori Lamel, Stephan Vanni, Claire Waast-Richard:
CallSurf: Automatic Transcription, Indexing and Structuration of Call Center Conversational Speech for Knowledge Extraction and Query by Content. - Djamel Mostefa, Arnaud Vallée:
New Telephone Speech Databases for French: a Children Database and an optimized Adult Corpus.
Session P23 - Speech Corpus in Various Languages
- Krzysztof Marasek, Ryszard Gubrynowicz:
Design and Data Collection for Spoken Polish Dialogs Database. - Fabíola Santos, Tiago Freitas:
CORP-ORAL: Spontaneous Speech Corpus for European Portuguese. - Tiit Hennoste, Olga Gerassimenko, Riina Kasterpalu, Mare Koit, Andriela Rääbis, Krista Strandson:
From Human Communication to Intelligent User Interfaces: Corpora of Spoken Estonian. - Rudolf Muhr:
The Pronouncing Dictionary of Austrian German (AGPD) and the Austrian Phonetic Database (ADABA): Report on a large Phonetic Resources Database of the three Major Varieties of German. - Caren Brinckmann, Stefan Kleiner, Ralf Knöbl, Nina Berend:
German Today: a really extensive Corpus of Spoken Standard German. - Antonio Bonafonte, Jordi Adell, Ignasi Esquerra, Silvia Gallego, Asunción Moreno, Javier Pérez:
Corpus and Voices for Catalan Speech Synthesis. - Martine Adda-Decker, Thomas Pellegrini, Éric Bilinski, Gilles Adda:
Developments of "Lëtzebuergesch" Resources for Automatic Speech Processing and Linguistic Studies.
Session P24 - Speech Corpus Design Methodology and Tools
- Rena Nemoto, Ioana Vasilescu, Martine Adda-Decker:
Speech Errors on Frequently Observed Homophones in French: Perceptual Evaluation vs Automatic Classification. - Hiroki Yamazaki, Keisuke Kitamura, Takashi Harada, Seiichi Yamamoto:
Creation of Learner Corpus and Its Application to Speech Recognition. - Jean-Yves Antoine, Abdenour Mokrane, Nathalie Friburger:
Automatic Rich Annotation of Large Corpus of Conversational transcribed speech: the Chunking Task of the EPAC Project. - Thierry Bazillon, Yannick Estève, Daniel Luzzati:
Manual vs Assisted Transcription of Prepared and Spontaneous Speech. - Antonio Moreno-Sandoval, Doroteo Torre Toledano, Raùl de la Torre, Marta Garrote Salazar, José María Guirao:
Developing a Phonemic and Syllabic Frequency Inventory for Spontaneous Spoken Castilian Spanish and their Comparison to Text-Based Inventories. - Petr Pollák, Jan Volín, Radek Skarnitzl:
Phone Segmentation Tool with Integrated Pronunciation Lexicon and Czech Phonetically Labelled Reference Database. - Victoria Bobicev, Tatiana Zidrasco:
Estimating Word Phonosemantics. - Joachim Gasch, Caren Brinckmann, Sylvia Dickgießer:
memasysco: XML schema based metadata management system for speech corpora. - Jonathan Chevelu, Nelly Barbot, Olivier Boëffard, Arnaud Delhay:
Comparing Set-Covering Strategies for Optimal Corpus Design. - Pierre Lanchantin, Andrew C. Morris, Xavier Rodet, Christophe Veaux:
Automatic Phoneme Segmentation with Relaxed Textual Constraints. - Christophe Veaux, Grégory Beller, Xavier Rodet:
IrcamCorpusTools: an Extensible Platform for Spoken Corpora Exploitation. - Erin Fitzgerald, Frederick Jelinek:
Linguistic Resources for Reconstructing Spontaneous Speech Text. - Maarten Janssen, Tiago Freitas:
Spock - a Spoken Corpus Client. - Viktor Trón:
On the Durational Reduction of Repeated Mentions: Recency and Speaker Effects.
Session P25 - Morphology and Morphosyntax
- Florian Koehler, Hinrich Schütze, Michaela Atterer:
A Question Answering System for German. Experiments with Morphological Linguistic Resources. - Bruno Cartoni:
Lexical Resources for Automatic Translation of Constructed Neologisms: the Case Study of Relational Adjectives. - Yasuharu Den, Junpei Nakamura, Toshinobu Ogiso, Hideki Ogura:
A Proper Approach to Japanese Morphological Analysis: Dictionary, Model, and Evaluation. - Reut Tsarfaty, Yoav Goldberg:
Word-Based or Morpheme-Based? Annotation Strategies for Modern Hebrew Clitics. - Sonja Bosch, Laurette Pretorius, Kholisa Podile, Axel Fleisch:
Experimental Fast-Tracking of Morphological Analysers for Nguni Languages. - Nikola Ljubesic, Tomislava Lauc, Damir Boras:
Generating a Morphological Lexicon of Organization Entity Names. - Serge Sharoff, Mikhail Kopotev, Tomaz Erjavec, Anna Feldman, Dagmar Divjak:
Designing and Evaluating a Russian Tagset. - Karel Pala, Lukás Svoboda, Pavel Smerk:
Czech MWE Database. - Nizar Habash, Ryan Roth:
Identification of Naturally Occurring Numerical Expressions in Arabic. - Shisanu Tongchim, Randolf Altmeyer, Virach Sornlertlamvanich, Hitoshi Isahara:
A Dependency Parser for Thai. - Mehrnoush Shamsfard, Hakimeh Fadaei:
A Hybrid Morphology-Based POS Tagger for Persian. - Baskaran Sankaran, Kalika Bali, Monojit Choudhury, Tanmoy Bhattacharya, Pushpak Bhattacharyya, Girish Nath Jha, S. Rajendran, K. Saravanan, L. Sobha, Karumuri V. Subbarao:
A Common Parts-of-Speech Tagset Framework for Indian Languages.
Session P26 - Semantics, Semantic Resources and Semantic Annotation
- Rajat Kumar Mohanty, Pushpak Bhattacharyya:
Lexical Resources for Semantics Extraction. - Alain Joubert, Mathieu Lafourcade:
Evolutionary Basic Notions for a Thematic Representation of General Knowledge. - Ya-Min Chou, Chu-Ren Huang, Jia-Fei Hong:
The Extended Architecture of Hantology for Japan Kanji. - Petya Osenova, Kiril Ivanov Simov, Eelco Mossel:
Language Resources for Semantic Document Annotation and Crosslingual Retrieval. - Sanaz Jabbari, Ben Allison, Louise Guthrie:
Using a Probabilistic Model of Context to Detect Word Obfuscation. - Sara Tonelli, Emanuele Pianta:
Frame Information Transfer from English to Italian. - Jordi Carrera, Irene Castellón, Salvador Climent, Marta Coll-Florit:
Towards Spanish Verbs' Selectional Preferences Automatic Acquisition: Semantic Annotation of the SenSem Corpus. - Paula Cristina Vaz, David Martins de Matos, Nuno J. Mamede:
Using Lexical Acquisition to Enrich a Predicate Argument Reusable Database. - Chris Reed, Raquel Mochales Palau, Glenn Rowe, Marie-Francine Moens:
Language Resources for Studying Argument. - Cosmin Adrian Bejan, Sanda M. Harabagiu:
A Linguistic Resource for Discovering Event Structures and Resolving Event Coreference. - Kyoko Ohara:
Lexicon, Grammar, and Multilinguality in the Japanese FrameNet. - Nilda Ruimy, Antonio Toral:
More Semantic Links in the SIMPLE-CLIPS Database. - Riccardo Del Gratta, Nilda Ruimy, Antonio Toral:
Simple-Clips ongoing research: more information with less data by implementing inheritance. - Brian Davis, Siegfried Handschuh, Alexander Troussov, John Judge, Mikhail Sogrin:
Linguistically Light Lexical Extensions for Ontologies.
Session P27 - Temporal Annotation
- Stéphanie Weiser, Philippe Laublet, Jean-Luc Minel:
Automatic Identification of Temporal Information in Tourism Web Pages. - Sebastian Gottwald, Matthias Richter, Gerhard Heyer, Gerik Scheuermann:
Tapping Huge Temporally Indexed Textual Resources with WCTAnalyze. - Ineke Schuurman:
Spatiotemporal Annotation Using MiniSTEx: how to deal with Alternative, Foreign, Vague and/or Obsolete Names? - Maria Teresa Vicente-Díez, Doaa Samy, Paloma Martínez:
An Empirical Approach to a Preliminary Successful Identification and Resolution of Temporal Expressions in Spanish News Corpora. - Georgiana Puscasu, Verginica Barbu Mititelu:
Annotation of WordNet Verbs with TimeML Event Classes.
Session P28 - Multilinguality and Machine Translation
- Vincent Claveau:
Automatic Translation of Biomedical Terms by Supervised Machine Learning. - Toni Badia, Maite Melero, Oriol Valentín:
Rapid Deployment of a New METIS Language Pair: Catalan-English. - Vincent Vandeghinste, Peter Dirix, Ineke Schuurman, Stella Markantonatou, Sokratis Sofianopoulos, Marina Vassiliou, Olga Yannoutsou, Toni Badia, Maite Melero, Gemma Boleda, Michael Carl, Paul Schmidt:
Evaluation of a Machine Translation System for Low Resource Languages: METIS-II. - Marta R. Costa-jussà, José A. R. Fonollosa, Enric Monte:
Using Reordering in Statistical Machine Translation based on Alignment Block Classification. - Janne Bondi Johannessen, Torbjørn Nordgård, Lars Nygaard:
Evaluation of Linguistics-Based Translation. - Yujie Zhang, Zhulong Wang, Kiyotaka Uchimoto, Qing Ma, Hitoshi Isahara:
Word Alignment Annotation in a Japanese-Chinese Parallel Corpus. - Qing Ma, Koichi Nakao, Masaki Murata, Hitoshi Isahara:
Selection of Japanese-English Equivalents by Integrating High-quality Corpora and Huge Amounts of Web Data. - Beáta Megyesi, Bengt Dahlqvist, Eva Pettersson, Joakim Nivre:
Swedish-Turkish Parallel Treebank. - Julia S. Trushkina, Lieve Macken, Hans Paulussen:
Sentence Alignment in DPC: Maximizing Precision, Minimizing Human Effort. - Hiroyuki Kaji, Shin'ichi Tamamura, Dashtseren Erdenebat:
Automatic Construction of a Japanese-Chinese Dictionary via English. - Kathrin Spreyer, Jonas Kuhn, Bettina Schrader:
Identification of Comparable Argument-Head Relations in Parallel Corpora. - Svitlana Kurella, Serge Sharoff, Anthony Hartley:
Corpus-Based Tools for Computer-Assisted Acquisition of Reading Abilities in Cognate Languages. - Jörg Tiedemann:
Synchronizing Translated Movie Subtitles. - Takeshi Abekawa, Kyo Kageura:
Constructing a Corpus that Indicates Patterns of Modification between Draft and Final Translations by Human Translators. - Violaine Prince, Jacques Chauché:
Building a Bilingual Representation of the Roget Thesaurus for French to English Machine Translation. - Luka Nerima, Eric Wehrli:
Generating Bilingual Dictionaries by Transitivity. - Jean Tavernier, Rosa Cowan, Michelle Vanni:
Holy Moses! Leveraging Existing Tools and Resources for Entity Translation. - Christian Monson, Ariadna Font Llitjós, Vamshi Ambati, Lori S. Levin, Alon Lavie, Alison Alvarez, Roberto Aranovich, Jaime G. Carbonell, Robert E. Frederking, Erik Peterson, Katharina Probst:
Linguistic Structure and Bilingual Informants Help Induce Machine Translation of Lesser-Resourced Languages. - Kazuaki Maeda, Xiaoyi Ma, Stephanie M. Strassel:
Creating Sentence-Aligned Parallel Text Corpora from a Large Archive of Potential Parallel Text using BITS and Champollion. - Hitoshi Isahara, Masao Utiyama, Eiko Yamamoto, Akira Terada, Yasunori Abe:
Application of Resource-based Machine Translation to Real Business Scenes. - Wolodja Wentland, Johannes Knopp, Carina Silberer, Matthias Hartung:
Building a Multilingual Lexical Resource for Named Entity Disambiguation, Translation and Transliteration. - Marianna Apidianaki:
Translation-oriented Word Sense Induction Based on Parallel Corpora. - Todor Arnaudov, Ruslan Mitkov:
Smarty - Extendable Framework for Bilingual and Multilingual Comprehension Assistants. - Péter Halácsy, András Kornai, Péter Németh, Dániel Varga:
Parallel Creation of Gigaword Corpora for Medium Density Languages - an Interim Report. - Reginald L. Hobbs, Jamal Laoudi, Clare R. Voss:
MTriage: Web-enabled Software for the Creation, Machine Translation, and Annotation of Smart Documents. - Clare R. Voss, Jamal Laoudi, Jeffrey Micher:
Exploitation of an Arabic Language Resource for Machine Translation Evaluation: using Buckwalter-based Lookup Tool to Augment CMU Alignment Algorithm. - Oana Frunza:
A Trainable Tokenizer, solution for multilingual texts and compound expression tokenization. - Karine Megerdoomian, Dan Parvaz:
Low-Density Language Bootstrapping: the Case of Tajiki Persian.
Session P29 - Semantic Resources and their Elicitation
- Lothar Lemnitzer, Holger Wunsch, Piklu Gupta:
Enriching GermaNet with verb-noun relations - a case study of lexical acquisition. - Diana Santos, Maria do Rosário Silva, Susana Inácio:
What's in a Colour? Studying and Contrasting Colours with COMPARA. - Beata Trawinski, Jan-Philipp Soehn:
A Multilingual Database of Polarity Items. - Ernesto William De Luca, Birte Lönneker-Rodman:
Integrating Metaphor Information into RDF/OWL EuroWordNet. - Richard Johansson, Pierre Nugues:
Comparing Dependency and Constituent Syntax for Frame-semantic Analysis. - Juan Aparicio, Mariona Taulé, Maria Antònia Martí:
AnCora-Verb: A Lexical Resource for the Semantic Annotation of Corpora. - Davide Buscaldi, Paolo Rosso:
Geo-WordNet: Automatic Georeferencing of WordNet. - Mario Crespo Miguel, Paul Buitelaar:
Domain-Specific English-To-Spanish Translation of FrameNet. - Hagen Fürstenau:
Enriching Frame Semantic Resources with Dependency Graphs. - Bento Carlos Dias-da-Silva, Ariani Di Felippo, Maria das Graças Volpe Nunes:
The Automatic Mapping of Princeton WordNet Lexical-Conceptual Relations onto the Brazilian Portuguese WordNet Database. - Roser Morante:
Semantic Role Labeling Tools Trained on the Cast3LB-CoNNL-SemRol Corpus. - Evi Marzelou, Maria Zourari, Voula Giouli, Stelios Piperidis:
Building a Greek corpus for Textual Entailment. - Kyoko Kanzaki, Francis Bond, Noriko Tomuro, Hitoshi Isahara:
Extraction of Attribute Concepts from Japanese Adjectives. - Adriana Roventini, Nilda Ruimy:
Mapping Events and Abstract Entities from PAROLE-SIMPLE-CLIPS to ItalWordNet. - Davide Picca, Alfio Massimiliano Gliozzo, Massimiliano Ciaramita:
Supersense Tagger for Italian. - Maria Teresa Pazienza, Armando Stellato:
Clustering of Terms from Translation Dictionaries and Synonyms Lists to Automatically Build more Structured Linguistic Resources. - Stephan Walter:
Linguistic Description and Automatic Extraction of Definitions from German Court Decisions. - Veronika Vincze, György Szarvas, Attila Almási, Dóra Szauter, Róbert Ormándi, Richárd Farkas, Csaba Hatvani, János Csirik:
Hungarian Word-Sense Disambiguated Corpus. - Olga N. Lashevskaja, Olga Yu. Shemanaeva:
Semantic Annotation Layer in Russian National Corpus: Lexical Classes of Nouns and Adjectives. - Mohamed Attia, Mohsen A. Rashwan, Ahmed Ragheb, Mohamed Al-Badrashiny, Husein Al-Basoumy:
A Compact Arabic Lexical Semantics Language Resource Based on the Theory of Semantic Fields. - Doaa Samy, Ana González-Ledesma:
Pragmatic Annotation of Discourse Markers in a Multilingual Parallel Corpus (Arabic- Spanish-English). - Patcharee Varasai, Chaveevan Pechsiri, Thana Sukvaree, Vee Satayamas, Asanee Kawtrakul:
Building an Annotated Corpus for Text Summarization and Question Answering.
Session P30 -Sentiment and Opinion Analysis
- Jonas Sjöbergh, Kenji Araki:
A Multi-Lingual Dictionary of Dirty Words. - Jonas Sjöbergh, Kenji Araki:
What is poorly Said is a Little Funny. - Yves Bestgen:
Building Affective Lexicons from Specific Corpora for Automatic Sentiment Analysis. - Ruifeng Xu, Yunqing Xia, Kam-Fai Wong, Wenjie Li:
Opinion Annotation in On-line Chinese Product Reviews. - Xiwen Cheng, Feiyu Xu:
Fine-grained Opinion Topic and Polarity Identification. - Kugatsu Sadamitsu, Satoshi Sekine, Mikio Yamamoto:
Sentiment Analysis Based on Probabilistic Models Using Inter-Sentence Information. - Marco Guerini, Carlo Strapparava, Oliviero Stock:
Valentino: A Tool for Valence Shifting of Natural Language Texts.
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.