default search action
7th LREC 2010: Valletta, Malta
- Nicoletta Calzolari, Khalid Choukri, Bente Maegaard, Joseph Mariani, Jan Odijk, Stelios Piperidis, Mike Rosner, Daniel Tapias:
Proceedings of the International Conference on Language Resources and Evaluation, LREC 2010, 17-23 May 2010, Valletta, Malta. European Language Resources Association 2010, ISBN 2-9517408-6-7
Session O1 - Semantic Acquisition
- Fabienne Fritzinger, Frank Richter, Marion Weller:
Pattern-Based Extraction of Negative Polarity Items from Dependency-Parsed Text. - Luca Dini, Giampaolo Mazzini:
The Impact of Grammar Enhancement on Semantic Resources Induction. - Alessandro Lenci, Martina Johnson, Gabriella Lapesa:
Building an Italian FrameNet through Semi-automatic Corpus Analysis. - Claire Mouton, Gaël de Chalendar, Benoît Richert:
FrameNet Translation Using Bilingual Dictionaries with Evaluation on the English-French Pair. - Paul Cook, Suzanne Stevenson:
Automatically Identifying Changes in the Semantic Orientation of Words.
Session O2 - LR Infrastructures and Standards
- Lars Borin, Markus Forsberg, Dimitrios Kokkinakis:
Diabase: Towards a Diachronic BLARK in Support of Historical Studies. - Daan Broeder, Marc Kemps-Snijders, Dieter Van Uytvanck, Menzo Windhouwer, Peter Withers, Peter Wittenburg, Claus Zinn:
A Data Category Registry- and Component-based Metadata Framework. - Jan Odijk:
The CLARIN-NL Project. - Samuel Cruz-Lara, Gil Francopoulo, Laurent Romary, Nasredine Semmar:
MLIF : A Metamodel to Represent and Exchange Multilingual Textual Information. - Peter Wittenburg, Núria Bel, Lars Borin, Gerhard Budin, Nicoletta Calzolari, Eva Hajicová, Kimmo Koskenniemi, Lothar Lemnitzer, Bente Maegaard, Maciej Piasecki, Jean-Marie Pierrel, Stelios Piperidis, Inguna Skadina, Dan Tufis, Remco van Veenendaal, Tamás Váradi, Martin Wynne:
Resource and Service Centres as the Backbone for a Sustainable Service Infrastructure.
Session O3 - Dialogue and Evaluation
- Susan Robinson, Antonio Roque, David R. Traum:
Dialogues in Context: An Objective User-Oriented Evaluation Approach for Virtual Human Dialogue. - Joshua B. Gordon, Rebecca J. Passonneau:
An Evaluation Framework for Natural Language Understanding in Spoken Dialogue Systems. - Sunao Hara, Norihide Kitaoka, Kazuya Takeda:
Estimation Method of User Satisfaction Using N-gram-based Dialog History Model for Spoken Dialog System. - Nick Webb, David Benyon, Preben Hansen, Oli H. Mival:
Evaluating Human-Machine Conversation for Appropriateness. - Svetlana Stoyanchev, Paul Piwek:
Constructing the CODA Corpus: A Parallel Corpus of Monologues and Expository Dialogues.
Session O4 - Text-to-Speech Corpora
- Didier Cadic, Cédric Boidin, Christophe d'Alessandro:
Towards Optimal TTS Corpora. - Michael Pucher, Friedrich Neubarth, Volker Strom, Sylvia Moosmüller, Gregor Hofer, Christian Kranzler, Gudrun Schuchmann, Dietmar Schabus:
Resources for Speech Synthesis of Viennese Varieties. - Pavel A. Skrelin, Nina B. Volskaya, Daniil Kocharov, Karina Evgrafova, Olga Glotova, Vera Evdokimova:
A Fully Annotated Corpus of Russian Speech. - Francisco Campillo Díaz, Daniela Braga, Ana Belén Mourín, Carmen García-Mateo, Pedro Silva, Miguel Sales Dias, Francisco Méndez Pazó:
Building High Quality Databases for Minority Languages such as Galician. - Alexandros Lazaridis, Theodoros Kostoulas, Todor Ganchev, Iosif Mporas, Nikos Fakotakis:
Vergina: A Modern Greek Speech Database for Speech Synthesis.
Session O5 - Knowledge Discovery
- Danica Damljanovic, Milan Agatonovic, Hamish Cunningham:
Identification of the Question Focus: Combining Syntactic Analysis and Ontology-based Lookup through the User Interaction. - Paul McNamee, Hoa Trang Dang, Heather Simpson, Patrick Schone, Stephanie M. Strassel:
An Evaluation of Technologies for Knowledge Base Population. - Eneko Agirre, Montse Cuadros, German Rigau, Aitor Soroa:
Exploring Knowledge Bases for Similarity. - Francesca Fallucchi, Maria Teresa Pazienza, Fabio Massimo Zanzotto:
Generic Ontology Learners on Application Domains. - Jorge Vivaldi, Horacio Rodríguez:
Finding Domain Terms using Wikipedia.
Session O6 - Temporal and Spatial Annotation - Special Session
- James Pustejovsky, Kiyong Lee, Harry Bunt, Laurent Romary:
ISO-TimeML: An International Standard for Semantic Annotation. - Leon Derczynski, Robert J. Gaizauskas:
Analysing Temporally Annotated Corpora with CAVaT. - Naushad UzZaman, James F. Allen:
TRIOS-TimeBank Corpus: Extended TimeBank Corpus with Help of Deep Understanding of Text. - Parisa Kordjamshidi, Martijn van Otterlo, Marie-Francine Moens:
Spatial Role Labeling: Task Definition and Annotation Scheme.
Session O7 - Evaluation Methodologies
- Jerid Francom, Amy LaCross, Adam Ussishkin:
How Specialized are Specialized Corpora? Behavioral Evaluation of Corpus Representativeness for Maltese. - Yoshinobu Kano, Rubén Dorado, Luke McCrohon, Sophia Ananiadou, Jun'ichi Tsujii:
U-Compare: An Integrated Language Resource Evaluation Platform Including a Comprehensive UIMA Resource Library. - Haïfa Zargayouna, Adeline Nazarenko:
Evaluation of Textual Knowledge Acquisition Tools: a Challenging Task. - K. Bretonnel Cohen, Christophe Roeder, William A. Baumgartner Jr., Lawrence Hunter, Karin Verspoor:
Test Suite Design for Biomedical Ontology Concept Recognition Systems. - Ondrej Bojar, Adam Liska, Zdenek Zabokrtský:
Evaluating Utility of Data Sources in a Large Parallel Czech-English Corpus CzEng 0.9.
Session O8 - Sign Language
- Annelies Braffort, Laurence Bolot, Emilie Chételat-Pelé, Annick Choisier, Maxime Delorme, Michael Filhol, Jérémie Segouat, Cyril Verrecchia, Flora Badin, Nadège Devos:
Sign Language Corpora for Analysis, Processing and Evaluation. - Onno Crasborn:
The Sign Linguistics Corpora Network: Towards Standards for Signed Language Resources. - Kyle Duarte, Sylvie Gibet:
Heterogeneous Data Sources for Signed Language Analysis and Synthesis: The SignCom Project. - Antonio Balvet, Cyril Courtin, Dominique Boutet, Christian Cuxac, Ivani Fusellier-Souza, Brigitte Garcia, Marie-Thérèse L'Huillier, Marie-Anne Sallandre:
The Creagest Project: a Digitized and Annotated Corpus for French Sign Language (LSF) and Natural Gestural Languages. - Philippe Dreuw, Hermann Ney, Gregorio Martínez Pérez, Onno Crasborn, Justus H. Piater, Jose Miguel Moya, Mark Wheatley:
The SignSpeak Project - Bridging the Gap Between Signers and Speakers.
Session O9 - Anaphora, Coreference
- Massimo Poesio, Olga Uryupina, Yannick Versley:
Creating a Coreference Resolution System for Italian. - Arndt Riester, David Lorenz, Nina Seemann:
A Recursive Annotation Scheme for Referential Information Status. - Tommaso Caselli, Irina Prodanof:
Annotating Event Anaphora: A Case Study.
Session O10 - Machine Translation
- Sherri L. Condon, Dan Parvaz, John S. Aberdeen, Christy Doran, Andrew Freeman, Marwan Awad:
Evaluation of Machine Translation Errors in English and Iraqi Arabic. - Jörg Tiedemann:
Lingua-Align: An Experimental Toolbox for Automatic Tree-to-Tree Alignment. - Maria Holmqvist:
Heuristic Word Alignment with Parallel Phrases. - Sylwia Ozdowska, Vincent Claveau:
Inferring Syntactic Rules for Word Alignment through Inductive Logic Programming.
Session O11 - Authoring Tools and Text Analysis
- Jennifer Pedler, Roger Mitton:
A Large List of Confusion Sets for Spellchecking Assessed Against a Corpus of Real-word Errors. - Na-Rae Han, Joel R. Tetreault, Soo-Hwa Lee, Jin-Young Ha:
Using an Error-Annotated Learner Corpus to Develop an ESL/EFL Error Correction System. - Alberto Barrón-Cedeño, Martin Potthast, Paolo Rosso, Benno Stein:
Corpus and Evaluation Measures for Automatic Plagiarism Detection. - Philip van Oosten, Dries Tanghe, Véronique Hoste:
Towards an Improved Methodology for Automated Readability Prediction.
Session O12 - Parsing
- Danielle Ben-Gera, Yi Zhang, Valia Kordoni:
Semantic Feature Engineering for Enhancing Disambiguation Performance in Deep Linguistic Processing. - Jordi Atserias, Giuseppe Attardi, Maria Simi, Hugo Zaragoza:
Active Learning for Building a Corpus of Questions for Parsing. - Eckhard Bick:
FrAG, a Hybrid Constraint Grammar Parser for French. - Elaine Uí Dhonnchadha, Josef van Genabith:
Partial Dependency Parsing for Irish.
Session O13 - Ontologies
- Marta Tatu, Dan I. Moldovan:
Inducing Ontologies from Folksonomies using Natural Language Understanding. - Vivi Nastase, Michael Strube, Benjamin Börschinger, Cäcilia Zirn, Anas Elghafari:
WikiNet: A Very Large Scale Multi-Lingual Concept Network. - Gosse Bouma:
Cross-lingual Ontology Alignment using EuroWordNet and Wikipedia. - Matthias Hartung, Anette Frank:
A Semi-supervised Type-based Classification of Adjectives: Distinguishing Properties and Relations.
Session O14 - Terminology, Corpus and Lexicon
- Sylviane Cardey, Krzysztof Bogacki, Xavier Blanco, Ruslan Mitkov:
Resources for Controlled Languages for Alert Messages and Protocols in the European Perspective. - Klaar Vanopstal, Bart Desmet, Véronique Hoste:
Towards a Learning Approach for Abbreviation Detection and Resolution. - Bruno Cartoni, Pierre Zweigenbaum:
Semi-Automated Extension of a Specialized Medical Lexicon for French. - Rogelio Nazar, Maarten Janssen:
Combining Resources: Taxonomy Extraction from Multiple Dictionaries.
Session O15 - Trends in Speech Databases
- Toomas Altosaar, Louis ten Bosch, Guillaume Aimetti, Christos Koniaris, Kris Demuynck, Henk van den Heuvel:
A Speech Corpus for Modeling Language Acquisition: CAREGIVER. - Florian Schiel:
BAStat : New Statistical Resources at the Bavarian Archive for Speech Signals. - Kseniya Zablotskaya, Steffen Walter, Wolfgang Minker:
Speech Data Corpus for Verbal Intelligence Estimation. - Janne Bondi Johannessen, Kristin Hagen, Anders Nøklestad, Joel Priestley:
Enhancing Language Resources with Maps.
Session O16 - LRs: Infrastructures and Strategies
- Christopher Cieri, Mark Liberman:
Adapting to Trends in Language Resource Development: A Progress Report on LDC Activities. - Victoria Arranz, Khalid Choukri:
ELRA's Services 15 Years on...Sharing and Anticipating the Community. - Nicoletta Calzolari, Claudia Soria:
Preparing the field for an Open Resource Infrastructure: the role of the FLaReNet Network of Excellence. - Jonathan H. Clark, Alon Lavie:
LoonyBin: Keeping Language Technologists Sane through Automated Management of Experimental (Hyper)Workflows. - Zhiyi Song, Stephanie M. Strassel, Gary Krug, Kazuaki Maeda:
Enhanced Infrastructure for Creation and Collection of Translation Resources.
Session O17 - Opinion Mining and Emotions
- Lun-Wei Ku, Ting-Hao (Kenneth) Huang, Hsin-Hsi Chen:
Construction of a Chinese Opinion Treebank. - Alexander Pak, Patrick Paroubek:
Twitter as a Corpus for Sentiment Analysis and Opinion Mining. - Isa Maks, Piek Vossen:
Annotation Scheme and Gold Standard for Dutch Subjective Adjectives. - Matthieu Vernier, Laura Monceaux, Béatrice Daille:
Learning Subjectivity Phrases missing from Resources through a Large Set of Semantic Tests. - Carlo Strapparava, Marco Guerini, Oliviero Stock:
Predicting Persuasiveness in Political Discourses.
Session O18 - Information Extraction
- Yassine Benajiba, Imed Zitouni:
Arabic Word Segmentation for Better Unit of Analysis. - Xabier Saralegi, Maddalen Lopez de Lacalle:
Dictionary and Monolingual Corpus-based Query Translation for Basque-English CLIR. - Jana Straková, Pavel Pecina:
Czech Information Retrieval with Syntax-based Language Models. - Lukas Michelbacher, Florian Laws, Beate Dorow, Ulrich Heid, Hinrich Schütze:
Building a Cross-lingual Relatedness Thesaurus using a Graph Similarity Measure. - Walid Magdy, Jinming Min, Johannes Leveling, Gareth J. F. Jones:
Building a Domain-specific Document Collection for Evaluating Metadata Effects on Information Retrieval.
Session O19 - Semantics
- Torsten Zesch, Iryna Gurevych:
The More the Better? Assessing the Influence of Wikipedia's Growth on Semantic Relatedness Measures. - Sabine Schulte im Walde:
Comparing Computational Models of Selectional Preferences - Second-order Co-Occurrence vs. Latent Semantic Clusters. - Daisuke Kawahara, Sadao Kurohashi:
Acquiring Reliable Predicate-argument Structures from Raw Corpora for Case Frame Compilation. - Ziqi Zhang, Anna Lisa Gentile, Lei Xia, José Iria, Sam Chapman:
A Random Graph Walk based Approach to Computing Semantic Relatedness Using Knowledge from Wikipedia. - Kathrin Baker, Michael Bloodgood, Bonnie J. Dorr, Nathaniel Wesley Filardo, Lori S. Levin, Christine D. Piatko:
A Modality Lexicon and its use in Automatic Tagging.
Session O20 - Discourse Annotation and Parsing
- Nathanael Chambers, Daniel Jurafsky:
A Database of Narrative Schemas. - Markus Egg, Gisela Redeker:
How Complex is Discourse Structure? - Bonaventura Coppola, Alessandro Moschitti:
A General Purpose FrameNet-based Shallow Semantic Parser. - Daniel M. Cer, Marie-Catherine de Marneffe, Daniel Jurafsky, Christopher D. Manning:
Parsing to Stanford Dependencies: Trade-offs between Speed and Accuracy.
Session O21 - Emotion, Sentiment
- Alexander Schmitt, Gregor Bertrand, Tobias Heinroth, Wolfgang Minker, Jackson Liscombe:
WITcHCRafT: A Workbench for Intelligent exploraTion of Human ComputeR conversaTions. - Ulli Waltinger:
GermanPolarityClues: A Lexical Resource for German Sentiment Analysis. - Björn W. Schuller, Riccardo Zaccarelli, Nicolas Rollet, Laurence Devillers:
CINEMO - A French Spoken Language Resource for Complex Emotions: Facts and Baselines. - Gregor Bertrand, Florian Nothdurft, Steffen Walter, Andreas Scheck, Henrik Kessler, Wolfgang Minker:
Towards Investigating Effective Affective Dialogue Strategies.
Session O22 - Corpus Building, Annotation and Methodology
- Martin Volk, Noah Bubenhofer, Adrian Althaus, Maya Bangerter, Lenz Furrer, Beni Ruef:
Challenges in Building a Multilingual Alpine Heritage Corpus. - Marc Carmen, Paul Felt, Robbie Haertel, Deryle Lonsdale, Peter McClanahan, Owen Merkling, Eric K. Ringger, Kevin D. Seppi:
Tag Dictionaries Accelerate Manual Annotation. - Dan Flickinger, Stephan Oepen, Gisle Ytrestøl:
WikiWoods: Syntacto-Semantic Annotation for English Wikipedia. - Hai Zhao, Yan Song, Chunyu Kit:
How Large a Corpus Do We Need: Statistical Method Versus Rule-based Method.
Session O23 - Broadcast News
- Luis Javier Rodríguez-Fuentes, Mikel Peñagarikano, Germán Bordel, Amparo Varona, Mireia Díez:
KALAKA: A TV Broadcast Speech Database for the Evaluation of Language Recognition Systems. - Yannick Estève, Thierry Bazillon, Jean-Yves Antoine, Frédéric Béchet, Jérôme Farinas:
The EPAC Corpus: Manual and Automatic Annotations of Conversational Speech in French Broadcast News. - Kwanchiva Saykham, Ananlada Chotimongkol, Chai Wutiwiwatchai:
Online Temporal Language Model Adaptation for a Thai Broadcast News Transcription System. - Doris Baum, Daniel Schneider, Rolf Bardeli, Jochen Schwenninger, Barbara Samlowski, Thomas Winkler, Joachim Köhler:
DiSCo - A German Evaluation Corpus for Challenging Problems in the Broadcast Domain.
Session O24 - Machine Translation
- Vamshi Ambati, Stephan Vogel, Jaime G. Carbonell:
Active Learning and Crowd-Sourcing for Machine Translation. - Sara Stymne, Lars Ahrenberg:
Using a Grammar Checker for Evaluation and Postprocessing of Statistical Machine Translation. - Hiroyuki Kaji, Takashi Tsunakawa, Daisuke Okada:
Using Comparable Corpora to Adapt a Translation Model to Domains. - Xuansong Li, Niyu Ge, Stephen Grimes, Stephanie M. Strassel, Kazuaki Maeda:
Enriching Word Alignment with Linguistic Tags. - Sisay Adugna, Andreas Eisele:
English - Oromo Machine Translation: An Experiment Using a Statistical Approach.
Session O25 - Emotion, Sentiment - Special Session
- Stefano Baccianella, Andrea Esuli, Fabrizio Sebastiani:
SentiWordNet 3.0: An Enhanced Lexical Resource for Sentiment Analysis and Opinion Mining. - Mátyás Brendel, Riccardo Zaccarelli, Laurence Devillers:
Building a System for Emotions Detection from Speech to Control an Affective Avatar. - Martijn Goudbeek, Mirjam Broersma:
The Demo / Kemo Corpus: A Principled Approach to the Study of Cross-cultural Differences in the Vocal Expression and Perception of Emotion. - Alexandra Balahur, Ralf Steinberger, Mijail A. Kabadjov, Vanni Zavarella, Erik Van der Goot, Matina Halkia, Bruno Pouliquen, Jenya Belyaeva:
Sentiment Analysis in the News.
Session O26 - Corpus Tools
- Dekang Lin, Kenneth Ward Church, Heng Ji, Satoshi Sekine, David Yarowsky, Shane Bergsma, Kailash Patil, Emily Pitler, Rachel Lathbury, Vikram Rao, Kapil Dalwani, Sushant Narsale:
New Tools for Web-Scale N-grams. - Verena Henrich, Erhard W. Hinrichs:
GernEdiT - The GermaNet Editing Tool. - Véronika Lux-Pogodalla, Dominique Besagni, Karën Fort:
FastKwic, an "Intelligent" Concordancer Using FASTR. - Giuseppe Attardi, Stefano Dei Rossi, Giulia Di Pietro, Alessandro Lenci, Simonetta Montemagni, Maria Simi:
A Resource and Tool for Super-sense Tagging of Italian Texts. - Richard Schwarz, Hinrich Schütze, Fabienne Martin, Achim Stein:
Identification of Rare & Novel Senses Using Translations in a Parallel Corpus.
Session O27 - Lexicon, Morphology
- Johannes Handl, Carsten Weber:
A Multilayered Declarative Approach to Cope with Morphotactics and Allomorphy in Derivational Morphology. - Helena Blancafort:
Learning Morphology of Romance, Germanic and Slavic Languages with the Tool Linguistica. - Núria Gala, Véronique Rey, Michael Zock:
A Tool for Linking Stems and Conceptual Fragments to Enhance word Access. - Patrice Lopez, Laurent Romary:
GRISP: A Massive Multilingual Terminological Database for Scientific and Technical Domains. - Wauter Bosma, Piek Vossen:
Bootstrapping Language Neutral Term Extraction.
Session O28 - Syntax and Semantics
- Ineke Schuurman, Véronique Hoste, Paola Monachesi:
Interacting Semantic Layers of Annotation in SoNaR, a Reference Corpus of Contemporary Written Dutch. - Anne Vilnat, Patrick Paroubek, Éric Villemonte de la Clergerie, Gil Francopoulo, Marie-Laure Guénot:
PASSAGE Syntactic Representation: a Minimal Common Ground for Evaluation. - Sara Rosenthal, William Lipovsky, Kathleen R. McKeown, Kapil Thadani, Jacob Andreas:
Towards Semi-Automated Annotation for Prepositional Phrase Attachment. - Max Jakob, Markéta Lopatková, Valia Kordoni:
Mapping between Dependency Structures and Compositional Semantic Representations.
Session O29 - Metadata
- Raheel Nawaz, Paul Thompson, John McNaught, Sophia Ananiadou:
Meta-Knowledge Annotation of Bio-Events. - Christopher Cieri, Khalid Choukri, Nicoletta Calzolari, D. Terence Langendoen, Johannes Leveling, Martha Palmer, Nancy Ide, James Pustejovsky:
A Road Map for Interoperable Language Resource Metadata. - Josef Ruppenhofer, Caroline Sporleder, Fabian Shirokov:
Speaker Attribution in Cabinet Protocols. - Katrin Tomanek, Udo Hahn:
Annotation Time Stamps - Temporal Metadata from the Linguistic Annotation Process.
Session O30 - Tagging
- Markus Dickinson, Charles Jochim:
Evaluating Distributional Properties of Tagsets. - Kais Dukes, Nizar Habash:
Morphological Annotation of Quranic Arabic. - Emad Mohamed, Sandra Kübler:
Arabic Part of Speech Tagging. - Tomaz Erjavec:
MULTEXT-East Version 4: Multilingual Morphosyntactic Specifications, Lexicons and Corpora.
Session O31 - Multimodal Annotation
- Harry Bunt, Jan Alexandersson, Jean Carletta, Jae-Woong Choe, Alex Chengyu Fang, Kôiti Hasida, Kiyong Lee, Volha Petukhova, Andrei Popescu-Belis, Laurent Romary, Claudia Soria, David R. Traum:
Towards an ISO Standard for Dialogue Act Annotation. - Volha Petukhova, Harry Bunt:
Towards an Integrated Scheme for Semantic Annotation of Multimodal Dialogue Data. - Pierre Tirilly, Vincent Claveau, Patrick Gros:
News Image Annotation on a Large Parallel Text-image Corpus. - Isabella Poggi, Francesca D'Errico, Laura Vincze:
Types of Nods. The Polysemy of a Social Signal.
Session O32 - Lexicon
- Núria Bel:
Handling of Missing Values in Lexical Acquisition. - Josef Ruppenhofer, Jonas Sunde, Manfred Pinkal:
Generating FrameNets of Various Granularities: The FrameNet Transformer. - Benoît Sagot:
The Lefff, a Freely Available and Large-coverage Morphological and Syntactic Lexicon for French. - Diego De Cao, Danilo Croce, Roberto Basili:
Extensive Evaluation of a FrameNet-WordNet mapping resource.
Session O33 - Question Answering
- Guillaume Bernard, Sophie Rosset, Martine Adda-Decker, Olivier Galibert:
A Question-answer Distance Measure to Investigate QA System Progress. - Peter Adolphs, Xiwen Cheng, Tina Klüwer, Hans Uszkoreit, Feiyu Xu:
Question Answering Biographic Information and Social Network Powered by the Semantic Web. - Nicolas Moreau, Olivier Hamon, Djamel Mostefa, Sophie Rosset, Olivier Galibert, Lori Lamel, Jordi Turmo, Pere Comas, Paolo Rosso, Davide Buscaldi, Khalid Choukri:
Evaluation Protocol and Tools for Question-Answering on Speech Transcripts. - Pamela Forner, Danilo Giampiccolo, Bernardo Magnini, Anselmo Peñas, Álvaro Rodrigo, Richard F. E. Sutcliffe:
Evaluating Multilingual Question Answering Systems at CLEF.
Session O34 - Endangered Languages
- Lene Antonsen, Trond Trosterud, Linda Wiechetek:
Reusing Grammatical Resources for New Languages. - Fei Xia, Carrie Lewis, William D. Lewis:
The Problems of Language Identification within Hugely Multilingual Data Sets. - Eniko Héja:
The Role of Parallel Corpora in Bilingual Lexicography. - Cheikh M. Bamba Dione, Jonas Kuhn, Sina Zarrieß:
Design and Development of Part-of-Speech-Tagging Resources for Wolof (Niger-Congo, spoken in Senegal).
Session O35 - Disordered Speech Corpus
- Oscar Saz, Eduardo Lleida, Carlos Vaquero, William Ricardo Rodríguez:
The Alborada-I3A Corpus of Disordered Speech. - Jakob Schou Pedersen, Lars Bo Larsen:
A Speech Corpus for Dyslexic Reading Training. - Caroline Williams, Andrew Thwaites, Paula Buttery, Jeroen Geertzen, Billi Randall, Meredith A. Shafto, Barry Devereux, Lorraine K. Tyler:
The Cambridge Cookie-Theft Corpus: A Corpus of Directed and Spontaneous Speech of Brain-Damaged Patients and Healthy Individuals. - Cécile Fougeron, Lise Crevier-Buchman, Corinne Fredouille, Alain Ghio, Christine Meunier, Claude Chevrie-Muller, Jean-François Bonastre, Antonia Colazo-Simon, Céline De Looze, Danielle Duez, Cédric Gendrot, Thierry Legou, Nathalie Lévêque, Claire Pillot-Loiseau, Serge Pinto, Gilles Pouchoulin, Danièle Robert, Jacqueline Vaissière, François Viallet, Coralie Vincent:
The DesPho-APaDy Project: Developing an Acoustic-phonetic Characterization of Dysarthric Speech in French.
Session O36 - National and International projects
- Marina B. Ruiter, Toni C. M. Rietveld, Catia Cucchiarini, Emiel Krahmer, Helmer Strik:
Human Language Technology and Communicative Disabilities: Requirements and Possibilities for the Future. - Aditi Sharma Grover, Gerhard B. Van Huyssteen:
The South African Human Language Technologies Audit. - Swaran Lata, Somnath Chandra Vijay Kumar:
Development of Linguistic Resources and Tools for Providing Multilingual Solutions in Indian Languages - A Report on National Initiative. - Peter Spyns, Elisabeth D'Halleweyn:
Flemish-Dutch HLT Policy: Evolving to New Forms of Collaboration. - Bente Maegaard, Mohamed Attia, Khalid Choukri, Olivier Hamon, Steven Krauwer, Mustafa Yaseen:
Cooperation for Arabic Language Resources and Tools - The MEDAR Project.
Session O37 - Machine Translation
- Andreas Eisele, Yu Chen:
MultiUN: A Multilingual Corpus from United Nation Documents. - Chi-kiu Lo, Dekai Wu:
Evaluating Machine Translation Utility via Semantic Role Labels. - William D. Lewis, Chris Wendt, David Bullock:
Achieving Domain Specificity in SMT without Overt Siloing. - Billy Tak-Ming Wong:
Semantic Evaluation of Machine Translation. - David Guthrie, Mark Hepple, Wei Liu:
Efficient Minimal Perfect Hash Language Models.
Session O38 - Corpus Tools
- Ting Qian, Kristy Hollingshead, Su-Youn Yoon, Kyoung-Young Kim, Richard Sproat:
A Python Toolkit for Universal Transliteration. - Sowmya V. B., Monojit Choudhury, Kalika Bali, Tirthankar Dasgupta, Anupam Basu:
Resource Creation for Training and Testing of Transliteration Systems for Indian Languages. - Fabienne Fritzinger, Marion Weller, Ulrich Heid:
A Survey of Idiomatic Preposition-Noun-Verb Triples on Token Level. - Meghan Lammie Glenn, Stephanie M. Strassel, Haejoong Lee, Kazuaki Maeda, Ramez Zakhary, Xuansong Li:
Transcription Methods for Consistency, Volume and Efficiency. - Muhammad Kamran Malik, Tafseer Ahmed, Sebastian Sulger, Tina Bögel, Atif Gulzar, Ghulam Raza, Sarmad Hussain, Miriam Butt:
Transliterating Urdu for a Broad-Coverage Urdu/Hindi LFG Grammar.
Session O39 - Information Extraction
- Ralph Grishman:
The Impact of Task and Corpus on Event Extraction Systems. - Darja Fiser, Senja Pollak, Spela Vintar:
Learning to Mine Definitions from Slovene Structured and Unstructured Knowledge-Rich Resources. - Silvana Marianela Bernaola Biggio, Manuela Speranza, Roberto Zanoli:
Entity Mention Detection using a Combination of Redundancy-Driven Classifiers. - Klaar Vanopstal, Robert Vander Stichele, Godelieve Laureys, Joost Buysschaert:
Assessing the Impact of English Language Skills and Education Level on PubMed Searches by Dutch-speaking Users. - André Blessing, Hinrich Schütze:
Fine-Grained Geographical Relation Extraction from Wikipedia.
Session O40 - Ontologies
- Ekaterina Ovchinnikova, Laure Vieu, Alessandro Oltramari, Stefano Borgo, Theodore Alexandrov:
Data-Driven and Ontological Analysis of FrameNet for Natural Language Reasoning. - Hans-Ulrich Krieger:
A General Methodology for Equipping Ontologies with Time. - Dan Tufis, Dan Stefanescu:
A Differential Semantics Approach to the Annotation of Synsets in WordNet. - Bolette Sandford Pedersen, Sanni Nimb, Anna Braasch:
Merging Specialist Taxonomies and Folk Taxonomies in Wordnets - A case Study of Plants, Animals and Foods in the Danish Wordnet. - Mithun Balakrishna, Dan I. Moldovan, Marta Tatu, Marian Olteanu:
Semi-Automatic Domain Ontology Creation from Text Resources.
Session O41 - Multiword Expressions and Collocations
- Marion Weller, Ulrich Heid:
Extraction of German Multiword Expressions from Parsed Corpora Using Context Features. - Stefania Spina:
The Dictionary of Italian Collocations: Design and Integration in an Online Learning Environment. - Margarita Alonso Ramos, Leo Wanner, Orsolya Vincze, Gerard Casamayor del Bosque, Nancy Vázquez Veiga, Estela Mosqueira Suárez, Sabela Prieto González:
Towards a Motivated Annotation Schema of Collocation Errors in Learner Corpora. - Ulrich Heid, Fabienne Fritzinger, Erhard W. Hinrichs, Marie Hinrichs, Thomas Zastrow:
Term and Collocation Extraction by Means of Complex Linguistic Web Services. - Francesca Bonin, Felice Dell'Orletta, Simonetta Montemagni, Giulia Venturi:
A Contrastive Approach to Multi-word Extraction from Domain-specific Corpora.
Session O42 - Word Sense Disambiguation
- Amal Zouaq, Michel Gagnon, Benoît Ozell:
Can Syntactic and Logical Graphs help Word Sense Disambiguation? - Susan Windisch Brown, Travis Rood, Martha Palmer:
Number or Nuance: Which Factors Restrict Reliable Word Sense Annotation? - Rebecca J. Passonneau, Ansaf Salleb-Aouissi, Vikas Bhardwaj, Nancy Ide:
Word Sense Annotation of Polysemous Words by Multiple Annotators. - Sanaz Jabbari, Mark Hepple, Louise Guthrie:
Evaluating Lexical Substitution: Analysis and New Measures. - Ekaterina Shutova, Simone Teufel:
Metaphor Corpus Annotated for Source - Target Domain Mappings.
Session O43 - Speech Corpus Processing
- Philippe Blache, Roxane Bertrand, Mathilde Guardiola, Marie-Laure Guénot, Christine Meunier, Irina Nesterenko, Berthille Pallaud, Laurent Prévot, Béatrice Priego-Valverde, Stéphane Rauzy:
The OTIM Formal Annotation Model: A Preliminary Step before Annotation Scheme. - Grégory Senay, Georges Linarès, Benjamin Lecouteux, Stanislas Oger, Thierry Michel:
Transcriber Driving Strategies for Transcription Aid System. - Rena Nemoto, Martine Adda-Decker, Jacques Durand:
Word Boundaries in French: Evidence from Large Speech Corpora. - Christina Leitner, Martin Schickbichler, Stefan Petrik:
Example-Based Automatic Phonetic Transcription. - Brigitte Bigi, Christine Meunier, Irina Nesterenko, Roxane Bertrand:
Automatic Detection of Syllable Boundaries in Spontaneous Speech.
Session O44 - Web Services
- Arif Bramantoro, Ulrich Schäfer, Toru Ishida:
Towards an Integrated Architecture for Composite Language Servicesand Multiple Linguistic Processing Components. - Marta Villegas, Núria Bel, Santiago Bel, Víctor Rodríguez-Doncel:
A Case Study on Interoperability for Language Resources and Applications. - Nancy Ide, Keith Suderman, Brian Simms:
ANC2Go: A Web Application for Customized Corpus Creation. - Yohei Murakami, Donghui Lin, Masahiro Tanaka, Takao Nakaguchi, Toru Ishida:
Language Service Management with the Language Grid. - Jennifer DeCamp:
Language Technology Resource Center.
Session O45 - Textual Entailment and Question Answering
- Louise Deléger, Pierre Zweigenbaum:
Identifying Paraphrases between Technical and Lay Corpora. - Luisa Bentivogli, Elena Cabrio, Ido Dagan, Danilo Giampiccolo, Medea Lo Leggio, Bernardo Magnini:
Building Textual Entailment Specialized Data Sets: a Methodology for Isolating Linguistic Phenomena Relevant to Inference. - Milen Kouylekov, Yashar Mehdad, Matteo Negri:
Mining Wikipedia for Large-scale Repositories of Context-Sensitive Entailment Rules. - Daniel Sonntag, Bogdan Sacaleanu:
Speech Grammars for Textual Entailment Patterns in Multimodal Question Answering. - Anne Garcia-Fernandez, Sophie Rosset, Anne Vilnat:
MACAQ : A Multi Annotated Corpus to Study how we Adapt Answers to Various Questions.
Session O46 - Discourse Annotation
- Silvia Pareti, Irina Prodanof:
Annotating Attribution Relations: Towards an Italian Discourse Treebank. - Charles Teissèdre, Delphine Battistelli, Jean-Luc Minel:
Resources for Calendar Expressions Semantic Tagging and Temporal Navigation through Texts. - Stergos D. Afantenos, Pascal Denis, Philippe Muller, Laurence Danlos:
Learning Recursive Segments for Discourse Parsing. - Gerlof Bouma, Lilja Øvrelid, Jonas Kuhn:
Towards a Large Parallel Corpus of Cleft Constructions. - Livio Robaldo, Eleni Miltsakaki, Alessia Bianchini:
Corpus-based Semantics of Concession: Where do Expectations Come from?
Session O47 - Named Entity Recognition
- Mark Arehart:
Indexing Methods for Faster and More Effective Person Name Search. - Asif Ekbal, Sriparna Saha:
Maximum Entropy Classifier Ensembling using Genetic Algorithm for NER in Bengali. - Mohammed Attia, Antonio Toral, Lamia Tounsi, Monica Monachini, Josef van Genabith:
An Automatically Built Named Entity Lexicon for Arabic. - Agata Savary, Jakub Waszczuk, Adam Przepiórkowski:
Towards the Annotation of Named Entities in the National Corpus of Polish. - Cláudia Freitas, Cristina Mota, Diana Santos, Hugo Gonçalo Oliveira, Paula Carvalho:
Second HAREM: Advancing the State of the Art of Named Entity Recognition in Portuguese.
Session P1 - Anaphora, Coreference and Evaluation
- Ruud Koolen, Emiel Krahmer:
The D-TUNA Corpus: A Dutch Dataset for the Evaluation of Referring Expression Generation Algorithms. - Azad Abad, Luisa Bentivogli, Ido Dagan, Danilo Giampiccolo, Shachar Mirkin, Emanuele Pianta, Asher Stern:
A Resource for Investigating the Impact of Anaphora and Coreference on Inference. - Cristina Nicolae, Gabriel Nicolae, Kirk Roberts:
C-3: Coherence and Coreference Corpus. - Claudiu Mihaila, Iustina Ilisei, Diana Inkpen:
Romanian Zero Pronoun Distribution: A Comparative Study. - Marta Recasens, Eduard H. Hovy, Maria Antònia Martí:
A Typology of Near-Identity Relations for Coreference (NIDENT). - Kepa Joseba Rodríguez, Francesca Delogu, Yannick Versley, Egon Stemle, Massimo Poesio:
Anaphoric Annotation of Wikipedia and Blogs in the Live Memories Corpus. - Samuel Broscheit, Simone Paolo Ponzetto, Yannick Versley, Massimo Poesio:
Extending BART to Provide a Coreference Resolution System for German. - Jirí Mírovský, Petr Pajas, Anna Nedoluzhko:
Annotation Tool for Extended Textual Coreference and Bridging Anaphora. - Petya Osenova, Laska Laskova, Kiril Ivanov Simov:
Exploring Co-Reference Chains for Concept Annotation of Domain Texts. - Heather Simpson, Stephanie M. Strassel, Robert Parker, Paul McNamee:
Wikipedia and the Web of Confusable Entities: Experience from Entity Linking Query Creation for TAC 2009 Knowledge Base Population.
Session P2 - Tools, Systems and Evaluation
- Athanasios Karasimos, Evanthia Petropoulou:
A Crash Test with Linguistica in Modern Greek: The Case of Derivational Affixes and Bound Stems. - Anil Kumar Singh, Bharat Ram Ambati:
An Integrated Digital Tool for Accessing Language Resources. - Paul Felt, Owen Merkling, Marc Carmen, Eric K. Ringger, Warren Lemmon, Kevin D. Seppi, Robbie Haertel:
CCASH: A Web Application Framework for Efficient, Distributed Language Resource Development. - Rüdiger Gleim, Alexander Mehler:
Computational Linguistics for Mere Mortals - Powerful but Easy-to-use Linguistic Processing for Scientists in the Humanities. - Bernd Bohnet, Leo Wanner:
Open Soucre Graph Transducer Interpreter and Grammar Development Environment. - Federico Sangati, Willem H. Zuidema, Rens Bod:
Efficiently Extract Recurring Tree Fragments from Large Treebanks. - José João Almeida, André Santos, Alberto Simões:
Bigorna -- A Toolkit for Orthography Migration Challenges. - Carl Christensen, Ross Hendrickson, Deryle Lonsdale:
Principled Construction of Elicited Imitation Tests. - Jan Jona Javorsek, Tomaz Erjavec:
Experimental Deployment of a Grid Virtual Organization for Human Language Technologies. - Peter Nabende:
Applying a Dynamic Bayesian Network Framework to Transliteration Identification.
Session P3 - Lexical Resources
- Adrien Lardilleux, Julien Gosme, Yves Lepage:
Bilingual Lexicon Induction: Effortless Evaluation of Word Alignment Tools and Production of Resources for Improbable Language Pairs. - Akira Utsumi:
Exploring the Relationship between Semantic Spaces and Semantic Relations. - C. Anton Rytting, Paul Rodrigues, Tim Buckwalter, David M. Zajic, Bridget Hirsch, Jeff Carnes, Nathanael Lynn, Sarah C. Wayland, Chris Taylor, Jason White, Charles C. Blake, Evelyn Browne, Corey Miller, Tristan Purvis:
Error Correction for Arabic Dictionary Lookup. - Noureddine Loukil, Kais Haddar, Abdelmajid Ben Hamadou:
A Syntactic Lexicon for Arabic Verbs. - Amit Kirschenbaum, Shuly Wintner:
A General Method for Creating a Bilingual Transliteration Dictionary. - Thomas Proisl, Besim Kabashi:
Using High-Quality Resources in NLP: The Valency Dictionary of English as a Resource for Left-Associative Grammars. - Grigori Sidorov, Alberto Barrón-Cedeño, Paolo Rosso:
English-Spanish Large Statistical Dictionary of Inflectional Forms. - Majdi Sawalha, Eric Atwell:
Constructing and Using Broad-coverage Lexical Resource for Enhancing Morphological Analysis of Arabic. - Rania Al-Sabbagh, Roxana Girju:
Mining the Web for the Induction of a Dialectical Arabic Lexicon. - Benoît Sagot, Laurence Danlos, Rosa Stern:
A Lexicon of French Quotation Verbs for Automatic Quotation Extraction. - Benoît Sagot, Géraldine Walther:
A Morphological Lexicon for the Persian Language. - Jana Sindlerová, Ondrej Bojar:
Building a Bilingual ValLex Using Treebank Token Alignment: First Observations. - Óscar Ferrández, Michael Ellsworth, Rafael Muñoz, Collin F. Baker:
Aligning FrameNet and WordNet based on Semantic Neighborhoods. - Anca Dinu:
Building a Generative Lexicon for Romanian. - Hiroaki Sato:
How FrameSQL Shows the Japanese FrameNet Data. - Svetla Koeva:
Lexicon and Grammar in Bulgarian FrameNet. - Bento Carlos Dias-da-Silva, Ariani Di Felippo:
REBECA: Turning WordNet Databases into "Ontolexicons". - Karel Pala, Christiane Fellbaum, Sonja Bosch:
Lexical Resources for Noun Compounds in Czech, English and Zulu. - Michael Gasser:
Expanding the Lexicon for a Resource-Poor Language Using a Morphological Analyzer and a Web Crawler. - Gerard de Melo, Gerhard Weikum:
Providing Multilingual, Multimodal Answers to Lexical Database Queries. - Sabine Ploux, Armelle Boussidan, Hyungsuk Ji:
The Semantic Atlas: an Interactive Model of Lexical Representation.
Session P4 - Web Services
- Adam Funk, Kalina Bontcheva:
Ontology-Based Categorization of Web Services with Machine Learning. - Marie Hinrichs, Thomas Zastrow, Erhard W. Hinrichs:
WebLicht: Web-based LRT Services in a Distributed eScience Infrastructure. - Ulrich Heid, Helmut Schmid, Kerstin Eckart, Erhard W. Hinrichs:
A Corpus Representation Format for Linguistic Web Services: The D-SPIN Text Corpus Format and its Relationship with ISO Standards. - Donghui Lin, Yoshiaki Murakami, Toru Ishida, Yohei Murakami, Masahiro Tanaka:
Composing Human and Machine Translation Services: Language Grid for Improving Localization Processes. - Savas Ali Bora, Yoshihiko Hayashi, Monica Monachini, Claudia Soria, Nicoletta Calzolari:
An LMF-based Web Service for Accessing WordNet-type Semantic Lexicons. - Virach Sornlertlamvanich, Thatsanee Charoenporn, Hitoshi Isahara:
Language Resource Management System for Asian WordNet Collaboration and Its Web Service Application.
Session P5 - Named Entity Recognition
- Rita Marinelli:
Lexical Resources and Ontological Classifications for the Recognition of Proper Names Sense Extension. - Damien Nouvel, Jean-Yves Antoine, Nathalie Friburger, Denis Maurel:
An Analysis of the Performances of the CasEN Named Entities Recognition System in the Ester2 Evaluation Campaign. - Olivier Galibert, Sophie Rosset, Xavier Tannier, Fanny Grandry:
Hybrid Citation Extraction from Patents. - Bart Desmet, Véronique Hoste:
Towards a Balanced Named Entity Corpus for Dutch. - Satoshi Sato, Sayoko Kaide:
A Person-Name Filter for Automatic Compilation of Bilingual Person-Name Lexicons. - Michael A. Tanenblatt, Anni Coden, Igor L. Sominsky:
The ConceptMapper Approach to Named Entity Recognition. - Grzegorz Chrupala, Dietrich Klakow:
A Named Entity Labeler for German: Exploiting Wikipedia and Distributional Clusters. - Keith J. Miller, Sarah McLeod, Elizabeth Schroeder, Mark Arehart, Kenneth Samuel, James Finley, Vanesa Jurica, John Polk:
Improving Personal Name Search in the TIGR System. - Wajdi Zaghouani, Bruno Pouliquen, Mohamed Ebrahim, Ralf Steinberger:
Adapting a resource-light highly multilingual Named Entity Recognition system to Arabic. - Dietrich Rebholz-Schuhmann, Antonio José Jimeno-Yepes, Erik M. van Mulligen, Ning Kang, Jan A. Kors, David Milward, Peter T. Corbett, Ekaterina Buyko, Katrin Tomanek, Elena Beisswanger, Udo Hahn:
The CALBC Silver Standard Corpus for Biomedical Named Entities - A Study in Harmonizing the Contributions from Four Independent Named Entity Taggers. - Ana Cristina Mendes, Luísa Coheur, Paula Vaz Lobo:
Named Entity Recognition in Questions: Towards a Golden Collection.
Session P6 - Pronunciation Variants
- Alexander Schmitt, Tim Polzehl, Wolfgang Minker, Jackson Liscombe:
The Influence of the Utterance Length on the Recognition of Aged Voices. - Nikos Tsourakis, Agnes Lisowska, Manny Rayner, Pierrette Bouillon:
Examining the Effects of Rephrasing User Input on Two Mobile Spoken Language Systems. - Damjan Vlaj, Aleksandra Zögling Markus, Marko Kos, Zdravko Kacic:
Acquisition and Annotation of Slovenian Lombard Speech Database. - Natalie D. Snoeren, Martine Adda-Decker, Gilles Adda:
The Study of Writing Variants in an Under-resourced Language: Some Evidence from Mobile N-Deletion in Luxembourgish. - Jean-Luc Rouas, Mayumi Beppu, Martine Adda-Decker:
Comparison of Spectral Properties of Read, Prepared and Casual Speech in French. - Marijn Schraagen, Gerrit Bloothooft:
Evaluating Repetitions, or how to Improve your Multilingual ASR System by doing Nothing. - Elena Grishina, Svetlana Savchuk, Alexej Poljakov:
Design and Data Collection for the Accentological Corpus of the Russian Language. - Siim Orasmaa, Reina Käärik, Jaak Vilo, Tiit Hennoste:
Information Retrieval of Word Form Variants in Spoken Language Corpora Using Generalized Edit Distance.
Session P7 - Multiword Expressions and Collocations
- Meng Wang, Chu-Ren Huang, Shiwen Yu, Weiwei Sun:
Automatic Acquisition of Chinese Novel Noun Compounds. - Luka Nerima, Eric Wehrli, Violeta Seretan:
A Recursive Treatment of Collocations. - Caroline Sporleder, Linlin Li, Philip Gorinski, Xaver Koch:
Idioms in Context: The IDIX Corpus. - Laura Street, Nathan Michalov, Rachel Silverstein, Michael Reynolds, Lurdes Ruela, Felicia Flowers, Angela Talucci, Priscilla Pereira, Gabriella Morgon, Samantha Siegel, Marci Barousse, Antequa Anderson, Tashom Carroll, Anna Feldman:
Like Finding a Needle in a Haystack: Annotating the American National Corpus for Idiomatic Expressions. - Andrea Zaninello, Malvina Nissim:
Creation of Lexical Resources for a Characterisation of Multiword Expressions in Italian. - Carlos Ramisch, Aline Villavicencio, Christian Boitet:
mwetoolkit: a Framework for Multiword Expression Identification. - Junko Kubo, Keita Tsuji, Shigeo Sugimoto:
Automatic Term Recognition Based on the Statistical Differences of Relative Frequencies in Different Corpora.
Session P8 - Validation of Language Resources
- Claire Gardent, Alejandra Lorenzo:
Identifying Sources of Weakness in Syntactic Lexicon Extraction. - Bharat Ram Ambati, Mridul Gupta, Samar Husain, Dipti Misra Sharma:
A High Recall Error Identification Tool for Hindi Treebank Validation.
Session P9 - Grammar and Syntax
- Anne Abeillé, Danièle Godard:
The Grande Grammaire du Français Project. - Marina Lloberes, Irene Castellón, Lluís Padró:
Spanish FreeLing Dependency Grammar. - Montserrat Marimon:
The Spanish Resource Grammar.
Session P10 - Morphology
- Gertrud Faaß, Ulrich Heid, Helmut Schmid:
Design and Application of a Gold Standard for Morphological Analysis: SMOR as an Example of Morphological Evaluation. - Niraj Aswani, Robert J. Gaizauskas:
Developing Morphological Analysers for South Asian Languages: Experimenting with the Hindi and Gujarati Languages. - Cvetana Krstev, Ranka Stankovic, Dusko Vitas:
A Description of Morphological Features of Serbian: a Revision using Feature System Declaration. - Çagri Çöltekin:
A Freely Available Morphological Analyzer for Turkish. - Iñaki Alegria, Garbiñe Aranbarri, Klara Ceberio, Gorka Labaka, Bittor Laskurain, Ruben Urizar:
A Morphological Processor Based on Foma for Biscayan (a Basque dialect). - Yugo Murawaki, Sadao Kurohashi:
Online Japanese Unknown Morpheme Detection using Orthographic Variation. - Bruno Cartoni, Marie-Aude Lefer:
The MuLeXFoR Database: Representing Word-Formation Processes in a Multilingual Lexicographic Environment. - Ting-Hao (Kenneth) Huang, Lun-Wei Ku, Hsin-Hsi Chen:
Predicting Morphological Types of Chinese Bi-Character Words by Machine Learning Approaches. - Mohamed Altantawy, Nizar Habash, Owen Rambow, Ibrahim Saleh:
Morphological Analysis and Generation of Arabic Nouns: A Morphemic Functional Approach. - Mehrnoush Shamsfard, Hoda Sadat Jafari, Mahdi Ilbeygi:
STeP-1: A Set of Fundamental Tools for Persian Text Processing. - Sara Tonelli, Emanuele Pianta, Rodolfo Delmonte, Michele Brunelli:
VenPro: A Morphological Analyzer for Venetan.
Session P11 - Tools for Multimodal Corpus
- Nick Campbell, Akiko Tabata:
A Software Toolkit for Viewing Annotated Multimodal Data Interactively over the Web. - Nick Webb, David Benyon, Jay Bradley, Preben Hansen, Oli H. Mival:
Wizard of Oz Experiments for a Companion Dialogue System: Eliciting Companionable Conversation. - Volker Fritzsch, Stefan Scherer, Friedhelm Schwenker:
An Open Source Process Engine Framework for Realtime Pattern Recognition and Information Fusion Tasks. - Jens Allwood, Harald Hammarström, Andries Hendrikse, Mtholeni N. Ngcobo, Nozibele Nomdebevana, Laurette Pretorius, Mac van der Merwe:
Work on Spoken (Multimodal) Language Corpora in South Africa. - Eric Auer, Albert Russel, Han Sloetjes, Peter Wittenburg, Oliver Schreer, S. Masnieri, Daniel Schneider, Sebastian Tschöpel:
ELAN as Flexible Annotation Framework for Sound and Image Processing Detectors.
Session P12 - Language Resource Infrastructures
- Claus Zinn, Peter Wittenburg, Jacquelijn Ringersma:
An Evolving eScience Environment for Research Data in Linguistics. - Dieter Van Uytvanck, Claus Zinn, Daan Broeder, Peter Wittenburg, Mariano Gardellini:
Virtual Language Observatory: The Portal to the Language Resources and Technology Universe. - Adam Kilgarriff, Siva Reddy, Jan Pomikálek, P. V. S. Avinesh:
A Corpus Factory for Many Languages. - Erhard W. Hinrichs, Verena Henrich, Thomas Zastrow:
Sustainability of Linguistic Data and Analysis in the Context of a Collaborative eScience Environment. - Armando Stellato, Heiko Stoermer, Stefano Bortoli, Noemi Scarpato, Andrea Turbati, Paolo Bouquet, Maria Teresa Pazienza:
Maskkot - An Entity-centric Annotation Platform. - Maite Melero, Gemma Boleda, Montse Cuadros, Cristina España-Bonet, Lluís Padró, Martí Quixal, Carlos Rodríguez Penagos, Roser Saurí:
Language Technology Challenges of a 'Small' Language (Catalan). - Lluís Padró, Miquel Collado, Samuel Reese, Marina Lloberes, Irene Castellón:
FreeLing 2.1: Five Years of Open-source Language Processing Tools. - Bartosz Broda, Michal Marcinczuk, Maciej Piasecki:
Building a Node of the Accessible Language Technology Infrastructure. - Peter Menke, Alexander Mehler:
The Ariadne System: A Flexible and Extensible Framework for the Modeling and Storage of Experimental Data in the Humanities. - Nicoletta Calzolari, Claudia Soria, Riccardo Del Gratta, Sara Goggi, Valeria Quochi, Irene Russo, Khalid Choukri, Joseph Mariani, Stelios Piperidis:
The LREC Map of Language Resources and Technologies. - Nick Rizzolo, Dan Roth:
Learning Based Java for Rapid Development of NLP Systems. - Kazuaki Maeda, Haejoong Lee, Stephen Grimes, Jonathan Wright, Robert Parker, David Lee, Andrea Mazzucchi:
Technical Infrastructure at Linguistic Data Consortium: Software and Hardware Resources for Linguistic Data Creation. - Thepchai Supnithi, Taneth Ruangrajitpakorn, Kanokorn Trakultaweekool, Peerachet Porkaew:
AutoTagTCG : A Framework for Automatic Thai CG Tagging. - Javier Couto, Helena Blancafort, Somara Seng, Nicolas Kuchmann-Beauger, Anass Talby, Claude de Loupy:
OAL: A NLP Architecture to Improve the Development of Linguistic Resources for NLP. - Girish Nath Jha:
The TDIL Program and the Indian Langauge Corpora Intitiative (ILCI). - Stephanie M. Strassel, Dan Adams, Henry Goldberg, Jonathan Herr, Ron Keesing, Daniel Oblinger, Heather Simpson, Robert Schrag, Jonathan Wright:
The DARPA Machine Reading Program - Encouraging Linguistic and Reasoning Research with a Series of Reading Tasks. - Adam Przepiórkowski, Rafal L. Górski, Marek Lazinski, Piotr Pezik:
Recent Developments in the National Corpus of Polish. - Drahomíra "johanka" Spoustová, Miroslav Spousta, Pavel Pecina:
Building a Web Corpus of Czech. - Brigitte Jörg, Hans Uszkoreit, Alastair Burt:
LT World: Ontology and Reference Information Portal.
Session P13 - Subjectivity: Sentiments, Emotions, Opinions
- Vassiliki Rentoumi, Stefanos Petrakis, Manfred Klenner, George A. Vouros, Vangelis Karkaletsis:
United we Stand: Improving Sentiment Analysis by Joining Machine Learning and Rule Based Methods. - Plaban Kumar Bhowmick, Anupam Basu, Pabitra Mitra:
Determining Reliability of Subjective and Multi-label Emotion Annotation through Novel Fuzzy Agreement Measure. - Aleksander Wawer:
Is Sentiment a Property of Synsets? Evaluating Resources for Sentiment Classification using Machine Learning. - Patrick Paroubek, Alexander Pak, Djamel Mostefa:
Annotations for Opinion Mining Evaluation in the Industrial Context of the DOXA project. - Huan-An Kao, Hsin-Hsi Chen:
Comment Extraction from Blog Posts and Its Applications to Opinion Mining. - Sophia Yat Mei Lee, Ying Chen, Shoushan Li, Chu-Ren Huang:
Emotion Cause Events: Corpus Construction and Analysis. - Horacio Saggion, Adam Funk:
Interpreting SentiWordNet for Opinion Classification. - Polina Panicheva, John Cardiff, Paolo Rosso:
Personal Sense and Idiolect: Combining Authorship Attribution and Opinion Analysis. - Antonio Reyes, Martin Potthast, Paolo Rosso, Benno Stein:
Evaluating Humour Features on Web Comments. - Shu Zhang, Wen-Jie Jia, Yingju Xia, Yao Meng, Hao Yu:
Extracting Product Features and Sentiments from Chinese Customer Reviews. - Changqin Quan, Fuji Ren:
Automatic Annotation of Word Emotion in Sentences Based on Ren-CECps. - Bal Krishna Bal, Patrick Saint-Dizier:
Towards Building Annotated Resources for Analyzing Opinions and Argumentation in News Editorials. - Irene Russo:
Discovering Polarity for Ambiguous and Objective Adjectives through Adverbial Modification. - Zeljko Agic, Nikola Ljubesic, Marko Tadic:
Towards Sentiment Analysis of Financial Texts in Croatian. - Robert Remus, Uwe Quasthoff, Gerhard Heyer:
SentiWS - A Publicly Available German-language Resource for Sentiment Analysis. - Stefan Scherer, Ingo Siegert, Lutz Bigalke, Sascha Meudt:
Developing an Expressive Speech Labeling Tool Incorporating the Temporal Characteristics of Emotion.
Session P14 - Word Sense Disambiguation and Evaluation
- Kyota Tsutsumida, Jun Okamoto, Shun Ishizaki, Makoto Nakatsuji, Akimichi Tanaka, Tadasu Uchiyama:
Study of Word Sense Disambiguation System that uses Contextual Features - Approach of Combining Associative Concept Dictionary and Corpus -. - Jun Okamoto, Shun Ishizaki:
Homographic Ideogram Understanding Using Contextual Dynamic Network. - Christian Scheible:
An Evaluation of Predicate Argument Clustering using Pseudo-Disambiguation. - Lubomír Otrusina, Pavel Smrz:
A New Approach to Pseudoword Generation. - Myriam Rakho, Matthieu Constant:
Evaluating the Impact of Some Linguistic Information on the Performances of a Similarity-based and Translation-oriented Word-Sense Disambiguation Method. - Ines Rehbein, Josef Ruppenhofer:
There's no Data like More Data? Revisiting the Impact of Data Size on a Classification Task. - Egoitz Laparra, German Rigau:
eXtended WordFrameNet. - Attila Görög, Piek Vossen:
Computer Assisted Semantic Annotation in the DutchSemCor Project.
Session P15 - Metadata and Digital Libraries
- Shunsuke Kozawa, Hitomi Tohyama, Kiyotaka Uchimoto, Shigeki Matsubara:
Collection of Usage Information for Language Resources from Academic Articles. - Cristina Vertan:
Towards the Integration of Language Tools Within Historical Digital Libraries. - Alistair Willis, David King, David R. Morse, Anton Dil, Chris Lyal, Dave Roberts:
From XML to XML: The Why and How of Making the Biodiversity Literature Accessible to Researchers. - Manuela Sassi, Gabriella Pardelli, Stefania Biagioni, Carlo Carlesi, Sara Goggi:
A Digital Archive of Research Papers in Computer Science.
Session P16 - Part-of-Speech Tagging
- Yan Zhao, Gertjan van Noord:
POS Multi-tagging Based on Combined Models. - Mahdi Mohseni, Behrouz Minaei-Bidgoli:
A Persian Part-Of-Speech Tagger Based on Morphological Analysis. - Majdi Sawalha, Eric Atwell:
Fine-Grain Morphological Analyzer and Part-of-Speech Tagger for Arabic Text. - Claire Brierley, Eric Atwell:
ProPOSEC: A Prosody and PoS Annotated Spoken English Corpus. - Boris Haselbach, Ulrich Heid:
The Development of a Morphosyntactic Tagset for Afrikaans and its Use with Statistical Tagging. - Jirka Hana, Anna Feldman:
A Positional Tagset for Russian.
Session P17 - Semantic Annotation
- Antonio Balvet, Lucie Barque, Rafael Marín:
Building a Lexicon of French Deverbal Nouns from a Semantically Annotated Corpus. - Izaskun Aldezabal, María Jesús Aranzabe, Arantza Díaz de Ilarraza Sánchez, Ainara Estarrona:
Building the Basque PropBank. - Samuel Reese, Gemma Boleda, Montse Cuadros, Lluís Padró, German Rigau:
Wikicorpus: A Word-Sense Disambiguated Multilingual Wikipedia Corpus. - Aina Peris, Mariona Taulé, Gemma Boleda, Horacio Rodríguez:
ADN-Classifier: Automatically Assigning Denotation Types to Nominalizations. - Roser Morante:
Descriptive Analysis of Negation Cues in Biomedical Texts. - Diana Santos, Cristina Mota:
Experiments in Human-computer Cooperation for the Semantic Annotation of Portuguese Corpora. - Magali Sanches Duran, Marcelo Adriano Amâncio, Sandra M. Aluísio:
Assigning Wh-Questions to Verbal Arguments: Annotation Tools Evaluation and Corpus Building. - Stuart Moore, Sabine Buchholz, Anna Korhonen:
Annotating the Enron Email Corpus with Number Senses. - Suguru Matsuyoshi, Megumi Eguchi, Chitose Sao, Koji Murakami, Kentaro Inui, Yuji Matsumoto:
Annotating Event Mentions in Text with Modality, Focus, and Source Information. - Elisabetta Jezek, Valeria Quochi:
Capturing Coercions in Texts: a First Annotation Exercise. - Paula Vaz Lobo, David Martins de Matos:
Fairy Tale Corpus Organization Using Latent Semantic Mapping and an Item-to-item Top-n Recommendation Algorithm.
Session P18 - Corpus and Morphological Annotation
- Antonio Pareja-Lora, Guadalupe Aguado de Cea:
Ontology-based Interoperation of Linguistic Tools for an Improved Lemma Annotation in Spanish. - Kikuo Maekawa, Makoto Yamazaki, Takehiko Maruyama, Masaya Yamaguchi, Hideki Ogura, Wakako Kashino, Toshinobu Ogiso, Hanae Koiso, Yasuharu Den:
Design, Compilation, and Preliminary Analyses of Balanced Corpus of Contemporary Written Japanese. - Bracha Nir, Brian MacWhinney, Shuly Wintner:
A Morphologically-Analyzed CHILDES Corpus of Hebrew. - Jarmila Panevová, Magda Sevcíková:
Annotation of Morphological Meanings of Verbs Revisited. - Seth Kulick, Ann Bies, Mohamed Maamouri:
Consistent and Flexible Integration of Morphological Annotation in the Arabic Treebank.
Session P19 - Applications of Speech Technology
- Justus C. Roux, Pieter Scholtz, Daleen Klop, Claus Povlsen, Bart Jongejan, Asta Magnusdottir:
Incorporating Speech Synthesis in the Development of a Mobile Platform for e-learning. - Alejandro Abejón, Doroteo T. Toledano, Danilo Spada, González Victor, Daniel Hernández López:
A Study of the Influence of Speech Type on Automatic Language Recognition Performance. - Joseph Polifroni, Imre Kiss, Mark Adler:
Bootstrapping Named Entity Extraction for the Creation of Mobile Services. - Jesús Tomás, Alejandro Canovas, Jaime Lloret, Miguel García-Pineda, Jose L. Abad:
Speech Translation in Pedagogical Environment Using Additional Sources of Knowledge. - Koichiro Honda, Tomoyosi Akiba:
Language Modeling Approach for Retrieving Passages in Lecture Audio Data. - Manny Rayner, Pierrette Bouillon, Nikos Tsourakis, Johanna Gerlach, Maria Georgescul, Yukie Nakao, Claudia Baur:
A Multilingual CALL Game Based on Speech Translation. - Iker Luengo, Eva Navas, Igor Odriozola, Ibon Saratxaga, Inmaculada Hernáez, Iñaki Sainz, Daniel Erro:
Modified LTSE-VAD Algorithm for Applications Requiring Reduced Silence Frame Misclassification. - Michal Gishri, Vered Silber-Varod, Ami Moyal:
Lexicon Design for Transcription of Spontaneous Voice Messages. - Kevin Walker, Christopher Caruso, Denise DiPersio:
Large Scale Multilingual Broadcast Data Collection to Support Machine Translation and Distillation Technology Development.
Session P20 - Speech Data Collection
- Line Adde, Torbjørn Svendsen:
NameDat: A Database of English Proper Names Spoken by Native Norwegians. - Felix Burkhardt, Martin Eckert, Wiebke Johannsen, Joachim Stegmann:
A Database of Age and Gender Annotated Telephone Speech. - Patrick Bauer, David Scheler, Tim Fingscheidt:
WTIMIT: The TIMIT Speech Corpus Transmitted Over The 3G AMR Wideband Mobile Network. - Petr Pollák, Josef Rajnoha:
Multi-Channel Database of Spontaneous Czech with Synchronization of Channels Recorded by Independent Devices. - Ian McGraw, Chia-ying Lee, I. Lee Hetherington, Stephanie Seneff, James R. Glass:
Collecting Voices from the Cloud.
Session P21 - Dialogue Evaluation
- Els Lefever, Véronique Hoste:
Construction of a Benchmark Data Set for Cross-lingual Word Sense Disambiguation. - Marianne Laurent, Philippe Bretier, Carole Manquillet:
Ad-hoc Evaluations Along the Lifecycle of Industrial Spoken Dialogue Systems: Heading to Harmonisation? - Xuchen Yao, Pravin Bhutada, Kallirroi Georgila, Kenji Sagae, Ron Artstein, David R. Traum:
Practical Evaluation of Speech Recognizers for Virtual Human Dialogue Systems. - Barbara Plank:
Improved Statistical Measures to Assess Natural Language Parser Performance across Domains. - Carlos D. Martínez-Hinarejos, Vicent Tamarit, José-Miguel Benedí:
Evaluation of HMM-based Models for the Annotation of Unsegmented Dialogue Turns.
Session P22 - Machine Translation and Evaluation
- Hercules Dalianis, Hao-chun Xing, Xin Zhang:
Creating a Reusable English-Chinese Parallel Corpus for Bilingual Dictionary Construction. - Marta R. Costa-jussà, Mireia Farrús, José B. Mariño, José A. R. Fonollosa:
Automatic and Human Evaluation Study of a Rule-based and a Statistical Catalan-Spanish Machine Translation Systems. - Marta R. Costa-jussà, José A. R. Fonollosa:
Using Linear Interpolation and Weighted Reordering Hypotheses in the Moses System. - Maxim Khalilov, José A. R. Fonollosa, Inguna Skadina, Edgars Bralitis, Lauma Pretkalnina:
Towards Improving English-Latvian Translation: A System Comparison and a New Rescoring Feature. - Yanli Sun:
Mining the Correlation between Human and Automatic Evaluation at Sentence Level. - Christian Federmann:
Appraise: An Open-Source Toolkit for Manual Phrase-Based Evaluation of Translations. - Olivier Hamon:
Is my Judge a good One? - Mark Fishel, Harri Kirik:
Linguistically Motivated Unsupervised Segmentation for Machine Translation. - Yu Chen, Andreas Eisele:
Integrating a Rule-based with a Hierarchical Translation System. - Aurélien Max, Josep Maria Crego, François Yvon:
Contrastive Lexical Evaluation of Machine Translation. - Yiou Wang, Kiyotaka Uchimoto, Jun'ichi Kazama, Canasai Kruengkrai, Kentaro Torisawa:
Adapting Chinese Word Segmentation for Machine Translation Based on Short Units. - Masaki Murata, Tomohiro Ohno, Shigeki Matsubara, Yasuyoshi Inagaki:
Construction of Chunk-Aligned Bilingual Lecture Corpus for Simultaneous Machine Translation. - Ondrej Bojar, Pavel Stranák, Daniel Zeman:
Data Issues in English-to-Hindi Machine Translation. - Taiji Nagasaka, Ran Shimanouchi, Akiko Sakamoto, Takafumi Suzuki, Yohei Morishita, Takehito Utsuro, Suguru Matsuyoshi:
Utilizing Semantic Equivalence Classes of Japanese Functional Expressions in Translation Rule Acquisition from Parallel Patent Sentences. - Niraj Aswani, Robert J. Gaizauskas:
English-Hindi Transliteration using Multiple Similarity Metrics.
Session P23 - Corpora and Treebanks, Grammar and Syntax
- Cristina Bosco, Simonetta Montemagni, Alessandro Mazzei, Vincenzo Lombardo, Felice Dell'Orletta, Alessandro Lenci, Leonardo Lesmo, Giuseppe Attardi, Maria Simi, Alberto Lavelli, Johan Hall, Jens Nilsson, Joakim Nivre:
Comparing the Influence of Different Treebank Annotations on Dependency Parsing. - Olga Lyashevskaya:
Bank of Russian Constructions and Valencies. - Tomaz Erjavec, Darja Fiser, Simon Krek, Nina Ledinek:
The JOS Linguistically Tagged Corpus of Slovene. - António Branco, Francisco Costa, João Ricardo Silva, Sara Silveira, Sérgio Castro, Mariana Avelãs, Clara Pinto, João Graça:
Developing a Deep Linguistic Databank Supporting a Collection of Treebanks: the CINTIL DeepGramBank. - Katarzyna Glowinska, Adam Przepiórkowski:
The Design of Syntactic Annotation Levels in the National Corpus of Polish. - Kais Dukes, Eric Atwell, Abdul-Baquee M. Sharaf:
Syntactic Annotation Guidelines for the Quranic Arabic Dependency Treebank. - Jan Stepánek, Petr Pajas:
Querying Diverse Treebanks in a Uniform Way. - Marie Mikulová, Jan Stepánek:
Ways of Evaluation of the Annotators in Building the Prague Czech-English Dependency Treebank. - Marie Candito, Benoît Crabbé, Pascal Denis:
Statistical French Dependency Parsing: Treebank Conversion and First Results. - Marc Kupietz, Cyril Belica, Holger Keibel, Andreas Witt:
The German Reference Corpus DeReKo: A Primordial Sample for Linguistic Research. - Veronika Vincze, Dóra Szauter, Attila Almási, György Móra, Zoltán Alexin, János Csirik:
Hungarian Dependency Treebank. - Archna Bhatia, Rajesh Bhatt, Bhuvana Narasimhan, Martha Palmer, Owen Rambow, Dipti Misra Sharma, Michael Tepper, Ashwini Vaidya, Fei Xia:
Empty Categories in a Hindi Treebank. - Jinho D. Choi, Claire Bonial, Martha Palmer:
Propbank Instance Annotation Guidelines Using a Dedicated Editor, Jubilee. - Hiroki Hanaoka, Hideki Mima, Jun'ichi Tsujii:
A Japanese Particle Corpus Built by Example-Based Annotation. - Stephen A. Boxwell, Chris Brew:
A Pilot Arabic CCGbank. - Simon Mille, Leo Wanner:
Syntactic Dependencies for Multilingual and Multilevel Corpus Annotation. - Adriane Boyd:
EAGLE: an Error-Annotated Corpus of Beginning Learner German. - José M. García-Miguel, Gael Vaamonde, Fita González Domínguez:
ADESSE, a Database with Syntactic and Semantic Annotation of a Corpus of Spanish. - Jan Strunk:
Enriching a Treebank to Investigate Relative Clause Extraposition in German. - John Lee, Dag Trygve Truslew Haug:
Porting an Ancient Greek and Latin Treebank.
Session P24 - Parsing
- Alexis Baird, Christopher R. Walker:
The Creation of a Large-Scale LFG-Based Gold Parsebank. - Mridul Gupta, Vineet Yadav, Samar Husain, Dipti Misra Sharma:
Partial Parsing as a Method to Expedite Dependency Annotation of a Hindi Treebank. - Djamé Seddah:
Exploring the Spinal-STIG Model for Parsing French. - Kristina Vuckovic, Zeljko Agic, Marko Tadic:
Improving Chunking Accuracy on Croatian Texts by Morphosyntactic Tagging. - Rui Wang, Yi Zhang:
Hybrid Constituent and Dependency Parsing with Tsinghua Chinese Treebank. - Valia Kordoni, Yi Zhang:
Disambiguating Compound Nouns for a Dynamic HPSG Treebank of Wall Street Journal Texts. - João Ricardo Silva, António Branco, Patrícia Nunes Gonçalves:
Top-Performing Robust Constituency Parsing of Portuguese: Freely Available in as Many Ways as you Can Get it. - Marco Passarotti, Felice Dell'Orletta:
Improvements in Parsing the Index Thomisticus Treebank. Revision, Combination and a Feature Model for Medieval Latin. - Violeta Seretan, Eric Wehrli, Luka Nerima, Gabriela Soare:
FipsRomanian: Towards a Romanian Version of the Fips Syntactic Parser. - Kathrin Spreyer, Lilja Øvrelid, Jonas Kuhn:
Training Parsers on Partial Trees: A Cross-language Comparison. - Lamia Tounsi, Josef van Genabith:
Arabic Parsing Using Grammar Transforms. - Yoshihiko Hayashi, Thierry Declerck, Chiharu Narawa:
LAF/GrAF-grounded Representation of Dependency Structures.
Session P25 - Discourse Annotation
- Piroska Lendvai, Thierry Declerck, Sándor Darányi, Pablo Gervás, Raquel Hervás, Scott A. Malec, Federico Peinado:
Integration of Linguistic Markup into Semantic Models of Folk Narratives: The Fairy Tale Use Case. - Sárka Zikánová, Lucie Mladová, Jirí Mírovský, Pavlína Jínová:
Typical Cases of Annotators' Disagreement in Discourse Annotations in Prague Dependency Treebank. - Samira Shaikh, Tomek Strzalkowski, George Aaron Broadwell, Jennifer Stromer-Galley, Sarah M. Taylor, Nick Webb:
MPC: A Multi-Party Chat Corpus for Modeling Social Phenomena in Discourse. - Raffaella Bernardi, Manuel Kirschner, Zorana Ratkovic:
Context Fusion: The Role of Discourse Structure and Centering Theory. - Xuchen Yao, Irina Borisova, Mehwish Alam:
PDTB XML: the XMLization of the Penn Discourse TreeBank 2.0. - Horacio Saggion, Elena Stein-Sparvieri, David Maldavsky, Sandra Szasz:
NLP Resources for the Analysis of Patient/Therapist Interviews. - Nicole Novielli, Carlo Strapparava:
Studying the Lexicon of Dialogue Acts. - Nils Reiter, Oliver Hellwig, Anand Mishra, Anette Frank, Jens Burkhardt:
Using NLP Methods for the Analysis of Rituals. - Amal Al-Saif, Katja Markert:
The Leeds Arabic Discourse Treebank: Annotating Discourse Connectives for Arabic. - Maria Liakata, Simone Teufel, Advaith Siddharthan, Colin R. Batchelor:
Corpora for the Conceptualisation and Zoning of Scientific Papers. - Oi Yee Kwong:
Constructing an Annotated Story Corpus: Some Observations and Issues. - David K. Elson, Kathleen R. McKeown:
Building a Bank of Semantically Encoded Narratives. - Rashmi Prasad, Aravind K. Joshi, Bonnie L. Webber:
Exploiting Scope for Shallow Discourse Parsing.
Session P26 - Dialogue Annotation
- Sara Tonelli, Giuseppe Riccardi, Rashmi Prasad, Aravind K. Joshi:
Annotation of Discourse Relations for Conversational Spoken Dialogs. - Thomas Schmidt, Wilfried Schütte:
FOLKER: An Annotation Tool for Efficient Transcription of Natural, Multi-party Interaction. - Agnieszka Mykowiecka, Katarzyna Glowinska, Joanna Rabiega-Wisniewska:
Domain-related Annotation of Polish Spoken Dialogue Corpus LUNA.PL. - Yasuharu Den, Hanae Koiso, Takehiko Maruyama, Kikuo Maekawa, Katsuya Takanashi, Mika Enomoto, Nao Yoshida:
Two-level Annotation of Utterance-units in Japanese Dialogs: An Empirically Emerged Scheme. - Olivier Blanc, Matthieu Constant, Anne Dister, Patrick Watrin:
Partial Parsing of Spontaneous Spoken French. - Mohamed Maamouri, Ann Bies, Seth Kulick, Wajdi Zaghouani, David Graff, Michael Ciul:
From Speech to Trees: Applying Treebank Annotation to Arabic Broadcast News. - Kiyonori Ohtake, Teruhisa Misu, Chiori Hori, Hideki Kashioka, Satoshi Nakamura:
Dialogue Acts Annotation for NICT Kyoto Tour Dialogue Corpus to Construct Statistical Dialogue Systems. - Iris Eshkol, Denis Maurel, Nathalie Friburger:
Eslo: From Transcription to Speakers' Personal Information Annotation. - Roberta Catizone, Alexiei Dingli, Robert J. Gaizauskas:
Using Dialogue Corpora to Extend Information Extraction Patterns for Natural Language Understanding of Dialogue. - Renata Savy:
Pr.A.Ti.D: A Coding Scheme for Pragmatic Annotation of Dialogues.
Session P27 - Evaluation of Speech Recognition and Speech Synthesis
- Bert Réveil, Jean-Pierre Martens, Henk van den Heuvel:
Improving Proper Name Recognition by Adding Automatically Learned Pronunciation Variants to the Lexicon. - Iñaki Sainz, Eva Navas, Inma Hernáez, Antonio Bonafonte, Francisco Campillo:
TTS Evaluation Campaign with a Common Spanish Database. - Timo Sowa, Fiorenza Arisio, Luca Cristoforetti:
DICIT: Evaluation of a Distant-talking Speech Interface for Television.
Session P28 - Terminological Lexicons, Ontologies, Corpora
- Ranka Stankovic, Ivan Obradovic, Olivera Kitanovic:
GIS Application Improvement with Multilingual Lexical and Terminological Resources. - Rita Marinelli, Adriana Roventini, Giovanni Spadoni, Sebastiana Cucurullo:
Lexical Semantic Resources in a Terminological Network. - Nelleke Oostdijk, Suzan Verberne, Cornelis H. A. Koster:
Constructing a Broad-coverage Lexicon for Text Mining in the Patent Domain. - Rodrigo Agerri, Ana García-Serrano:
Q-WordNet: Extracting Polarity from WordNet Senses. - Aya Nishikawa, Ryo Nishimura, Yasuhiko Watanabe, Yoshihiro Okada:
A Context Sensitive Variant Dictionary for Supporting Variant Selection. - Montse Cuadros, Egoitz Laparra, German Rigau, Piek Vossen, Wauter Bosma:
Integrating a Large Domain Ontology of Species into WordNet. - Andrejs Vasiljevs, Kaspars Balodis:
Corpus Based Analysis for Multilingual Terminology Entry Compounding. - Arianne Reimerink, Pilar León Araúz, Pedro J. Magaña Redondo:
EcoLexicon: An Environmental TKB. - Dimitrios Kokkinakis, Ulla Gerdin:
A Swedish Scientific Medical Corpus for Terminology Management and Linguistic Exploration.
Session P29 - Question Answering and Evaluation
- Silvia Quarteroni, Alessandro Moschitti:
A Comprehensive Resource to Evaluate Complex Open Domain Question Answering. - Alessandra Giordani, Alessandro Moschitti:
Corpora for Automatically Learning to Map Natural Language Questions into SQL Queries. - Fang Xu, Dietrich Klakow:
Paragraph Acquisition and Selection for List Question Using Amazon's Mechanical Turk. - Diana Santos, Luís Miguel Cabral, Corina Forascu, Pamela Forner, Fredric C. Gey, Katrin Lamm, Thomas Mandl, Petya Osenova, Anselmo Peñas, Álvaro Rodrigo, Julia Maria Schulz, Yvonne Skalban, Erik F. Tjong Kim Sang:
GikiCLEF: Crosscultural Issues in Multilingual Information Access. - Sarra El Ayari, Brigitte Grau, Anne-Laure Ligozat:
Fine-grained Linguistic Evaluation of Question Answering Systems. - Arnaud Grappy, Brigitte Grau, Olivier Ferret, Cyril Grouin, Véronique Moriceau, Isabelle Robba, Xavier Tannier, Anne Vilnat, Vincent Barbier:
A Corpus for Studying Full Answer Justification. - Ludovic Quintard, Olivier Galibert, Gilles Adda, Brigitte Grau, Dominique Laurent, Véronique Moriceau, Sophie Rosset, Xavier Tannier, Anne Vilnat:
Question Answering on Web Data: The QA Evaluation in Quæro. - Xavier Tannier, Véronique Moriceau:
FIDJI: Web Question-Answering at Quaero 2009. - Bernard Jacquemin:
A Derivational Rephrasing Experiment for Question Answering.
Session P30 - Natural Language Generation
- Roberto P. A. Araujo, Rafael Lage de Oliveira, Eder Miranda de Novais, Thiago Dias Tadeu, Daniel Bastos Pereira, Ivandré Paraboni:
SINotas: the Evaluation of a NLG Application. - Thiago Dias Tadeu, Eder Miranda de Novais, Ivandré Paraboni:
Extracting Surface Realisation Templates from Corpora. - Sandra Williams, Richard Power:
A Fact-aligned Corpus of Numerical Expressions. - Andrew Gargett, Konstantina Garoufi, Alexander Koller, Kristina Striegnitz:
The GIVE-2 Corpus of Giving Instructions in Virtual Environments.
Session P31 - Dialogue Corpora
- Keyan Zhou, Aijun Li, Zhigang Yin, Chengqing Zong:
CASIA-CASSIL: a Chinese Telephone Conversation Corpus in Real Scenarios with Multi-leveled Annotation. - Yuki Kamiya, Tomohiro Ohno, Shigeki Matsubara, Hideki Kashioka:
Construction of Back-Channel Utterance Corpus for Responsive Spoken Dialogue System Development. - Werner Spiegl, Korbinian Riedhammer, Stefan Steidl, Elmar Nöth:
FAU IISAH Corpus -- A German Speech Database Consisting of Human-Machine and Human-Human Interaction Acquired by Close-Talking and Far-Distance Microphones. - Rodolfo Delmonte, Antonella Bristot, Vincenzo Pallotta:
Deep Linguistic Processing with GETARUNS for Spoken Dialogue Understanding. - Helena Spilková, Daniel Brenner, Anton Öttl, Pavel Vondricka, Wim A. van Dommelen, Mirjam Ernestus:
The Kachna L1/L2 Picture Replication Corpus. - Linda Brandschain, David Graff, Christopher Cieri, Kevin Walker, Chris Caruso, Abby Neely:
Greybeard Longitudinal Speech Study. - Linda Brandschain, David Graff, Chris Cieri, Kevin Walker, Chris Caruso, Abby Neely:
Mixer 6.
Session P32 - Dialogue Management and Systems
- Tobias Heinroth, Dan Denich, Alexander Schmitt, Wolfgang Minker:
Efficient Spoken Dialogue Domain Representation and Interpretation. - Ioana Vasilescu, Sophie Rosset, Martine Adda-Decker:
On the Role of Discourse Markers in Interactive Spoken Question Answering Systems. - Jette Viethen, Simon Zwarts, Robert Dale, Markus Guhe:
Dialogue Reference in a Visual Domain. - Anton Leuski, David R. Traum:
NPCEditor: A Tool for Building Question-Answering Characters.
Session P33 - Information Extraction, Terminology, Corpora
- Claudia Borg, Mike Rosner, Gordon J. Pace:
Automatic Grammar Rule Extraction and Ranking for Definitions. - Alberto Tretti, Barbara Di Eugenio:
Analysis and Presentation of Results for Mobile Local Search. - Atsushi Fujii:
Modeling Wikipedia Articles to Enhance Encyclopedic Search. - Christian Federmann, Thierry Declerck:
Extraction, Merging, and Monitoring of Company Data from Heterogeneous Sources. - Alberto Simões, José João Almeida, Rita Farinha:
Processing and Extracting Data from Dicionário Aberto. - Ziqi Zhang, José Iria, Fabio Ciravegna:
Improving Domain-specific Entity Recognition with Automatic Term Recognition and Feature Extraction. - Jakob Halskov, Dorte Haltrup Hansen, Anna Braasch, Sussi Olsen:
Quality Indicators of LSP Texts - Selection and Measurements Measuring the Terminological Usefulness of Documents for an LSP Corpus. - Eric Charton, Juan-Manuel Torres-Moreno:
NLGbAse: A Free Linguistic Resource for Natural Language Processing Systems. - Cécile Grivaz:
Human Judgements on Causation in French Texts. - Heng Ji, Xiang Li, Angelo Lucia, Jianting Zhang:
Annotating Event Chains for Carbon Sequestration Literature. - Kumutha Swampillai, Mark Stevenson:
Inter-sentential Relations in Information Extraction Corpora. - Christopher R. Walker, Hannah Copperman:
Evaluating Complex Semantic Artifacts. - Marc Kemps-Snijders, Thomas Koller, Han Sloetjes, Huib Verwey:
LAT Bridge: Bridging Tools for Annotation and Exploration of Rich Linguistic Data.
Session P34 - Knowledge Discovery
- Paola Monachesi, Thomas Markus:
Socially Driven Ontology Enrichment for eLearning. - Avaré Stewart, Kerstin Denecke, Wolfgang Nejdl:
Cross-Corpus Textual Entailment for Sublanguage Analysis in Epidemic Intelligence. - Ekaterina Buyko, Elena Beisswanger, Udo Hahn:
The GeneReg Corpus for Gene Expression Regulation Events - An Overview of the Corpus and its In-Domain and Out-of-Domain Interoperability. - Carlos Periñán-Pascual, Francisco Arcas-Túnez:
The Architecture of FunGramKB. - Jaouad Mousser:
A Large Coverage Verb Taxonomy for Arabic. - Satoshi Sekine, Kapil Dalwani:
Ngram Search Engine with Patterns Combining Token, POS, Chunk and NE Information.
Session P35 - Text Corpora and Language Resources
- Henk van den Heuvel, René van Horik, Stef Scagliola, Eric Sanders, Paula Witkamp:
The VeteranTapes: Research Corpus, Fragment Processing Tool, and Enhanced Publications for the e-Humanities. - Martin Reynaert, Nelleke Oostdijk, Orphée De Clercq, Henk van den Heuvel, Franciska de Jong:
Balancing SoNaR: IPR versus Processing Issues in a 500-Million-Word Written Dutch Reference Corpus. - Youssef Aït Ouguengay, Aïcha Bouhjar:
For Standardised Amazigh Linguistic Resources. - Dafydd Gibbon, Moses Ekpenyong, Eno-Abasi Urua:
Medefaidrin: Resources Documenting the Birth and Death Language Life-cycle. - Nicolás Serrano, Francisco Castro, Alfons Juan:
The RODRIGO Database. - Cristina Sánchez Marco, Gemma Boleda, Josep Maria Fontana, Judith Domingo:
Annotation and Representation of a Diachronic Corpus of Spanish. - Roser Sanromà, Gemma Boleda:
The Database of Catalan Adjectives. - Graham Neubig, Shinsuke Mori:
Word-based Partial Annotation for Efficient Corpus Construction.
Session P36 - Multimodal and Audiovisual Corpora
- Elena Grishina:
Multimodal Russian Corpus (MURCO): First Steps. - Kristiina Jokinen:
Non-verbal Signals for Turn-taking and Feedback. - Patrizia Paggio, Jens Allwood, Elisabeth Ahlsén, Kristiina Jokinen, Costanza Navarretta:
The NOMCO Multimodal Nordic Resource - Goals and Characteristics. - Fernando Fernández Martínez, Juan Manuel Lucas-Cuesta, Roberto Barra-Chicote, Javier Ferreiros, Javier Macías Guarasa:
HIFI-AV: An Audio-visual Corpus for Spoken Language Human-Machine Dialogue Research in Spanish. - Francisco Torreira, Mirjam Ernestus:
The Nijmegen Corpus of Casual Spanish. - Rein Ove Sikveland, Anton Öttl, Ingunn Amdal, Mirjam Ernestus, Torbjørn Svendsen, Jens Edlund:
Spontal-N: A Corpus of Interactional Spoken Norwegian. - Jens Edlund, Jonas Beskow, Kjell Elenius, Kahl Hellmer, Sofia Strömbergsson, David House:
Spontal: A Swedish Spontaneous Dialogue Corpus of Audio, Video and Motion Capture. - Jérôme Urbain, Elisabetta Bevacqua, Thierry Dutoit, Alexis Moinet, Radoslaw Niewiadomski, Catherine Pelachaud, Benjamin Picart, Joëlle Tilmanne, Johannes Wagner:
The AVLaughterCycle Database. - Carlos Gómez Gallo, T. Florian Jaeger, Katrina Furth:
A Database for the Exploration of Spanish Planning. - Stavros Ntalampiras, Todor Ganchev, Ilyas Potamitis, Nikos Fakotakis:
Heterogeneous Sensor Database in Support of Human Behaviour Analysis in Unrestricted Environments: The Audio Part. - Theodoros Kostoulas, Otilia Kocsis, Todor Ganchev, Fernando Fernández-Aranda, Juan J. Santamaría, Susana Jiménez-Murcia, Maher Ben Moussa, Nadia Magnenat-Thalmann, Nikos Fakotakis:
The PlayMancer Database: A Multimodal Affect Database in Support of Research and Development Activities in Serious Game Environment. - Alexander Vorwerk, Xiaohui Wang, Dorothea Kolossa, Steffen Zeiler, Reinhold Orglmeister:
WAPUSK20 - A Database for Robust Audiovisual Speech Recognition. - Peng-Wen Chen, Snehal Kumar Chennuru, Ying Zhang:
A Language Approach to Modeling Human Behaviors. - Kathleen M. Eberhard, Hannele Nicholson, Sandra Kübler, Susan Gundersen, Matthias Scheutz:
The Indiana "Cooperative Remote Search Task" (CReST) Corpus. - Katerina Pastra, Christian Wallraven, Michael Schultze, Argyro Vataki, Kathrin Kaulard:
The POETICON Corpus: Capturing Language Use and Sensorimotor Experience in Everyday Interaction. - Quan Nguyen, Michael Kipp:
Annotation of Human Gesture using 3D Skeleton Controls. - Massimo Poesio, Marco Baroni, Oswald Lanz, Alessandro Lenci, Alexandros Potamianos, Hinrich Schütze, Sabine Schulte im Walde, Luca Surian:
BabyExp: Constructing a Huge Multimodal Resource to Acquire Commonsense Knowledge Like Children Do.
Session P37 - Sign Language
- François Lefebvre-Albaret, Patrice Dalle:
Video Retrieval in Sign Language Videos : How to Model and Compare Signs? - Antoinette Hawayek, Riccardo Del Gratta, Giuseppe Cappelli:
A Bilingual Dictionary Mexican Sign Language-Spanish/Spanish-Mexican Sign Language.
Session P38 - Document Classification
- Serge Sharoff, Zhili Wu, Katja Markert:
The Web Library of Babel: evaluating genre collections. - Hercules Dalianis, Sumithra Velupillai:
How Certain are Clinical Assessments? Annotating Swedish Clinical Text for (Un)certainties, Speculations and Negations. - Magnus Rosell:
Text Cluster Trimming for Better Descriptions and Improved Quality. - Alberto Díaz, Pablo Gervás, Antonio García, Laura Plaza:
Development and Use of an Evaluation Collection for Personalisation of Digital Newspapers. - Michael Wiegand, Dietrich Klakow:
Predictive Features for Detecting Indefinite Polar Sentences. - Naoki Ishikawa, Ryo Nishimura, Yasuhiko Watanabe, Yoshihiro Okada, Masaki Murata:
Detection of submitters suspected of pretending to be someone else in a community site. - Nikola Ljubesic, Tomislava Lauc, Damir Boras:
Building a Gold Standard for Event Detection in Croatian.
Session P39 - Summarisation
- Jorge Vivaldi, Iria da Cunha, Juan-Manuel Torres-Moreno, Patricia Velázquez-Morales:
Automatic Summarization Using Terminological and Semantic Resources. - Claude de Loupy, Marie Guégan, Christelle Ayache, Somara Seng, Juan-Manuel Torres-Moreno:
A French Human Reference Corpus for Multi-Document Summarization and Sentence Compression. - Ahmet Aker, Robert J. Gaizauskas:
Model Summaries for Location-related Images. - Masahiro Nakano, Hideyuki Shibuki, Rintaro Miyazaki, Madoka Ishioroshi, Koichi Kaneko, Tatsunori Mori:
Construction of Text Summarization Corpus for the Credibility of Information on the Web.
Session P40 - Textual Entailment
- Paul Bédaride, Claire Gardent:
Syntactic Testsuites and Textual Entailment Recognition. - Rui Wang, Caroline Sporleder:
Constructing a Textual Semantic Relation Corpus Using a Discourse Treebank. - Aurélien Max, Guillaume Wisniewski:
Mining Naturally-occurring Corrections and Paraphrases from Wikipedia's Revision History. - Jana Z. Sukkarieh, Eleanor Bolge:
Building a Textual Entailment Suite for the Evaluation of Automatic Content Scoring Technologies.
Session P41 - Semantics and Evaluation
- Kirk Roberts, Srikanth Gullapalli, Cosmin Adrian Bejan, Sanda M. Harabagiu:
A Linguistic Resource for Semantic Parsing of Motion Events. - Zareen Syed, Evelyne Viegas, Savas Parastatidis:
Automatic Discovery of Semantic Relations using MindNet. - Ineke Schuurman, Vincent Vandeghinste:
Cultural Aspects of Spatiotemporal Analysis in Multilingual Applications. - Fabienne Venant:
Meaning Representation: From Continuity to Discreteness. - Dirk Goldhahn, Uwe Quasthoff:
Automatic Annotation of Co-Occurrence Relations. - Simon Scerri, Gerhard Gossen, Brian Davis, Siegfried Handschuh:
Classifying Action Items for Semantic Email. - Jirí Materna, Karel Pala:
Using Ontologies for Semi-automatic Linking VerbaLex with FrameNet. - Olivier Ferret:
Testing Semantic Similarity Measures for Extracting Synonyms from a Corpus.
Session P42 - Text Mining
- Sophia Ananiadou, John McNaught, James Thomas, Mark Rickinson, Sandy Oliver:
Evaluating a Text Mining Based Educational Search Portal. - Hiroyuki Shinnou, Minoru Sasaki:
Detection of Peculiar Examples using LOF and One Class SVM. - Agata Cybulska, Piek Vossen:
Event Models for Historical Perspectives: Determining Relations between High and Low Level Events in Text, Based on the Classification of Time, Location and Participants. - Eva Sassolini, Alessandra Cinini:
Cultural Heritage: Knowledge Extraction from Web Documents.
Session P43 - Multilingual Corpora for Machine Translation
- Lieve Macken:
An Annotation Scheme and Gold Standard for Dutch-English Word Alignment. - Lucia Specia, Nicola Cancedda, Marc Dymetman:
A Dataset for Assessing Machine Translation Evaluation Metrics. - Gábor Recski, András Rung, Attila Zséder, András Kornai:
NP Alignment in Bilingual Corpora. - Orphée De Clercq, Maribel Montero Perez:
Data Collection and IPR in Multilingual Parallel Corpora. Dutch Parallel Corpus. - Yulia Tsvetkov, Shuly Wintner:
Automatic Acquisition of Parallel Corpora from Websites with Dynamic Content. - Beáta Megyesi, Bengt Dahlqvist, Éva Á. Csató, Joakim Nivre:
The English-Swedish-Turkish Parallel Treebank. - Lars Ahrenberg:
Alignment-based Profiling of Europarl Data in an English-Swedish Parallel Corpus. - Jesús González-Rubio, Jorge Civera, Alfons Juan, Francisco Casacuberta:
Saturnalia: A Latin-Catalan Parallel Corpus for Statistical MT. - Julia Maria Schulz, Christa Womser-Hacker, Thomas Mandl:
Multilingual Corpus Development for Opinion Mining. - Tom Vanallemeersch:
Belgisch Staatsblad Corpus: Retrieving French-Dutch Sentences from Official Documents.
Session P44 - Language Identification
- Yu Fu, Feiyu Xu, Hans Uszkoreit:
Determining the Origin and Structure of Person Names. - Tommi Vatanen, Jaakko J. Väyrynen, Sami Virpioja:
Language Identification of Short Text Segments with N-gram Models. - Stasinos Konstantopoulos:
Learning Language Identification Models: A Comparative Analysis of the Distinctive Features of Names and Common Words. - Mohamed Belgacem, Georges Antoniadis, Laurent Besacier:
Automatic Identification of Arabic Dialects.
Session P45 - Evaluation Methodologies
- Elin Carlsson, Hercules Dalianis:
Influence of Module Order on Rule-Based De-identification of Personal Names in Electronic Patient Records Written in Swedish. - Olga Babko-Malaya, Daniel Hunter, Connie Fournelle, Jim White:
Evaluation of Document Citations in Phase 2 Gale Distillation. - Olivier Galibert, Ludovic Quintard, Sophie Rosset, Pierre Zweigenbaum, Claire Nedellec, Sophie Aubin, Laurent Gillard, Jean-Pierre Raysz, Delphine Pois, Xavier Tannier, Louise Deléger, Dominique Laurent:
Named and Specific Entity Detection in Varied Data: The Quæro Named Entity Baseline Evaluation. - Marco Guerini, Carlo Strapparava, Oliviero Stock:
Evaluation Metrics for Persuasive NLP with Google AdWords. - Joana Hois:
Inter-Annotator Agreement on a Linguistic Ontology for Spatial Language - A Case Study for GUM-Space. - Petra-Maria Strauß, Stefan Scherer, Georg Layher, Holger Hoffmann:
Evaluation of the PIT Corpus Or What a Difference a Face Makes?
Session P46 - Corpora and Evaluation
- Ernesto William De Luca:
A Corpus for Evaluating Semantic Multilingual Web Retrieval Systems: The Sense Folder Corpus. - Sanja Seljan, Marko Tadic, Zeljko Agic, Jan Snajder, Bojana Dalbelo Basic, Vjekoslav Osmann:
Corpus Aligner (CorAl) Evaluation on English-Croatian Parallel Corpora. - Irina P. Temnikova:
Cognitive Evaluation Approach for a Controlled Language Post--Editing Experiment. - Maria Khokhlova, Victor Zakharov:
Studying Word Sketches for Russian. - Hannah Copperman, Christopher R. Walker:
Fred's Reusable Evaluation Device: Providing Support for Quick and Reliable Linguistic Annotation. - Emilia Apostolova, Sean Neilan, Gary An, Noriko Tomuro, Steven L. Lytinen:
Djangology: A Light-weight Web-based Tool for Distributed Collaborative Text Annotation.
Session P47 - Corpora, Annotation and Tools
- Marc Verhagen:
The Brandeis Annotation Tool. - Georgios Petasis, Dimitrios Petasis:
BlogBuster: A Tool for Extracting Corpora from the Blogosphere. - Jinho D. Choi, Claire Bonial, Martha Palmer:
Propbank Frameset Annotation Guidelines Using a Dedicated Editor, Cornerstone. - Dain Kaplan, Ryu Iida, Takenobu Tokunaga:
Annotation Process Management Revisited. - Takeshi Abekawa, Masao Utiyama, Eiichiro Sumita, Kyo Kageura:
Community-based Construction of Draft and Final Translation Corpus Through a Translation Hosting Site Minna no Hon'yaku (MNH). - Maarten Marx, Anne Schuth:
DutchParl. The Parliamentary Documents in Dutch. - Svetla Koeva, Diana Blagoeva, Siya Kolkovska:
Bulgarian National Corpus Project. - Khalil Dahab, Anja Belz:
A Game-based Approach to Transcribing Images of Text. - Ghulam Raza:
Inferring Subcat Frames of Verbs in Urdu. - Romaric Besançon, Gaël de Chalendar, Olivier Ferret, Faiïza Gara, Olivier Mesnard:
LIMA : A Multilingual Framework for Linguistic Analysis and Linguistic Resources Development and Evaluation. - Catarina Magro:
When CORDIAL Becomes Friendly: Endowing the CORDIAL Corpus with a Syntactic Annotation Layer. - Richard Johansson, Alessandro Moschitti:
A Flexible Representation of Heterogeneous Annotation Data.
Session P48 - Tools for Speech Corpus
- Kai Wörner:
A Tool for Feature-Structure Stand-Off-Annotation on Transcriptions of Spoken Discourse. - Andrew Thwaites, Jeroen Geertzen, William D. Marslen-Wilson, Paula Buttery:
LIPS: A Tool for Predicting the Lexical Isolation Point of a Word. - Ibon Saratxaga, Inmaculada Hernáez, Eva Navas, Iñaki Sainz, Iker Luengo, Jon Sánchez, Igor Odriozola, Daniel Erro:
AhoTransf: A Tool for Multiband Excitation Based Speech Analysis and Modification. - Sara Romano, Francesco Cutugno:
New Features in Spoken Language Search Hawk (SpLaSH): Query Language and Query Sequence. - Kornel Laskowski, Jens Edlund:
A Snack Implementation and Tcl/Tk Interface to the Fundamental Frequency Variation Spectrum Algorithm. - Sathish Pammi, Marcela Charfuelan, Marc Schröder:
Multilingual Voice Creation Toolkit for the MARY TTS Platform.
Session P49 - WordNet, Framenet, Ontologies
- Winston N. Anderson, Laurette Pretorius, Albert E. Kotzé:
Base Concepts in the African Languages Compared to Upper Ontologies and the WordNet Top Ontology. - Yue Ma, Adeline Nazarenko, Laurent Audibert:
Formal Description of Resources for Ontology-based Semantic Annotation. - Roxane Segers, Piek Vossen:
Facilitating Non-expert Users of the KYOTO Platform: the TMEKO Editing Protocol for Synset to Ontology Mappings. - Chris Irwin Davis, Dan I. Moldovan:
Feasibility of Automatically Bootstrapping a Persian WordNet. - Pushpak Bhattacharyya:
IndoWordNet. - Zygmunt Vetulani, Marek Kubis, Tomasz Obrêbski:
PolNet - Polish WordNet: Data and Tools. - Mehrnoush Shamsfard, Hakimeh Fadaei, Elham Fekri:
Extracting Lexico-conceptual Knowledge for Developing Persian WordNet. - Prasanth Kolachina, Sudheer Kolachina, Anil Kumar Singh, Samar Husain, Viswanatha Naidu, Rajeev Sangal, Akshar Bharati:
Grammar Extraction from Treebanks for Hindi and Telugu. - Emiliano Giovannetti:
An Unsupervised Approach for Semantic Relation Interpretation. - Gabor Melli:
Concept Mentions within KDD-2009 Abstracts (kdd09cma1) Linked to a KDD Ontology (kddo1). - Min-Jae Kwon, Hae-Yun Lee, Hee-Rahk Chae:
Linking Korean Words with an Ontology. - Hassina Aliane, Zaia Alimazighi, Ahmed Cherif Mazari:
Al - Khalil : The Arabic Linguistic Ontology Project. - Cássia Trojahn dos Santos, Paulo Quaresma, Renata Vieira:
An API for Multi-lingual Ontology Matching. - Thierry Declerck, Piroska Lendvai:
Towards a Standardized Linguistic Annotation of the Textual Content of Labels in Knowledge Representation Systems. - Kiril Ivanov Simov, Petya Osenova:
Constructing of an Ontology-based Lexicon for Bulgarian. - René Witte, Ninus Khamis, Juergen Rilling:
Flexible Ontology Population from Text: The OwlExporter. - Takehiro Teraoka, Jun Okamoto, Shun Ishizaki:
An Associative Concept Dictionary for Verbs and its Application to Elliptical Word Estimation. - Nao Tatsumi, Jun Okamoto, Shun Ishizaki:
Evaluating Semantic Relations and Distances in the Associative Concept Dictionary using NIRS-imaging. - Giulio Paci, Giorgio Pedrazzi, Roberta Turra:
Wikipedia-based Approach for Linking Ontology Concepts to their Realisations in Text. - Pradeep Dantuluri, Brian Davis, Siegfried Handschuh:
A Use Case for Controlled Languages as Interfaces to Semantic Web Applications. - Alessandro Oltramari, Guido Vetere, Maurizio Lenzerini, Aldo Gangemi, Nicola Guarino:
Senso Comune.
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.