Morpho project
The goal of the Morpho project is to develop unsupervised, data-driven
methods that discover the regularities behind word formation in natural
languages. In particular, we focus on the discovery of
morphemes, which are the primitive units of syntax: the smallest
individually meaningful elements in the utterances of a
language. Morphemes are important in the automatic generation and
recognition of a language, especially in languages in which words
may have many different inflected forms.
Read more about the problem and
our methods or see the
list of publications.
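To make the idea of splitting words into morphs concrete, here is a minimal Python sketch. It is not the Morfessor algorithm: Morfessor learns its morph lexicon from unannotated text, whereas this toy assumes a small hand-written lexicon. The lexicon contents, the costs, and the function name `segment` are all invented for illustration. Given those assumptions, the sketch finds the lowest-cost split of a word with a Viterbi-style dynamic programme.

```python
import math

# Hypothetical toy lexicon: morph -> cost (think negative log-probability).
# Both the morphs and the costs are invented for this illustration.
TOY_LEXICON = {
    "un": 2.0,
    "believ": 4.0,
    "able": 2.5,
    "walk": 3.0,
    "ed": 1.5,
    "s": 1.0,
    "ing": 1.5,
}


def segment(word, lexicon):
    """Return the lowest-cost split of `word` into morphs from `lexicon`.

    If the word cannot be fully covered by lexicon morphs, the word is
    returned unsegmented as a single-element list.
    """
    n = len(word)
    best_cost = [math.inf] * (n + 1)  # best_cost[i]: cheapest split of word[:i]
    back = [0] * (n + 1)              # back[i]: start index of the last morph
    best_cost[0] = 0.0

    for end in range(1, n + 1):
        for start in range(end):
            piece = word[start:end]
            if piece not in lexicon:
                continue
            cost = best_cost[start] + lexicon[piece]
            if cost < best_cost[end]:
                best_cost[end] = cost
                back[end] = start

    if math.isinf(best_cost[n]):
        return [word]  # no full segmentation exists in this lexicon

    # Follow the back-pointers from the end of the word to recover the morphs.
    morphs = []
    end = n
    while end > 0:
        start = back[end]
        morphs.append(word[start:end])
        end = start
    return morphs[::-1]
```

With this toy lexicon, `segment("unbelievable", TOY_LEXICON)` returns `['un', 'believ', 'able']`, and a word with no covering morphs, such as `"xyz"`, comes back unsegmented.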
Demonstrations and packages
- Morfessor demonstration:
Try the segmentation of words into morphs
- Morfessor FlatCat 1.0 software
- Download Morfessor FlatCat (published under the simplified BSD license)
- Related article: Stig-Arne Grönroos, Sami Virpioja, Peter Smit, and Mikko Kurimo (2014).
Morfessor FlatCat: An HMM-based method for unsupervised and semi-supervised learning of morphology.
In Proceedings of the 25th International Conference on Computational Linguistics.
Dublin, Ireland, August 2014, Association for Computational Linguistics.
[ Article (PDF) ]
[ bib ]
- Morfessor 2.0 software
- Download Morfessor 2.0 (45 kB, published under the simplified BSD license)
- Related article: Sami Virpioja, Peter Smit, Stig-Arne
Grönroos, and Mikko Kurimo (2013).
Morfessor 2.0: Python Implementation and Extensions for Morfessor Baseline.
Aalto University publication series SCIENCE + TECHNOLOGY, 25/2013.
Aalto University, Helsinki, 2013. ISBN 978-952-60-5501-5.
[ Article ]
- Morfessor Categories-MAP 0.9.2 software
- Download Morfessor Categories-MAP (100 kB, published under the GNU GPL)
- Related article: Mathias Creutz and Krista Lagus
(2005). Inducing the Morphological Lexicon of a Natural Language
from Unannotated Text. In Proceedings of the International and
Interdisciplinary Conference on Adaptive Knowledge Representation
and Reasoning (AKRR'05), Espoo, Finland, 15-17 June.
[ Article (PDF) ]
- Morfessor 1.0 software (Morfessor Baseline algorithm)
- Download morfessor1.0.perl (60 kB, published under the GNU GPL)
- Related article: Mathias Creutz and Krista Lagus
(2005). Unsupervised Morpheme Segmentation and Morphology Induction
from Text Corpora Using Morfessor 1.0. Publications in Computer and
Information Science, Report A81, Helsinki University of Technology,
March.
[ Abstract ] [ Article (PDF) ]
- Hutmegs 1.0 evaluation package (Helsinki University of Technology
Morphological Evaluation Gold Standard).
- Download Hutmegs version 1.0 (9.6 MB)
- Related article: Mathias
Creutz and Krister Lindén (2004). Morpheme Segmentation Gold
Standards for Finnish and English. Publications in Computer and
Information Science, Report A77, Helsinki University of Technology,
October.
[ Abstract ] [ Article (PDF) ]
Morpho Challenges
For an overview of the Morpho Challenges, see
http://morpho.aalto.fi/events/morphochallenge/.
The most recent Challenge we organized was
Morpho Challenge 2010 - Semi-supervised and Unsupervised Analysis.
Older Challenges: 2009, 2008, 2007, 2005
Press releases (in Finnish)
People
Feedback or questions: morpho (at) aalto.fi
The Morpho project has been part of the Adaptive Natural Language Processing
research activities at the Laboratory of Computer and Information Science.
Currently, Morfessor is developed in the Department of Signal Processing and
Acoustics at Aalto University.
Page maintained by morpho at aalto.fi,
last updated Tue Jul 14 15:12:07 2015