
Next: 12.7 References
Up: 12 Language Resources
Previous: 12.5 Terminology
Contact information for the corpora mentioned in
section
is provided here in alphabetical
order.
- British National Corpus (BNC):
- smbowie@vax.oxford.ac.uk
- Consortium for Lexical Research (CLR):
- lexical@nmsu.edu
- Dansk Korpus (DK):
- olenc@coco.ihi.ku.dk (Ole
Norling-Christensen)
- European Corpus Initiative (ECI):
- (in Europe):
eucorp@cogsci.edinburgh.ac.uk
- European Corpus Initiative (ECI):
- (in U.S.) LDC:
ehodas@unagi.cis.upenn.edu
- Frantext of Institut National de la Langue
Francaise (INaLF-CNRS):
- emartin@FRCII171 (Eveline Martin)
- Institut für deutsche Sprache (IDS):
-
neumann@ids-mannheim.de (Robert Neumann)
- Instituut voor Nederlandse Lexicologie (INL):
-
postmaster@hnympi52.bitnet
- International Computer Archive of Modern
English (ICAME):
- stigj@hedda.uio.no (Stig Johansson)
- Istituto di Linguistica Computazionale (ILC-CNR):
-
glottolo@vm.cnuce.cnr.it (Antonio Zampolli)
- Linguistic Data Consortium (LDC):
-
ehodas@unagi.cis.upenn.edu (Elizabeth Hodas). The WWW
page is http://www.cis.upenn.edu/ ldc. Information about the
LDC and its activities can also be obtained via anonymous FTP
ftp.cis.upenn.edu under pub/ldc. Most of the data are
compressed using the tool Shorten by T. Robinson
which is available via ft
svr-ftp.eng.cam.ac.uk
- Norsk Tekstarkiv:
-
per.vestbostad@hd.uib.no (Per Vestbostad)
- Spanish Reference Corpus Project:
- marcos@emduam11.bitnet (Francisco Marcos Marin), Sociedad
Estatal del V Centenario
- Stockholm-Umea Corpus (SUC):
- gunnel@ling.su.se
(Gunnel Kallgren); ejerhed@ling.umu.se (Eva Ejerhed); Sprakdata
gellerstam@svenska.gu.se (Martin Gellerstam)
- Text Encoding Initiative (TEI):
- lou@vax.ox.ac.uk (Lou
Burnard), u35395@uicvm.bitnet (C.M. Sperberg McQueen)
- University of Helsinki:
-
fkarlsso@ling.helsinki.fi (Fred Karlsson)
Contact information for the corpora mentioned in
section
is provided here in alphabetical order.
- ACCOR:
- Project contact: Prof. W. Hardcastle,
sphard@queen-margaret-college.main.ac.uk; Prof. A. Marchal,
phonetic@fraix11.bitnet (The British English portion of the
ACCOR corpus is being produced on CDROM with partial financing from
ELSNET)
- ALBAYZIN:
- Corpus contact: Professor Climent
Nadeu, Department of Speech Signal Theory and Communications,
Universitat Politecnica de Catalunya, ETSET, Apartat 30002, 08071
Barcelona, Spain, nadeu@tsc.upc.es
- ARS:
- CSELT (coordinator),
Mr. G. Babini, Via G. Beis Romoli 274, I-101488, Torino, Italy
- ATR, ETL & JEIDA:
- Contact
person: K. Kataoka, AI and Fuzzy Promotion Center, Japan Information
Processing Development Center (JIPDEC), 3-5-8 Shibakoen, Minatoku,
Tokyo 105, Japan, TEL. +81 3 3432 9390, FAX. +81 3 3431 4324
- Australian National Database of Spoken
Language (ANDOSL):
- Corpus contact: Bruce Millar, Computer Sciences
Laboratory, Research School of Information Sciences and Engineering,
Australian National University, Canberra, ACT 0200, Australia, email:
bruce@cslab.anu.edu.au
- BREF:
- Corpus contact: send email to
bref@limsi.fr
- Bramshill:
- LDC (as above)
- CAR & Waxholm:
- Corpus contact:
Bjorn Granstrom bjorn@speech.kth.se
- Center for Spoken Language Understanding (CSLU):
-
Information on the collection and availability of CSLU corpora
can be obtained on the World Wide Web,
http://www.cse.ogi.edu/CSLU/corpora.html
- Chinese National Speech Corpus:
- Contact person: Prof. Jialu Zhang, Academia Sinica,
Institute of Acoustics, 17 Shongguanjun St, Beijing PO Box 2712,
100080 Beijing, Peoples Republic of China
- ERBA:
- Corpus contact: Stefan Rieck, Lehrstuhl
Informatik 5 (Pattern Recognition), University of Erlangen-Nurnberg,
Martensstr.3 , 8520 Erlangen, Germany, Email:
rieck@informatik.uni-erlangen.de
- ETL:
- see ATR above.
- EUROM1:
- Project contact for Multilingual speech
database: A. Fourcin (UCL) adrian@phonetics.ucl.ac.uk; or the
following for individual languages:
D: D. Gibbon (Un.Bielefeld) gibbon@asl.uni-bielefeld.de
DK: B. Lindberg (IES) bli@stc.auc.dk
F: J.F. Serignat (ICP) serignat@icp.grenet.fr
I: G. Castagneri (CSELT) castagneri@cselt.stet.it
N: T. Svendsen (SINTEF-DELAB)
torbjorn@telesun.tele.unit.no
NL: J. Hendriks or L. Boves (PTT Research)
boves@lett.kun.nl
SW: G. Hult (Televerket) or B. Granstrom (KTH)
bjorn@speech.kth.se
UK: A. Fourcin (UCL) adrian@phonetics.ucl.ac.uk
Contact for SAM-A EUROM1:
E: A. Moreno (UPC) amoreno@tsc.upc.es
G: J. Mourjopoulos (UPatras) mourjop@grpafvx1.earn
P: I. Trancoso (INESC) imt@inesc.pt
- EuroCocosda:
- Corpus contact: A Fourcin, email:
adrian@phonetics.ucl.ac.uk
- European Language Resources Association (ELRA):
- For
membership information contact: Sarah Houston, email:
100126.1262@compuserve.com
- European Network in Language and Speech (ELSNET):
-
OTS, Utrecht University, Trans 10, 3512 JK, Utrecht, The Netherlands,
Email: elsnet@let.ruu.nl
- Groningen:
- Corpus contact: Els den Os, Speech
Processing Expertise Centre, P.O.Box 421, 2260 AK Leidschendam, The
Netherlands, els@spex.nl (CDs available via ELSNET)
- JEIDA:
- see ATR above.
- LRE ONOMASTICA:
- Project contact:
M. Jack, CCIR, University of Edinburgh, mervyn.jack@ed.ac.uk
- Linguistic Data Consortium (LDC):
- see LDC above.
- Normal Speech Corpus:
- Corpus
Contact: Steve Crowdy, Longman UK, Burnt Mill, Harlow, CM20 2JE, UK
- Oregon Graduate Institute (OGI):
- see CSLU above.
- PAROLE:
- Project contact: Mr. T. Schneider, Sietec
Systemtechnik Gmbh, Nonnendammallee 101, D-13629 Berlin
- PHONDAT2:
- Corpus contact: B. Eisen,
University of Munich, Germany
- POINTER:
- Project contact: Mr. Corentin Roulin ,
BJL Consult, Boulevard du Souverain 207/12, B-1160 Bruxelles
- POLYGLOT:
- Contact person: Antonio Cantatore,
Syntax Sistemi Software, Via G. Fanelli 206/16, I- 70125 Bari, Italy
- Relator:
- Project contact: A. Zampolli, Istituto
di Linguistica Computazionale, CNR, Pisa, I, E-mail:
giulia@icnucevm.cnuce.cnr.it; Information as well as a list of
resources, is available on the World Wide Web,
http://www.XX.relator.research.ec.org
- ROARS:
- Contact person: Pierre Alinat,
Thomson-CSF/Sintra-ASM, 525 Route des Dolines, Parc de Sophia
Antipolis, BP 138, F-06561 Valbonne, France
- SCRIBE:
- Corpus contact: Mike Tomlinson, Speech
Research Unit, DRA, Malvern, Worc WR14 3PS, England
- SPEECHDAT:
- Project contact: Mr. Harald Hoege,
Siemens AG, Otto Hahn Ring 6, D-81739 Munich
- SPELL:
- Contact person: Jean-Paul Lefevre, Agora
Conseil, 185, Hameau de Chateau, F-38360 Sassenage, France
- SUNDIAL:
- Contact person: Jeremy Peckham,
Vocalis Ltd., Chaston House, Mill Court, Great Shelford, Cambs CB2
5LD UK, email: jeremy@vocalis.demon.co.uk
- SUNSTAR:
- Joachin Irion, EG Electrocom Gmbh,
Max-Stromeyerstr. 160, D- 7750 Konstanz, Germany
- VERBMOBIL:
- Corpus contact: B. Eisen,
University of Munich, Germany
- Wall Street Journal, Cambridge, zero (WSJCAM0):
-
Corpus contact: Linguistic Data Consortium (LDC), Univ. of
Pennsylvania, 441 Williams Hall, Philadelphia, PA, USA 19104-6305,
(215) 898-0464
- Waxholm:
- see CAR above.
Contact information for the corpora mentioned in
section 13.10 is provided here in alphabetical order.
- Electrotechnical Laboratory (ETL) Character Database:
-
Distributor: Image Understanding Section, Electrotechnical
Laboratory, 1-1-4, Umezono, Tsukuba, Ibaraki, 305, Japan.
- National Institute of Standards and Technology (NIST):
-
Distributor: Standard Reference Data, National Institute of Standards
and Technology, 221/A323, Gaithersburg, MD 20899, USA.
- U.S. Postal Service:
- Distributor:
CEDAR, SUNY at Buffalo, Dept. of Computer Science, 226 Bell Hall,
Buffalo, NY 14260, USA.
- University of Washington:
-
Distributor: Intelligent Systems Laboratory, Dept. of Electrical
Engineering, FT-10, University of Washington, Seattle, WA 98195, USA
Next: 12.7 References
Up: 12 Language Resources
Previous: 12.5 Terminology