next up previous contents index
Next: 12.7 References Up: 12 Language Resources Previous: 12.5 Terminology

12.6 Addresses for Language Resources

12.6.1 Written Language Corpora

Contact information for the corpora mentioned in section gif is provided here in alphabetical order.

British National Corpus (BNC):
smbowie@vax.oxford.ac.uk

Consortium for Lexical Research (CLR):
lexical@nmsu.edu

Dansk Korpus (DK):
olenc@coco.ihi.ku.dk (Ole Norling-Christensen)

European Corpus Initiative (ECI):
(in Europe): eucorp@cogsci.edinburgh.ac.uk

European Corpus Initiative (ECI):
(in U.S.) LDC: ehodas@unagi.cis.upenn.edu

Frantext of Institut National de la Langue Francaise (INaLF-CNRS):
emartin@FRCII171 (Eveline Martin)

Institut für deutsche Sprache (IDS):
neumann@ids-mannheim.de (Robert Neumann)

Instituut voor Nederlandse Lexicologie (INL):
postmaster@hnympi52.bitnet

International Computer Archive of Modern English (ICAME):
stigj@hedda.uio.no (Stig Johansson)

Istituto di Linguistica Computazionale (ILC-CNR):
glottolo@vm.cnuce.cnr.it (Antonio Zampolli)

Linguistic Data Consortium (LDC):
ehodas@unagi.cis.upenn.edu (Elizabeth Hodas). The WWW page is http://www.cis.upenn.edu/ ldc. Information about the LDC and its activities can also be obtained via anonymous FTP ftp.cis.upenn.edu under pub/ldc. Most of the data are compressed using the tool Shorten by T. Robinson which is available via ft svr-ftp.eng.cam.ac.uk

Norsk Tekstarkiv:
per.vestbostad@hd.uib.no (Per Vestbostad)

Spanish Reference Corpus Project:
marcos@emduam11.bitnet (Francisco Marcos Marin), Sociedad Estatal del V Centenario

Stockholm-Umea Corpus (SUC):
gunnel@ling.su.se (Gunnel Kallgren); ejerhed@ling.umu.se (Eva Ejerhed); Sprakdata gellerstam@svenska.gu.se (Martin Gellerstam)

Text Encoding Initiative (TEI):
lou@vax.ox.ac.uk (Lou Burnard), u35395@uicvm.bitnet (C.M. Sperberg McQueen)

University of Helsinki:
fkarlsso@ling.helsinki.fi (Fred Karlsson)

12.6.2 Spoken Language Corpora

Contact information for the corpora mentioned in section gif is provided here in alphabetical order.

ACCOR:
Project contact: Prof. W. Hardcastle, sphard@queen-margaret-college.main.ac.uk; Prof. A. Marchal, phonetic@fraix11.bitnet (The British English portion of the ACCOR corpus is being produced on CDROM with partial financing from ELSNET)

ALBAYZIN:
Corpus contact: Professor Climent Nadeu, Department of Speech Signal Theory and Communications, Universitat Politecnica de Catalunya, ETSET, Apartat 30002, 08071 Barcelona, Spain, nadeu@tsc.upc.es

ARS:
CSELT (coordinator), Mr. G. Babini, Via G. Beis Romoli 274, I-101488, Torino, Italy

ATR, ETL & JEIDA:
Contact person: K. Kataoka, AI and Fuzzy Promotion Center, Japan Information Processing Development Center (JIPDEC), 3-5-8 Shibakoen, Minatoku, Tokyo 105, Japan, TEL. +81 3 3432 9390, FAX. +81 3 3431 4324

Australian National Database of Spoken Language (ANDOSL):
Corpus contact: Bruce Millar, Computer Sciences Laboratory, Research School of Information Sciences and Engineering, Australian National University, Canberra, ACT 0200, Australia, email: bruce@cslab.anu.edu.au

BREF:
Corpus contact: send email to bref@limsi.fr

Bramshill:
LDC (as above)

CAR & Waxholm:
Corpus contact: Bjorn Granstrom bjorn@speech.kth.se

Center for Spoken Language Understanding (CSLU):
Information on the collection and availability of CSLU corpora can be obtained on the World Wide Web, http://www.cse.ogi.edu/CSLU/corpora.html

Chinese National Speech Corpus:
Contact person: Prof. Jialu Zhang, Academia Sinica, Institute of Acoustics, 17 Shongguanjun St, Beijing PO Box 2712, 100080 Beijing, Peoples Republic of China

ERBA:
Corpus contact: Stefan Rieck, Lehrstuhl Informatik 5 (Pattern Recognition), University of Erlangen-Nurnberg, Martensstr.3 , 8520 Erlangen, Germany, Email: rieck@informatik.uni-erlangen.de

ETL:
see ATR above.

EUROM1:
Project contact for Multilingual speech database: A. Fourcin (UCL) adrian@phonetics.ucl.ac.uk; or the following for individual languages:
D: D. Gibbon (Un.Bielefeld) gibbon@asl.uni-bielefeld.de
DK: B. Lindberg (IES) bli@stc.auc.dk
F: J.F. Serignat (ICP) serignat@icp.grenet.fr
I: G. Castagneri (CSELT) castagneri@cselt.stet.it
N: T. Svendsen (SINTEF-DELAB) torbjorn@telesun.tele.unit.no
NL: J. Hendriks or L. Boves (PTT Research) boves@lett.kun.nl
SW: G. Hult (Televerket) or B. Granstrom (KTH) bjorn@speech.kth.se
UK: A. Fourcin (UCL) adrian@phonetics.ucl.ac.uk
Contact for SAM-A EUROM1:
E: A. Moreno (UPC) amoreno@tsc.upc.es
G: J. Mourjopoulos (UPatras) mourjop@grpafvx1.earn
P: I. Trancoso (INESC) imt@inesc.pt

EuroCocosda:
Corpus contact: A Fourcin, email: adrian@phonetics.ucl.ac.uk

European Language Resources Association (ELRA):
For membership information contact: Sarah Houston, email: 100126.1262@compuserve.com

European Network in Language and Speech (ELSNET):
OTS, Utrecht University, Trans 10, 3512 JK, Utrecht, The Netherlands, Email: elsnet@let.ruu.nl

Groningen:
Corpus contact: Els den Os, Speech Processing Expertise Centre, P.O.Box 421, 2260 AK Leidschendam, The Netherlands, els@spex.nl (CDs available via ELSNET)

JEIDA:
see ATR above.

LRE ONOMASTICA:
Project contact: M. Jack, CCIR, University of Edinburgh, mervyn.jack@ed.ac.uk

Linguistic Data Consortium (LDC):
see LDC above.

Normal Speech Corpus:
Corpus Contact: Steve Crowdy, Longman UK, Burnt Mill, Harlow, CM20 2JE, UK

Oregon Graduate Institute (OGI):
see CSLU above.

PAROLE:
Project contact: Mr. T. Schneider, Sietec Systemtechnik Gmbh, Nonnendammallee 101, D-13629 Berlin

PHONDAT2:
Corpus contact: B. Eisen, University of Munich, Germany

POINTER:
Project contact: Mr. Corentin Roulin , BJL Consult, Boulevard du Souverain 207/12, B-1160 Bruxelles

POLYGLOT:
Contact person: Antonio Cantatore, Syntax Sistemi Software, Via G. Fanelli 206/16, I- 70125 Bari, Italy

Relator:
Project contact: A. Zampolli, Istituto di Linguistica Computazionale, CNR, Pisa, I, E-mail: giulia@icnucevm.cnuce.cnr.it; Information as well as a list of resources, is available on the World Wide Web, http://www.XX.relator.research.ec.org

ROARS:
Contact person: Pierre Alinat, Thomson-CSF/Sintra-ASM, 525 Route des Dolines, Parc de Sophia Antipolis, BP 138, F-06561 Valbonne, France

SCRIBE:
Corpus contact: Mike Tomlinson, Speech Research Unit, DRA, Malvern, Worc WR14 3PS, England

SPEECHDAT:
Project contact: Mr. Harald Hoege, Siemens AG, Otto Hahn Ring 6, D-81739 Munich

SPELL:
Contact person: Jean-Paul Lefevre, Agora Conseil, 185, Hameau de Chateau, F-38360 Sassenage, France

SUNDIAL:
Contact person: Jeremy Peckham, Vocalis Ltd., Chaston House, Mill Court, Great Shelford, Cambs CB2 5LD UK, email: jeremy@vocalis.demon.co.uk

SUNSTAR:
Joachin Irion, EG Electrocom Gmbh, Max-Stromeyerstr. 160, D- 7750 Konstanz, Germany

VERBMOBIL:
Corpus contact: B. Eisen, University of Munich, Germany

Wall Street Journal, Cambridge, zero (WSJCAM0):
Corpus contact: Linguistic Data Consortium (LDC), Univ. of Pennsylvania, 441 Williams Hall, Philadelphia, PA, USA 19104-6305, (215) 898-0464

Waxholm:
see CAR above.

12.6.3 Character Recognition

Contact information for the corpora mentioned in section 13.10 is provided here in alphabetical order.

Electrotechnical Laboratory (ETL) Character Database:
Distributor: Image Understanding Section, Electrotechnical Laboratory, 1-1-4, Umezono, Tsukuba, Ibaraki, 305, Japan.

National Institute of Standards and Technology (NIST):
Distributor: Standard Reference Data, National Institute of Standards and Technology, 221/A323, Gaithersburg, MD 20899, USA.

U.S. Postal Service:
Distributor: CEDAR, SUNY at Buffalo, Dept. of Computer Science, 226 Bell Hall, Buffalo, NY 14260, USA.

University of Washington:
Distributor: Intelligent Systems Laboratory, Dept. of Electrical Engineering, FT-10, University of Washington, Seattle, WA 98195, USA



next up previous contents
Next: 12.7 References Up: 12 Language Resources Previous: 12.5 Terminology