From: registry@dfki.uni-sb.de (The Software Registry) NATURAL LANGUAGE SOFTWARE REGISTRY There are now many natural language software projects, both commercial and noncommercial. In order to facilitate the exchange and evaluation of software, the Natural Language Software Registry is cataloging projects for the community. The current version includes + speech signal processors, such as the Computerized Speech Lab (Kay Elemetrics) + morphological analyzers, such as PC-KIMMO (Summer Institute for Linguistics) + parsers, such as Alveytools (University of Edinburgh) + semantic and pragmatic analyzer, such as NLL (University of Saarbruecken) + generation programs, such as FUF (Ben Gurion University of the Negev) + knowledge representation systems, such as Rhet (University of Rochester) + multicomponent systems, such as ELU (ISSCO), PENMAN (ISI), Pundit (UNISYS), SNePS (SUNY Buffalo), + NLP-Tools, such as GULP (University of Georgia) or Linguist (Kansai Research Laboratory) + applications programs (misc.) If you have developed a piece of software for natural language processing that other researchers might find useful, you can include it by returning the description form below. If you are willing to have your software reviewed, please send us a version along with this form. We are also interested in reports and documentation, even for software not reviewed. Anyone wishing to obtain the document or the questionnaire is invited to contact one of the following addresses: Ftp-Serer: Germany: ftp.dfki.uni-sb.de (134.96.188.252) (directory: pub/registry, user name: anonymous) USA: crlftp.nmsu.edu (128.123.1.33) (directory: pub/non-lexical/NL_Software_Registry, user name: anonymous) dri.cornell.edu (128.84.180.39) (directory: /pub/Natural_Language_Software_Registry or /pub/NLSR, user name: anonymous) UK: svr-ftp.eng.cam.ac.uk(129.169.24.20) (directory: /pub/comp.speech/info/ NaturalLanguageSoftwareRegistry, user name: anonymous) WWW-Server: Germany: http://cl-www.dfki.uni-sb.de/cl/registry/draft.html e-mail: registry@dfki.uni-sb.de post: Natural Language Software Registry Deutsches Forschungszentrum fuer Kuenstliche Intelligenz (DFKI) Stuhlsatzenhausweg 3 D-66123 Saarbruecken Germany If you have questions, please do not hesitate to drop us an email message. Christoph Jung, Markus Vonerden NATURAL LANGUAGE SOFTWARE REGISTRY QUESTIONNAIRE ------------------------------------- 0. Note ------------------------------------- This questionnaire is designed to be evaluated automatically. Therefore there are several (hopefully easy) restrictions in filling the form. Upper-case-Strings with two stars are key-words needed by the analyzing scripts. Please don't delete them. Give your answers between the brackets [[ ]], replacing the default-strings written in them. If there are three dots "...", your answer could be a free structured, multi-line ASCII-text. If the string "+/-" occurs, please decide wether your answer is positive ("+") or negative ("-"). You are free not to answer a certain question. To do that, delete the default string and write NOTHING between the brackets. Special signs are prefered to be in TeX-notation. If you don't know how to do that, don't worry and write them in your prefered mode. ------------------------------------- 1. general questions ------------------------------------- 1.1 name of your system **NAME[[ ... ]] 1.2 licence **RESEARCH[[ +/- ]] **COMMERCIAL[[ +/- ]] 1.3 authors {people responsible for program design and implementation} **AUTHORS[[ ... ]] 1.4 person to contact {person who distributes the software} **CONTACT_NAME[[ ... ]] **CONTACT_INSTITUTION[[ ... ]] **CONTACT_DEPARTMENT[[ ... ]] **CONTACT_STREET[[ ... ]] **CONTACT_CITY[[ ... ]] **CONTACT_COUNTRY[[ ... ]] **CONACT_PHONE[[ ... ]] {with Zip & access code} **CONTACT_EMAIL[[ ... ]] {Internet electronic mail address & network} 1.5 status {current level of your system} 1.5.1 coverage level {rough estimate of the scale of the project} **DEMONSTRATION[[ +/- ]] **SMALL_RESEARCH[[ +/- ]] **LARGE_RESEARCH[[ +/- ]] **PRODUCTION_QUALITY[[ +/- ]] **HIGH_VOLUME[[ +/- ]] 1.5.2 is there a stable version of the system ? **STABLE[[ +/- ]] 1.5.3 is there continuing development ? **CONTINUING_DEVELOPMENT[[ +/- ]] 1.5.4 examples {number and type of examples on which the system was tested} 1.5.4.1 example type **WORDS[[ +/- ]] **SENTENCES[[ +/- ]] **PARAGRAPHS[[ +/- ]] **OTHER_TYPES[[ ... ]] 1.5.4.2 number of examples processed successfully (as a power of 10) **SUCCESSFUL_EXAMPLES[[ ... ]] 1.6 support {which services are available to users} **UPGRADES[[ +/- ]] **SOURCE_CODE[[ +/- ]] **CONSULTING[[ +/- ]] **OTHER_SUPPORT[[ ... ]] 1.7 format for software distribution **DISK[[ +/- ]] **FTP[[ +/- ]] **WHICH_FTP_SITE[[ ... ]] {Internet address of the ftp-site} **TAPE[[ +/- ]] **OTHER_FORMAT[[ +/- ]] 1.8 price {distribution costs} **PRICE[[ ... ]] 1.9 restrictions on use {legal obligation of the user} **RESTRICTIONS[[ ... ]] ------------------------------------- 2. type of system ------------------------------------- 2.1 primary task of system {primary activity for which the system was designed } **LINGUISTIC_ANALYSIS[[ +/- ]] **TEST_OF_LINGUISTIC_THEORY[[ +/- ]] **WHICH_LINGUISTIC_THEORY[[ ... ]] {which theory is tested} **TEST_GENERATION[[ +/- ]] **MACHINE_TRANSLATION[[ +/- ]] **TEXT_PROOFING[[ +/- ]] **DATABASE_INTERFACE[[ +/- ]] **OTHER_TASKS[[ ... ]] ------------------------------------- 3. detailed questions ------------------------------------- 3.1 description {a free text describing the program} Summarize the main goals and ideas. Indicate what makes the project a useful and interesting tool for research applications. **DESCRIPTION[[ ... ]] 3.2 program 3.2.1 programming language **PROGRAMMING_LANGUAGE[[ ... ]]] 3.2.2 platform {software or hardware required to run the program - such as a particular operating system} **PLATFORM[[ ... ]] 3.2.3 lines of source code **LINES_OF_SOURCE_CODE[[ ... ]] 3.2.4 kilobytes of executable **KILO_OF_EXECUTABLE[[ ... ]] 3.2.5 program components {major modules} **PHONOLOGICAL_ANALYZER/GENERATOR[[ +/- ]] **MORPHOLOGICAL_ANALYZER/GENERATOR[[ +/- ]] **PARSER/GENERATOR[[ +/- ]] **SEMANTIC_INTERPRETER[[ +/- ]] **KNOWLEDGE_REPRESENTATION[[ +/- ]] **DISCOURSE_STRUCTURE[[ +/- ]] **PRAGMATIC_FEATURES[[ +/- ]] **OTHER_COMPONENTS[[ ... ]] 3.2.6 program extensible by {augmentation of algorithms} **THE_DEVELOPER[[ +/- ]] **THE_COMPUTATIONAL_LINGUIST[[ +/- ]] **THE_LINGUIST[[ +/- ]] **THE_PROGRAMMER[[ +/- ]] **THE_EXPERIENCED_USER[[ +/- ]] **THE_NEW_USER[[ +/- ]] 3.2.7 Which components are available independently ? **MODULARITY[[ ... ]] 3.3 data 3.3.1 data components {give count, language and features, like: 120,000 entry wordlist for French 5,000 word LFG lexicon Swahili w/ affixes, English gloss 15 rule TG for Dutch cross-serial dependencies 200 node knowledge base for AIDS case histories w/ 10 30-node cases.} **DATA_COMPONENTS[[ ... ]] 3.3.2 data components extensible by {augmentation of data} **D_THE_DEVELOPER[[ +/- ]] **D_THE_COMPUTATIONAL_LINGUIST[[ +/- ]] **D_THE_LINGUIST[[ +/- ]] **D_THE_PROGRAMMER[[ +/- ]] **D_THE_EXPERIENCED_USER[[ +/- ]] **D_THE_NEW_USER[[ +/- ]] 3.3.3 Which data components are independent of program ? **D_MODULARITY[[ ... ]] 3.3.4 character set used for language data **PROGRAMMABLE[[ +/- ]] **WHICH_PROGRAMMABLE[[ ... ]] **FIXED_16BIT_UNICODE[[ +/- ]] **FIXED_8BIT_ISO[[ +/- ]] **WHICH_FIXED_8BIT_ISO[[ ... ]] **PROPRIETARY_ASCII[[ +/- ]] **FIXED_7BIT_ISO[[ +/- ]] **WHICH_FIXED_7BIT_ISO[[ ... ]] **EXTENDED_ASCII[[ +/- ]] **WHICH_EXTENDED_ASCII[[ ... ]] **OTHER_CHARACTER_SETS[[ ... ]] 3.4 languages 3.4.1 current languages {languages for which lexicons or grammars are supplied, or to which the algorithms are applicable} **CURRENT_LANGUAGES[[ ... ]] 3.4.2 range of applicable natural languages {which other natural languages can be substituted} **RANGE_OF_NATURAL_LANGUAGES[[ ... ]] { give theoretical or technical limits } 3.6 man years of work **MAN_YEARS_OF_WORK[[ ... ]] 3.7 documentation {manuals and research reports} 3.7.1 list documents in which the software is described **DOCUMENTATION[[ ... ]] 3.7.2 user documentation **USER_DOCUMENTATION[[ ... ]] 3.7.3 system documentation **SYSTEM_DOCUMENTATION[[ ... ]] -------------------------------------- 5. Note -------------------------------------- If you are willing to have the software reviewed, please send us a version along with this information. We are also interested in reports and documentation, even for software not reviewed. Sincerely, Christoph Jung, Markus Vonerden Natural Language Software Registry Deutsches Forschungszentrum fuer Kuenstliche Intelligenz (DFKI) Stuhlsatzenhausweg 3 D-66123 Saarbruecken Germany phone: +49 (681) 303-5282 e-mail: registry@dfki.uni-sb.de