NATURAL LANGUAGE SOFTWARE REGISTRY QUESTIONNAIRE ------------------------------------- 0. Note ------------------------------------- This questionnaire is designed to be evaluated automatically. Therefore there are several (hopefully easy) restrictions in filling the form. Upper-case-Strings with two stars are key-words needed by the analyzing scripts. Please don't delete them. Give your answers between the brackets [[ ]], replacing the default-strings written in them. If there are three dots "...", your answer could be a free structured, multi-line ASCII-text. If the string "+/-" occurs, please decide wether your answer is positive ("+") or negative ("-"). You are free not to answer a certain question. To do that, delete the default string and write NOTHING between the brackets. Special signs are prefered to be in TeX-notation. If you don't know how to do that, don't worry and write them in your prefered mode. ------------------------------------- 1. general questions ------------------------------------- 1.1 name of your system **NAME[[ ... ]] 1.2 licence **RESEARCH[[ +/- ]] **COMMERCIAL[[ +/- ]] 1.3 authors {people responsible for program design and implementation} **AUTHORS[[ ... ]] 1.4 person to contact {person who distributes the software} **CONTACT_NAME[[ ... ]] **CONTACT_INSTITUTION[[ ... ]] **CONTACT_DEPARTMENT[[ ... ]] **CONTACT_STREET[[ ... ]] **CONTACT_CITY[[ ... ]] **CONTACT_COUNTRY[[ ... ]] **CONACT_PHONE[[ ... ]] {with Zip & access code} **CONTACT_EMAIL[[ ... ]] {Internet electronic mail address & network} 1.5 status {current level of your system} 1.5.1 coverage level {rough estimate of the scale of the project} **DEMONSTRATION[[ +/- ]] **SMALL_RESEARCH[[ +/- ]] **LARGE_RESEARCH[[ +/- ]] **PRODUCTION_QUALITY[[ +/- ]] **HIGH_VOLUME[[ +/- ]] 1.5.2 is there a stable version of the system ? **STABLE[[ +/- ]] 1.5.3 is there continuing development ? **CONTINUING_DEVELOPMENT[[ +/- ]] 1.5.4 examples {number and type of examples on which the system was tested} 1.5.4.1 example type **WORDS[[ +/- ]] **SENTENCES[[ +/- ]] **PARAGRAPHS[[ +/- ]] **OTHER_TYPES[[ ... ]] 1.5.4.2 number of examples processed successfully (as a power of 10) **SUCCESSFUL_EXAMPLES[[ ... ]] 1.6 support {which services are available to users} **UPGRADES[[ +/- ]] **SOURCE_CODE[[ +/- ]] **CONSULTING[[ +/- ]] **OTHER_SUPPORT[[ ... ]] 1.7 format for software distribution **DISK[[ +/- ]] **FTP[[ +/- ]] **WHICH_FTP_SITE[[ ... ]] {Internet address of the ftp-site} **TAPE[[ +/- ]] **OTHER_FORMAT[[ +/- ]] 1.8 price {distribution costs} **PRICE[[ ... ]] 1.9 restrictions on use {legal obligation of the user} **RESTRICTIONS[[ ... ]] ------------------------------------- 2. type of system ------------------------------------- 2.1 primary task of system {primary activity for which the system was designed } **LINGUISTIC_ANALYSIS[[ +/- ]] **TEST_OF_LINGUISTIC_THEORY[[ +/- ]] **WHICH_LINGUISTIC_THEORY[[ ... ]] {which theory is tested} **TEST_GENERATION[[ +/- ]] **MACHINE_TRANSLATION[[ +/- ]] **TEXT_PROOFING[[ +/- ]] **DATABASE_INTERFACE[[ +/- ]] **OTHER_TASKS[[ ... ]] ------------------------------------- 3. detailed questions ------------------------------------- 3.1 description {a free text describing the program} Summarize the main goals and ideas. Indicate what makes the project a useful and interesting tool for research applications. **DESCRIPTION[[ ... ]] 3.2 program 3.2.1 programming language **PROGRAMMING_LANGUAGE[[ ... ]]] 3.2.2 platform {software or hardware required to run the program - such as a particular operating system} **PLATFORM[[ ... ]] 3.2.3 lines of source code **LINES_OF_SOURCE_CODE[[ ... ]] 3.2.4 kilobytes of executable **KILO_OF_EXECUTABLE[[ ... ]] 3.2.5 program components {major modules} **PHONOLOGICAL_ANALYZER/GENERATOR[[ +/- ]] **MORPHOLOGICAL_ANALYZER/GENERATOR[[ +/- ]] **PARSER/GENERATOR[[ +/- ]] **SEMANTIC_INTERPRETER[[ +/- ]] **KNOWLEDGE_REPRESENTATION[[ +/- ]] **DISCOURSE_STRUCTURE[[ +/- ]] **PRAGMATIC_FEATURES[[ +/- ]] **OTHER_COMPONENTS[[ ... ]] 3.2.6 program extensible by {augmentation of algorithms} **THE_DEVELOPER[[ +/- ]] **THE_COMPUTATIONAL_LINGUIST[[ +/- ]] **THE_LINGUIST[[ +/- ]] **THE_PROGRAMMER[[ +/- ]] **THE_EXPERIENCED_USER[[ +/- ]] **THE_NEW_USER[[ +/- ]] 3.2.7 Which components are available independently ? **MODULARITY[[ ... ]] 3.3 data 3.3.1 data components {give count, language and features, like: 120,000 entry wordlist for French 5,000 word LFG lexicon Swahili w/ affixes, English gloss 15 rule TG for Dutch cross-serial dependencies 200 node knowledge base for AIDS case histories w/ 10 30-node cases.} **DATA_COMPONENTS[[ ... ]] 3.3.2 data components extensible by {augmentation of data} **D_THE_DEVELOPER[[ +/- ]] **D_THE_COMPUTATIONAL_LINGUIST[[ +/- ]] **D_THE_LINGUIST[[ +/- ]] **D_THE_PROGRAMMER[[ +/- ]] **D_THE_EXPERIENCED_USER[[ +/- ]] **D_THE_NEW_USER[[ +/- ]] 3.3.3 Which data components are independent of program ? **D_MODULARITY[[ ... ]] 3.3.4 character set used for language data **PROGRAMMABLE[[ +/- ]] **WHICH_PROGRAMMABLE[[ ... ]] **FIXED_16BIT_UNICODE[[ +/- ]] **FIXED_8BIT_ISO[[ +/- ]] **WHICH_FIXED_8BIT_ISO[[ ... ]] **PROPRIETARY_ASCII[[ +/- ]] **FIXED_7BIT_ISO[[ +/- ]] **WHICH_FIXED_7BIT_ISO[[ ... ]] **EXTENDED_ASCII[[ +/- ]] **WHICH_EXTENDED_ASCII[[ ... ]] **OTHER_CHARACTER_SETS[[ ... ]] 3.4 languages 3.4.1 current languages {languages for which lexicons or grammars are supplied, or to which the algorithms are applicable} **CURRENT_LANGUAGES[[ ... ]] 3.4.2 range of applicable natural languages {which other natural languages can be substituted} **RANGE_OF_NATURAL_LANGUAGES[[ ... ]] { give theoretical or technical limits } 3.6 man years of work **MAN_YEARS_OF_WORK[[ ... ]] 3.7 documentation {manuals and research reports} 3.7.1 list documents in which the software is described **DOCUMENTATION[[ ... ]] 3.7.2 user documentation **USER_DOCUMENTATION[[ ... ]] 3.7.3 system documentation **SYSTEM_DOCUMENTATION[[ ... ]] -------------------------------------- 5. Note -------------------------------------- If you are willing to have the software reviewed, please send us a version along with this information. We are also interested in reports and documentation, even for software not reviewed.