The expanded system of the coordinated dictionaries and its using for universal semantic coding and translation of the polysemous source text
Annotation. Semantic coding is based on supplementing the polysemantic words and word-combinations by components of dictionary entries that the author chooses in the basic explanatory dictionary of the source language (the native language for the author). The universality consists in subsequent machine translation into other languages using bilingual dictionaries coordinated with a basic explanatory dictionary. The process of coding takes into account not only the wide range of meanings of polysemantic words, but also the ambiguity or incompleteness in the expression of actions and states, where in one language such features as gender and number, character of action and state, etc., are determined only by the context, while in another language they may be reflected in concrete grammatical forms.
Essence of the proposed method. A preliminary identification of the source text.
Characteristics of the expanded system of the coordinated dictionaries.
The special sections of differentiated (morphological and syntactic) meanings in the coordinated dictionary entries.
The semantic coding of the source text.
The translation of the encoded source text into target languages.
Some characteristic examples explaining operations of semantic coding and translating
Some considerations of general character.
(examples of coding and translation, fragments of coordinated dictionary entries).
App. 1. The phrases containing polysemantic words.
App. 2. The phrases containing polysemantic set expressionsand set expressions characterized by the structural formula with variable lexical components.
App. 3. The coding and translation with using the differentiated (morphological) meanings of the verbal forms.
App. 4. The coding and translation with using the differentiated (morphological) meanings of the pronouns.
The presuppositions.As a rule, polysemantic words and expressions inevitably exist in source texts. Translation of these words and expressions depends on the context. Analysis of the sense and form of a wide context is impractical with traditional machine translation programs.
Many source texts published in the world require translation into other languages. Among these are articles in national scientific magazines, descriptions of inventions published in the regional funds of patents etc. All are of interest for users speaking other languages.
Therefore, the source text which can be potentially translated into many target languages should contain certain
additions which, in a universal form, would indicate polysemantic words’ and expressions’ actual semantic meanings.
Such additions should also reflect the grammatical (morphological and syntactic) features of the wider context. During translating into target languages these additions allow to choose the corresponding grammatical forms of words or words' groups and also to choose syntactic links between words.
Such semantic additions eliminate both the polysemy of the source language, and the mutual ambiguity arising during translation; it will allow to receive correct over the sense and over the form the machine translation into any target language.
Some rather abstract methods of semantic coding of source text are known [1-3, 3*].
One of the early methods, based on semantic analysis of polysemantic words and their semantic coding, is described in US Patent 5,285,386 “Machine translation apparatus having means for translating polysemous words using dominated codes”.
This method of semantic analysis is universal in that it does not depend on the target language of the translation.
Some codes of "philosophical" nature are offered. This method uses four classification categories to present all information on a word: the great category (it is designated by the first numeral), middle, small and thin categories of classifications.
All words in a thesaurus are classified in ten great classifications, including "character / nature", "property", "change", "action", "feeling", "person", "propensity", "society", "culture", "subject".
Each great classification is divided into ten middle classifications. In this embodiment a code with a symbol S is given by the following form:
S0 belongs to "character/nature"
S02 belongs to "weather" in "nature"
S028 belongs to "wind" in "weather"
S028a belongs to "pressure" in "wind" ….
The given codes of classification have hierarchical character: the range of meanings for the upper semantic code is wider than for the lower semantic code.
As has been noted, the method described in this patent can have a universal character; however the offered hierarchical system of the codes describing semantic meanings of words and dependences between words has too abstract character, i.e. it is far from the real problems of translation of expressions and word-combinations with their various usage and complex syntactic structures, etc.
Elements of semantic coding of similar abstract character are offered in US Patent 5,523,946 “Compact encoding of multi-lingual translation dictionaries”.
Each word is a file containing information codes pertinent to some base abstract language; this information is based on the concepts of type - part of speech, gender, number, punctuation mark, area of meaning of a given word (for example, a field of knowledge, actions, …).
For example, the code for the verb “to know” may include the attribute (sth) which limits the field of knowledge.
The coding is done by an editor of the source text.
The following source phrase is given as an example: "to have down pat, be an expert"; this phrase is divided into the following symbols: "to" "have" "down" "pat" "," "be" "an" "expert".
The codes for each symbol and its concept have the following form:
1991 "to" 872 "have" 501 "down" 18005 "pat" 17 ","
427 "be" 95 "an" 2407 "expert".
The string, representing this phrase, contains the following numeric sequence: 1991 872 501 18005 17 427 95 2407.
As stated in the description, the given translation file consists of about 6700 groups of concepts.
In sum, this method contains elements of semantic coding in the form of concepts of an abstract basic language.
This coding, however, is also very superficial inasmuch as it is not concerned the concrete semantic meanings of words, but is based on general attributes (part of speech, gender, field of knowledge, etc.).
Finally, there is the Universal Networking Language project (UNL) [3,3*], which has being in development for a long time. This project aims to create a universal artificial language by developing codes, which, in some abstract form, would reflect the set of semantic and grammatical features of a source language, and, at the same time, reflect translational specifics of a set of target languages.
In other words, these artificial semantic codes would have to be refined and coordinated as much as possible, i.e. they would need to simultaneously reflect the intra-language polysemy and the mutual ambiguity, arising in translation[16, 16 *].
A special apparatus of coding of semantic meanings is elaborated for this purpose. The UNL has its own dictionary UW (universal word) made up on the basis of English, which is an input. Specifically, it is English words that are used to determine semantic and syntactic constraints or restrictions.
The text on UNL is universal words with additional symbols of type:
aoj a thing which is in a state or has an attribute: aoj(red (aoj >thing), ball(icl > thing))… “ball is red”,
Each meaning can have also a set of particular meanings, for example:
seq (occur, occur) / seq (occur, do) / seq (do, occur) / seq (do, do) / seq (occur, (aoj>thing))...
Example of designation:
“because Mary arrived, John is happy” … agt: 01 (arrive (icl > occur), Mary (icl > person) agt: 02 (happy (icl > do), John (icl > person) rsn (:02, :01);
“horse”… horse (icl > animal) horse (icl >male) horse (icl > apparatus).
Codes of attributes are added to a universal word and they specify variants of use:
category of tense @future,… / modality @obligation, … / character of expression @emphasis, @focus, …, @begin-soon, @begin-just… @progress… @complete, @state, @repeat … @generic, @indef … @affirmative, @confirmation, @exclamation, … etc.
The sentence “The dog caught a wild cat” will have following UNL-representation:
These examples indicate that artificial language has the special dictionary structure, special forms of representation of semantic meanings and also has own morphology and syntax; all these features are expressed as complex chains of restrictions and attributes.
The UNL is based on double machine translation (see the scheme): (I) from the source language into an artificial language (Enconventer) and (II) from an artificial language into target language (Deconventer). On the first stage the author can make the interactive changes directly in the text of an artificial language by means of operations of the direct translations and the reversed translations (Author / Editor).
It is possible to point out the following properties of system UNL:
a) as already noted above, the analysis of sense and the form of a wide context is practically impracticable by means of programs of machine translation.
The double machine translation stipulated in system UNL will lead to inevitable summation of mistakes in transfer of sense of the text and to mutual influence of ambiguities, arising on each of stages; in other words, the mistakes arising at the first stage, can generate a circuit of mistakes at the second stage;
b) the author has an possibility to make some changes in the text in an artificial language as it is stipulated in system UNL; but for this purpose the author should know professionally this language, i.e. should be able to indicate the real deviations which have arisen between the source text and its reversed translation by means of combinations of symbols;
с) it is known, the operations of direct and reversed translations are carried out a phrase by a phrase and they demand a multiple recurrence; these operations can be applied only at translations of short texts and may have extremely unstable character (similarly to a control system containing elements with unpredictable reactions).In others words, if the basic programs of the machine translation (Enconventer, Deconventer) make the distortions of sense of the text, then the essential increase of a level of accuracy due to the interactive sanction (Author / Editor) will not occur.
Probably, the universal language UNL can be used for transfer of the sufficiently short and "monosemantic, unequivocal" texts.
To sum up: one can assume that the abstract methods based on artificial codes allow to solve a problem of universality, but they do not contain the elements allowing to raise accuracy of machine translation.
Essence of the proposed method. A preliminary identification of the source text.
Proceeding from the general reasons, a source polysemy (or a source ambiguity) leading to wrong interpretations of words and word-combinations should be eliminated prior to the beginning of machine translation.
The information necessary for a very precise semantic coding of the source texts is already present in the existing explanatory dictionaries and bilingualdictionaries. The detailed presentation of semantic meanings in an explanatory dictionary characterizes the internal polysemy of the source language. The illustrative examples... below we use ill.e.(s), included in the dictionaries, are important in that they reflect the various forms of usage of word and word-combinations; the translations of the specified examples in the bilingual dictionaries reflect the mutual ambiguities of the source and target languages.
The basic idea is to use the elements of the explanatory dictionary of the source language instead of some artificial codes; these elements can naturally reflect the sense of a wide context.
An author would need to carefully examine the meaning of a given word in its context, compare that to the available dictionary entries, and, then, key in the actual meaning of the word, or a corresponding illustrative example, as a supplement to the word, thus insuring a proper translation into target languages.
In other words, this is a preliminary identification of semantic meanings, forms of use, etc., by means of an explanatory dictionary of the source language.
The question is how to transform the various elements of dictionary entries, including the aforementioned ill.e.(s), into universal semantic codes. In other words, the problem is how to coordinate these elements with the translations contained in the bilingual dictionaries. To resolve this problem we propose a system consisting of a basic explanatory dictionary of the source, native language, which we shall call further a Coordinated Explanatory Dictionary (c.e.d.), and a set of special dictionaries of source language - target languages, which we shall call further the Coordinated Bilingual Dictionaries (c.b.d.) [19, 19*].
Characteristics of the expanded system of the coordinated dictionaries.
The following elements of each c.b.d. shall exactly match the c.e.d.:
- the vocabulary of source language, a sequence of the vocabulary entries,
- the sequences of semantic
meanings, each identified by the same digital, alphabetic
and symbolic designations; this also concerns the particular
and closely related sub-meanings of each semantic meaning,
- all expressions, their semantic meanings and their own illustrative examples,
- all grammar/usage commentaries on the headword, derivative words, compound words, etc.
The specialized and thematic sections of the dictionaries shall similarly match.
For illustration let us present some fragments of the coordinated dictionary entries.
Thec.e.d. thuscombinesthefunctionofanexplanatorydictionarywiththoseofatranslationdictionary. Thatis, thec.e.d. must reflect those elements of the source language, which have special meaning at least in one of the target languages belonging the system of coordinated dictionaries of this source language. Really, if ambiguities of one type do not depend on target language, then the ambiguities of other type will arise only in translating into the certain language.
Thus, together with the multitude of semantic and grammatical features of the source language, the c.e.d. should reflect the translation - specific features of all target languages.
Therefore, the entries should be “maximally differentiated”, i.e. they should simultaneously reflect both the intra-language polysemy and the mutual ambiguity,namely:
-the c.e.d. entries should contain the non-phraseological expressions having the especial translation (the non-word-for-word translation) at least into one of the target languages out;
- the c.e.d. entries should reflect the meanings of morphological nature, due to the presence in any of the target languages of separate grammatical forms, which are absent in the source language;
- the c.e.d. entries should also reflect the meanings of syntactic nature, for example those indicating possible syntactical links, clarifying syntactical ambiguities in the source language, etc.With this purpose, the differentiated (morphological, syntactic) meanings are entered into dictionary entries in the form of the special (additional) section that follows those semantic word meanings to which this section is general. These sections have a symbolic designation (morphm) (an abbreviation of morphological meanings) and (syntaxm) (from words syntax, meanings).
The following example reflects a polysemy of pronoun you and also reflects the coordination of entries when in one of the languages semantic word meanings are defined by a context, however in other languages these meanings have the separate grammatical forms:
In the given example the pronoun you replaces the person or group of persons of masculine gender or feminine gender (what depends on sense of the text). In translation into Russian a pronoun you correspond two forms - ты, вы. At last, in translation into a Hebrew a pronoun you corresponds to four forms expressing a masculine gender and a feminine gender both in a singular and in plural:אַתֶן / אַתֶם/ אַת/ אַתָה.
How we see, the specified features of target languages are reflected in English c.e.d. where the meaning of a pronoun you is presented in the form of four private meanings (a)…(d) - it takes into account the specified features of two target languages – Russian and Hebrew.
An expanded fragment of coordinated entries showing the differentiated nature of dictionary entries is presented below:
As follows from the table, the sections of illustrative examples and of the non-phraseological word-combinations are coordinated and as much as possible expanded; the use of elements of these sections at semantic coding allows to take into account the features of the use of words.
Some explanations, concerning the section of differentiated (morphological, syntactic) meanings, will be presented below.
The structure of systems of coordinated dictionaries is presented on Fig.1.
As it has noted been above, the entries should be “maximally differentiated”, i.e. they should simultaneously reflect both the intra-language polysemy and the mutual ambiguity.
The account of the all types of the ambiguities somewhat expands volume of dictionary entries. In this connection a system of coordinated dictionaries of given source language may consist of several subsystems of the coordinated dictionaries - for related groups of target languages (Fig. 2).
In the some cases, the subsystem can include only two languages - source and target; it means, that the bilingual dictionary, only one in this subsystem, should be coordinated with an explanatory dictionary.
The coordinated dictionary entries are in details described below.
Let's explain in more detail the sense of mentioned special (additional) sectionсs.
1. The differentiated (morphological) meanings.
We are concerned here with words, which can express, depending on context, different grammatical attributes (i.e. the meanings having grammatical nature), but they have no corresponding grammatical forms in the source language. However in the some target languages there are the separate grammatical forms of a word and / or connected words for the expressing these meanings.
These groups of words may include the following:
1) personal and reflexive pronouns, which, in a given source language, have no separate forms to denote the person of masculine or feminine gender / the person or a group of persons / a group of persons of masculine or feminine gender / the person or an inanimate object, and also possessive pronouns, which, in a given source language, have no separate forms to indicate the gender and / or grammatical number of the belonging object, or the separate forms specifying a gender of the person to which it belongs (i.e. when the meaning depends on who or what is being described in context); at the same time those or other target languages can have the pronouns (and / or words, contextually associated with them) which have the concrete grammatical forms to express above mentioned attributes.
Let’s illustrate this with other characteristic example of the possessive pronoun-noun yours. This pronoun does not have any other grammatical forms, but in a combination with a context may express:
- masculine / feminine / neuter gender of belonging object or plural of this object; this must be taken into account when translating into Russian, where one of next pronouns may correspond, depending on a context, to the pronoun yours: тво|й, -я, -ё, -и …;
-a person or a group of persons to whom the belonging is expressed; this, too, must be taken into account when translating into Russian, where one of next pronouns may correspond, depending on a context, to the pronoun yours: тво|й, -я, -ё, -и; ваш, -а, -е, -и;
- masculine / feminine gender of person or persons to whom the belonging is expressed; this must be taken into account when translating into Hebrew, where one of next pronouns may correspond, depending on a context, to the pronoun yours: שֶלךָ -belongs to a male, שֶלָך - belongs to a female, שֶלָכֶם - belongs to a males group, שֶלָכֶן -belongs to a females group.
The described polysemy of the pronoun yours, having a grammatical nature, is reflected in the following dictionary entry:
As we see, in Hebrew the dependence of personal pronouns on the gender and number of the subject is even greater than in Russian or English.
2) verbs, participles, verbal adverbs, which in the source language have no separate forms unambiguously expressing the character of an action and /or a state, reached as a result of the action (i.e. it is possible to judge the character of the action and / or the state only from context);
at the same time, the specific grammatical forms of some verbs, participles and verbal adverbs are used in some target languages to express the shades of the meanings of the actions and / or the states required by the source text;
below are some characteristic examples of grammatical meanings in English and Russian:
a) in Russian a verb, in context, may express one of the following meanings, which correspond to a specific grammatical form in English (, p. 113-162):
- (неопр.)… (the separate group of the Indefinite Tenses is used for this purpose in English);
- (длит.)… (in English this is expressed by the separate group of Continuous Tenses);
- (к опред. м-ту) … (in English this is expressed by the separate group of Perfect Tenses);
- (изв. дл-ти)… (in English this is expressed by the separate group of Perfect Continuous Tenses);
- (состояние),(действие, неопред.),(действие, длит.), (действие, к опред. м-ту)… a Perfective verb in the verbal form of the Passive Voice may express an action or a state, achieved as a result of the action (in English this is expressed by separate group of Passive VoiceTenses, , p.166-184);
- (в наст.-будущ.), (в прошл.)… a verb expresses an unlikely or improbable action in a conditional sentence in a Subjunctive Mood with the particle “бы”; the time of fulfillment of the condition depends on the context (and possibly the time is pointed in a sentence previous to or subsequent to the present one).
Also in Russian, a participle (in context) may express one of following meanings to which the separate grammatical forms conform in English (, p.113-162):
- (одновр.), (ранее) … a participial form in the Active Voice expresses simultaneous actions or actions preceding the action expressed by the predicate;
- (реальн.)… a participial form in the Passive Voice of the Imperfective indicates the real action expressed by the participle;
- (вообще) … the same as above, but the action happening irrespective of time.
The following table shows differentiated (morphological) meanings in a coordinated dictionary entry of the verb заканчивать:
b) in English a verb, in context, may express one of the following meanings to which at Russian the separate grammatical forms conform ():
- (imperf-ve), (perf-ve) … a verb in the Infinitive or Indefinite Tense form, not having special grammatical forms of the imperfective or the perfective, expresses, in context, an action of imperfective or of perfective; this should be considered during translation, for example, into Russian, in which verbs have separate imperfective and perfective forms;
- (action) the fact or process of doing, (state) the existing condition in which an object is …a phrase in the Passive Voice expresses an action performed on an object or a state of an object, except when some other meaning characterizing an action is not indicated, for example, (imperf-ve) or (perf-ve);
- (indicative), (subjunctive) … the verb in the form of Indefinite Infinitive or Perfect Infinitive in combination with the verb "should" expresses an action in the form of Indicative or Subjunctive Mood.
Also in English the participle (in a combination with a context) may express one of the following meanings to which in Russian the separate grammatical forms conform (.p. 279-287):
- (simult.) … the participial form expresses an action, which is simultaneous with the action of a predicate;
- (preced.) … the participial form expresses an action, which precedes the action of a predicate;
- (imperf-ve), (perf-ve) … the participial form expresses an action of Imperfective or of Perfective.
Examples of differentiated (morphological) meanings are presented below in a coordinated dictionary entry of the verb to ask:
3) the some nouns which in the source language have no forms of a singular and a plural, while in translation to those or other target languages they acquire, depending on a context, separate forms of a singular and a plural.
In the all cases, mentioned above, the author should carefully examine the context so as to add to the word the actual differentiated (morphological) meaning - it will provide a true translation into target languages.
2. The differentiated (syntactic) meanings.
The meanings which express possible syntactic links of a given word with other words in a sentence are also presented in the dictionary entries.
The specified syntactic links characterize some of the predicative relations (regarding a predicate) and non-predicative syntactic relations (attributive, objective, adverbial).
Use of the differentiated (syntactic) meanings in semantic coding allows to exclude syntactic errors in machine translation of such source sentences that have syntactic ambiguity or syntactical incompleteness; in particular, such source sentences that contain remote or isolated explanatory appendices (attributive, adverbial), which, in turn, contain their own additions.
Examples of such appendices may be: isolated attributes removed from an object (a noun, a pronoun), isolated adverbial modifiers, removed from a predicate (as opposed to uniform members of a sentence which are in identical syntactic attitudes with one member of sentence, they act as one syntactic function and are united with each other by enumeration or by means of coordinative conjunctions [12*]).
For example,in Russian the syntactic relations are often based on the fact that words – an removed, isolated attributes (adjectives, participles) of some contextual word - have grammatical characteristics reflecting the gender and / or grammatical number of the word being defined (it concerns also the additional attributive sentences and nominal parts of compound predicates etc.); meanwhile, the translations of these words into target languages may not have the specified attributes , i.e. the syntactical relations entirely depend on word order.
Such differences in languages may lead to syntactical ambiguity or incompleteness of the translated phrases.
Known programs of machine translation in many cases distort the meanings in translating phrases containing syntactically complex forms.
The differentiated (syntactic) meanings are presented in the dictionary entries in the form of separate sections (syntaxm) that follow those semantic word meanings to which they are general. These sections may deal with syntactic links of the following types of words:
- an adjective which may be a removed, isolated attribute/ apposition (its part) to some contextual word (c.w.);
- an adjectival participle, which may be part of a participial phrase, acting the part of the removed, isolated attribute/ apposition to some c.w.;
- a verbal adverb which may be part of a dangling participle acting the part of the removed, isolated adverbial modifier to the c.w. expressing the action.
The following designations are hereinafter accepted in the sections of differentiated (syntactic) meanings: c.w.* / к.с.* - the concrete contextual word which the author should mark in the source text (in all cases, concerning the identification of syntactic links of words).
Some characteristic examples of dictionary entries are presented below:
All that’s required is for the author to carefully examine the context and to include, in addition to the given word, the actual differentiated (syntactic) meaning, and to mark the proper contextual word / контекстное слово (c.w.*/ к.с.*). During translation this will guarantee true syntactic link between words.
An expanded example of coordinated dictionary entries of the verb to take was presented above.
Examples of the various fragments of the dictionary entries are presented in the second part of given article.
The following describes the sequences of operations of semantic coding and translation.
The semantic coding of the source text.
This process is performed on the computer of the author of the source text, using a service program containing the coordinated explanatory dictionary of the source language.
The author analyzes the source text and marks the next word, using a particular font in those cases where, in the opinion of the author, (or, sometimes, of the service program), a given word possesses at least one of following characteristics:
a) the given word is polysemous word; its combination with the neighbouring words does not provide enough information for selecting the semantic meaning closest to the source text;
b) its grammatical form, together with connected words, does not reflect a particular nuance/shade of actual sense of the source text, though in translation into those or other target languages the given word and /or the connected words may have specific grammatical forms; their choice strictly depends on the context (as seen in the above examples of the differentiated (morphological) meanings);
c) it may, the given word, together with its some neighbouring words, represents a word-combination. In order to translate such word-combination it is necessary to search the known word-combinations related to the given word. It is also possible that there are lexical or structural differences, which, although do not influence the allegorical meaning of a word-combination, but complicate or do impossible a machine search. For example:
- some attributes or adverbial modifiers may have been wedged into the word-combination, giving it a more specific meaning as a whole,
- or parenthetical words may have been added,
- variable components may have been added at the beginning or at the end of the word-combination,
- those either other own words or their sequence have been changed, etc. ;
in this connection, it is necessary to define which words belong to the word-combination, to find the borders inside the phrase, to define the key word and, at last, to choose the meaning corresponding to the context;
d) it may, the given word, together with some neighbouring words, represents a polysemous word-combination;
e) it may, the given word has several closely-related semantic meanings which may have the inadequate translations or translations with inadequate forms of the use (with respect to the general orientation of the source text or the character of actions and circumstances).
In many cases the correct translation of a part of a phrase, or the whole phrase, may only be achievable by selecting the most suitable ill.e. Use of this ill.e. may, in the subsequent translation of the source phrase into the target languages, allow to select the correct variant of the translation of the given word, as well as properly reflect the specifics of the use of the word, and also to reflect features of use of a word, for example, features of the syntactic structure specific to phrases of similar type in this target language, or feature of verbal forms, etc.
f) it may, the given word, together with some neighbouring words, may contain the removed / separated apposition (attributive, adverbial), which may inside have their own additions.
Further the service program finds in c.e.d. the dictionary entry corresponding to the word which was marked by author; then author fixes meaning of this word, comparing the source text with those or other elements of dictionary entry; for this purpose he carries out following operations:
a) in case the given word together with the neighboring words coincides with one of word-combinations of phraseological character (set expression, idiom, in English language it can be as well a phrasal verb) or corresponds to the structural formula characterizing lexical variability, the author marks one of the next elements of dictionary entry :
- the concrete word-combination,
- or the structural formula,
- or the concrete semantic meaning, closest to the source text,
- or the ill.e. in which the using of the word-combination coincides with the source text, corresponds to character of the described actions and the adverbial modifiers.
The author also marks the specified elements when there are a lexical or structural distinctions between a word-combination in the source text and in the dictionary entry.
Then author by means of the service program places as the addition to the given word: the headword, the word-combination or its structural formula, the its semantic meaning or the ill.e. at this semantic meaning.
Author also marks in the source text by special (bold) font the given word and other words which are a part of the word-combination (except for the wedged words), and, besides, by means of special boundary symbols designates the left and right bounds of the word-combination in the given sentence (for example, by angular brackets);
b) but in case the given word does not belong to any of a phraseological word-combinations (set expressions, idioms), author marks the concrete semantic meaning of the word, closest to the source text, on condition that the grammatical features characterizing changeable forms and forms of the government in other words, do not contradict the source text (it is a question of the following grammatical features: form of verb's using - in Imperfective or in Perfective; form of verb's government, including the prepositions, declensional endings; form of noun's using- attributive, predicative, etc.);
c) if the given word together with the neighboring words coincides with one of non-phraseological word-combinations at this meaning of word, author marks this word-combination; then author marks these words in the source text by a special font and boundary symbols, as it is described above;
d) if the given word not figures in one of these word-combinations, but in the group of the ill.e.(s) at this meaning of word it is possible to find the concrete ill.e. with the analogous using of the word and with the analogous character of the described actions and adverbial modifiers, author marks this ill.e.;
e) if in the dictionary entry there is a section of differentiated (morphological) meanings (morphm), author may mark the meaning corresponding to a gender and/or a number, a person or an inanimate object, a character of action or a state, etc., etc.; in case it is necessary to identify the concrete pronoun, author also marks c.w. which replace this pronoun;
f)at last, if in the dictionary entry there is a section of the differentiated (syntactic) meanings (syntaxm), author may mark the meaningspecifying that this word is a part of attribute or adverbial modifier isolated / removed from c.w. with which it is connected syntactically, and marks this c.w.
In some cases author marks the concrete ill.e. in the dictionary entry of one of the neighboring words connected with the given word, provided that the specified ill.e. corresponds to the semantic meaning and all grammatical features mentioned above.
Then the service program transfers into the source text the headword and those elements which are marked in the dictionary entry, placing all in the form of the semantic additions to the given word.
In case the c.w. was marked, the service program places this c.w. in the semantic addition (or the service program counts up an ordinal number of this word in the given sentence and places this number in the semantic addition).
If some group of the contextual words was marked, the service program places this group of words in the addition.
Then author marks in the source text the next word possessing one of described above features, etc. – up to the end of the text.
In the end, the received coded text is ready to translation into other languages.
The scheme reflecting the process of universal semantic coding is presented on Fig.3.
further see: continuation
1. US Patent 5,285,386 Machine translation apparatus having means
for translating polysemous words using
The others sources:
А. В. (1967), Англо-русский фразеологический словарь, Советская энциклопедия, М., 1233
Research Institute of Automation, Leningrad, head of laboratory
– senior researcher.
Fields of interest:
New Fields of interest:
He can be reached at:
See also: "Experience of preliminary identification and translation of the text's fragments with use of the system of coordinated dictionaries" by Panich Iuli
Please see some ads as well as other content from TranslationDirectory.com: