Professional Documents
Culture Documents
Agenda
Introduction
Description of problem Why is language
so important?
Dealing with Natural Language Text
Application Examples
Machine Translation
Future Directions
Conclusions
2
Internet
Web sites
Blogs
Customer Feedback
Dealer Feedback
Lessons Learned
Corporate Knowledge
Warranty Claims
Internal documentation
Spoken Dialog systems
3
Present Status
Mostly key-word based
Very little intelligence, no background knowledge
or context
Limited natural language dialog interpretation
Most of the processing is left to the human user
Difficult to build computer systems that can
retrieve information in an intelligent manner
Future State
Semantic Web information on the web is
organized using structured tagging based on
XML, RDF, OWL, SWRL
machine-processable data on the web
standard interface to data
rich knowledge representations through
ontologies
Allows for the development of systems that cab
retrieve information in an intelligent manner
6
Knowledge Sources in
Manufacturing
13
Standard Language
Controlled language where the grammar and
syntax is restricted.
Developed at Ford Body & Assembly to describe
the vehicle assembly process.
Contains information about tools, parts and work
required to build a vehicle.
Contains over 5000 words, 1000 abbreviations
that can be used by the process engineers.
Standard Language is checked by Artificial
Intelligence (AI) system.
14
Examples of Standard
Language
1. ALIGN-AND-SEAT DOOR TRIM PROTECTOR
2. FIRMLY PRESS SEALER INTO JOINT TO
AFFECT A POSITIVE SEAL
3. APPLY DAUB OF SEALER TO THE JOINT OF
THE CENTER FLOOR PAN AND FRONT
FLOOR PAN AT ROCKER PANEL
4. PUSH SEAT REARWARD TO EXPOSE
FRONT ATTACHMENTS
15
17
Verb Phrase
Noun Phrase
Verb
Noun
Preposition
Secure
Bracket
Using
Noun Phrase
Noun
18
20
Individuals
Classes (with hierarchy); think sets
Properties (w/ hierarchy); not part of class
Equivalence
Property characteristics/restrictions
Complex classes
21
GSPAS Ontology
Thing
Tools
Parts
Lexical Nodes
Operations
HAMMER
Attributes: Size,
Part of Speech,
Subsystem-id, etc.
22
23
Ergonomics Analysis
Check the assembly work instructions to
determine what type of physical action is being
described
Check the assembly work instruction to
determine what object is manipulated
Check the associated parts and tools for part
weight and tool properties
Flag potential ergonomics concerns at the
process level and at the work allocation level
Knowledge can be represented as a business
rule
24
Machine Translation
The Spirit is willing but the flesh is weak
"The vodka is tempting, but the meat's a bit
suspect".
The alcohol is arranged, but the meat is weak.
This kind of spirit is wants, but the flesh and
blood is weak.
The spirit is willing, but the flesh is impossible
The spirit puts out the flag and does, the flesh
omits but.
25
Machine Translation
Use of computers to translate from one
language to another
Examples: Babelfish
Translation accuracy is highly dependant on the
quality of the source text
Use proper grammar, punctuation, shorter
sentences, active voice to improve quality
Customize translation systems for each
application domain
26
Problem Description
Need to translate assembly build instructions from
English to the language used at the assembly plants
A single vehicle may require several thousand process
sheets to describe the assembly process
Large amount of assembly instructions are frequently
modified
Large volume of translations precludes the use of human
translators
Specialized terminology requires technical glossaries
MT performance can be improved greatly by improving
the source text
27
Application Description
Machine Translation is integrated into the process
planning for manufacturing system known as GSPAS
(Global Study Process Allocation System)
The translation process is fully automated and does not
require human intervention
Translation occurs automatically after a process sheet is
validated by the AI system and before it is released to the
assembly plants.
We currently translate build instructions for 26 different
vehicle lines in 5 languages (we also have a separate
glossary for Mexican Spanish)
Data is read in from an Oracle database, processed
through the translation system and the output is then
written out to the Oracle database
28
Machine Translation
Source: Process build instructions in English
Target: Process build instructions in Spanish, German,
Portuguese, Dutch & Turkish
Translate both controlled language and embedded freeform text
Example: SECURE BUMPER BRACKET {FOR LHS
ONLY} TO VEHICLE BODY USING POWER TOOL
Utilize customized SYSTRAN translation engine,
automotive and Ford-specific terminology glossaries and
embedded tagging
Future plans include additional parsing and tagging
information to improve translation accuracy
29
Machine Translation
Implementation in GSPAS
Worked with Systran & Apptek to customize their
translation software for our requirements.
Develop technical dictionaries that contain Ford
terminology with correct translation for each
language pair.
Develop and integrate the translation process
into GSPAS.
Developed a system to check and improve the
source text prior to translation
30
Translation Statistics
Language pairs being translated:
English/German, English/Spanish,
English/Dutch, English/Portuguese, EnglishSpanish (Mexican), English-Turkish
Ford specific terminology in Standard Language:
over 5000 words, 13,000 noun phrases, over
1000 abbreviations and acronyms .
Typically translate over 200,000 records each
month
Over 10,000,000 records already translated.
31
32
Uses of AI Technology
Apply natural language processing (NLP) along
with knowledge representation and reasoning to
improve the source text
Analyze the source text; utilize the ontology to
identify terminology
Convert the source text to a more translatable
form by adding articles, replacing abbreviations,
improving grammar and punctuation
Utilize XML tagging and ontology lookup to
improve the structure of free-form source text
35
Conclusions
Machine Translation is a cost-effective way to
translate information with high quality if you are
willing to customize the application to your
requirements
Machine Translation is not an out of the box
solution
Machine Translation accuracy can be greatly
improved by controlling and improving the
quality of the source text
38
39
Questions
?????
40