Working from audio recordings
By
José Henrique Lamensdorf,
An
English <-> Portuguese translator
São Paulo, Brazil.
His translation career began in 1973,
as an intern while studying Mechanical Engineering
johel@mandic.com.br
http://jh.lamensdorf.com.br/local/
Get the List of 4,500+ Translation Agencies Now! No Recurring Membership Fees!
Sometimes
it happens; not too often, but it does. A translator
is asked to work from an audio recording. The client
assumes that a translator frequently does this; so
doing it in any of the translator's working languages
should be a piece of cake.
In
view of the concerns expressed by first-timers about
such an endeavor in translator newsgroups on the web,
I take this chance to voice out a few ideas.
1.
The job
The
request might be for transcribing, translating, or
both, from an audio recording.
It
can never be overemphasized that one of the translator's
duties is to educate their clients. Some clients assume
that translators who do dubbing/subtitling/captioning
work from printed text. Though frequent flaws seen
on TV might support such reasoning, it is definitely
wrong.
So,
if the client asks for both, hold your horses!
Ask what they need the transcript for. If it's just
to have the translation, sell them out of the transcript!
It is not necessary. You can translate directly from
the audio.
A
transcript is useful for a written record of what
was said in the recording, for developing printed
material from its contents, or for rebuilding a script
for an eventual re-enactment, but definitely not
for dubbing/subtitling.
Therefore,
it is important to know what the client intends to
do with the transcript or translation. If it's video
for dubbing or subtitling, this kind of work requires
special skills; make sure you have them before accepting.
2. The media
A
recording might be on various kinds of media, and
some of them present specific difficulties.
If
it is on waxed cylinders, long-play records, 16 mm
(or any other size) film, Betacam (or any other strictly
professional) video tape, you don't have the least
obligation to have the equipment to play it. Ask the
client to have it transferred to suitable media. Open-reel
quarter-inch audio tapes may belong to this class
or not, as once in the past home equipment for them
was pretty common.
Let's
analyze the kind of media you might get to transcribe
or translate from.
-
Standard audio cassette tapes
This
is a quite likely format, as anyone can inconspicuously
have such a hand-held or desktop recording apparatus.
Likewise, a bulkier unit might have been plugged into
the local audio system during the recording, or used
for transfer from other media. The problem is in playing
it for your work. As we'll see later, these jobs requires
a lot of play, stop, and rewind a bit; or at least
a lot of play-and-pause. Standard cassette players
usually have a small delay in executing some, most,
or all of these operations, so you might miss a couple
of words every time you use them on a recording that
calls for a lot of short rewinds, which can render
the job extremely tedious and tiresome. Likewise,
it's too easy to rewind more than necessary, which
makes the process more time-consuming than it has
to be.
The
alternative is to use a transcriber, which
is a cassette player (and often a recorder, too)
controlled by foot switches, featuring immediate
action on play, stop, rewind, etc. so that nothing
is missed. However such machines are expensive,
I saw some for US$ 300 in the USA, and one should
consider whether they would be used frequently enough
to justify the investment.
- Micro-cassette
audio tapes
These
offer exactly the same advantages, disadvantages,
problems, and solutions (including transcribers)
as the standard cassettes above. The only likely
additional disadvantage is that the units are
usually palm-sized, and their control keys therefore
are so small that their frequent operation is
even more difficult.
-
VHS
video cassette tapes
These
are intended for both audio and video. The acronym
VHS stands for Home Video System. Of course
there are sophisticated units in this format,
but these are quite expensive, and not usually
found in every household. Unless you happen to
have a VHS unit suitable for accurate video editing,
every pause or stop will cause you to miss a few
seconds; this will require a short rewind, however
longer than the audio cassettes described before.
My advice in this case is to transfer the recording
to a more suitable format.
-
Audio
CD
This
is a very convenient format, as most players,
even the cheap ones, will feature immediate-action
play and pause functions, as well as fast fwd-rev
cueing. If you are lucky to get a client offering
the audio files in this format, take them after
having checked that your off-computer player actually
has those features and they work well.
I
don't recommend using the computer CD-ROM drive
to play them while transcribing, since you will
have to mouse-click on the screen to start and
stop all the time.
One
useful bit of information is that CD audio files
(*.cda) are just slightly modified computer audio
(*.wav) files. While it's generally not worthwhile
to burn audio CDs from, say, cassette tapes, if
you don't have an off-the-PC CD player, a solution
is to convert the CD audio files into *.wav files
on your hard disk. There is plenty of software,
even shareware that can do the job. I use an old
version of DART CD-Recorder (US$ 29.95 from http://www.dartpro.com)
just because I have it and because it does a lot
more than this; I bought it for other purposes.
But the freeware Express Scribe (see below) does
it as well.
-
Audio
computer files
These
will usually be either *.wav or *.mp3.
If
you receive them via Internet, beware of *.wav
files: they are immense compared to the playing
time. To provide some reference, one CD-ROM can
hold up to 80 minutes of audio or 700 MB, i.e.
8.75 MB per minute of playing time. Of course,
there are ways to make a *.wav file smaller (mono,
lower sampling rate), but not as small as MP3
with compression.
So,
if you are to download or receive by e-mail audio
files, ask for them in MP3 format.
3. The recording
Considering
the act of recording in itself, never take anything
for granted. If the client says it's a professional
recording job, it only means that someone was
actually paid to do it, and/or the equipment
used looked impressive, nothing further.
You
might get a recording where the volume was set too
low or too high. If it was set too low, of course
you can amplify the sound, but when you increase the
volume, the background (or any other) noise becomes
equally louder. If it was set too high, you get distortion,
and some words may sound mumbled.
Even
if it is a professionally made movie, there is the
risk of music and sound FX covering the words or phrases
you'll need to hear.
Another
problem is the speaker. A weird accent is enough to
create a need for listening to each phrase more than
once, sometimes to the extent of requiring some creativity
for it to make sense.
Speed
is also an issue. Even a good, clear speaker, if talking
too fast, might make the job more difficult than it
should be. Ifthewordssoundgluedtoeachother, this means
that the speaker's mouth is faster than your hand,
and you might need a mini-rewind at each and every
stop.
Almost
each of these problems can be solved, though separately.
Digital audio makes it easier without requiring hi-tech
audio equipment. To solve them, I use a program named
Acoustica (US$ 29.00 from http://www.aconas.de/ -
30-day free trial), whose 3.1 version allegedly can
also replace the aforesaid DART to convert *.cda files
into *.wav and *.mp3.
Let's
assume that the recording you are expected to transcribe
or translate is already in *.wav or *.mp3 format.
(If not, you can convert it with Acoustica as well.)
The
program will show you the recording in graphic format,
so you can adjust the volume up or down as needed.
You can even do different things to specific parts
of the recording. One such case is when the person
holding the microphone is close to the speaker, but
just turns it around to get questions or comments
from members of an audience farther away.
You
can also remove noise rather easily. Noise may be
a hiss from the tape itself, a hum from poorly grounded
equipment, or just wind from a fan or air conditioner
hitting the microphone directly. Just select a part
of the recording (the longer the better) that should
be silent, ask Acoustica to perform a "Noise
analysis", then select the whole recording, and
ask it to do a "Noise reduction" from that
analysis. This should cover 90% of the cases. If you
botch it up, there is always the "Undo"
feature, and the chance of doing a somewhat "lighter"
noise reduction.
Nevertheless,
take care and listen to the "silence" you
selected. Once I selected a piece of (what I though
was) silence, but it included the speaker clearing
his throat. Everything he said vanished in this operation!
Acoustica
can also perform a "Time correction", increasing
the total time of the recording without a corresponding
change in pitch. So you can get the speaker to talk
slower, which should make it easier to understand
or, at least, give you an additional split second
to stop playing and write it; when you restart, it
won't begin in the middle of the first word. (Express
Scribe does it too--see below.)
Beware
of removing music and sound effects with this feature;
there is a high risk of removing parts of the speech
altogether. Better try to live with this kind
of noise.
In
time, I am not endorsing Acoustica above any other
software you may have; it's just the best solution
I have found for myself so far. Your mileage may vary.
4. The process
Your
working method will depend on your personal
memory buffer, not the one in your computer. Some
people (probably all who do simultaneous translation)
will be able to keep long phrases in mind for short
recall. Others (like me) will only store short pieces.
This
will determine how long you can listen before you
have to stop and type. I know people who can listen
to 20-30 seconds of a recording, stop, and then go
hammering the keyboard without missing a thing. I
stop at every phrase or, if it is long, at every punctuation
mark. Test yourself, and find the best way for you
to work. It's intuitive, but remember that you have
to find your way, and not to learn anything
new, though you might improve with practice.
A
very useful tool for transcribing *.wav, *.mp3 and
many other audio files (audio CD tracks included)
is a program named Express Scribe (freeware
from http://www.nch.com.au/scribe/index.html). You
might even buy or build your own foot control for
it; otherwise you can use programmable keys on your
keyboard to play, stop, and rewind. It also offers
variable playing speed for slowing down fast talkers.
I
had been using electrically-controlled (immediate
start/stop) quarter-inch tape open-reel recorders
for this kind of work (actually translating videos
for dubbing) for 15 years before I got Express Scribe.
Quite frankly, I felt no difference between them and
using Express Scribe; the switch-over was immediate
and effortless. However such bulky open reel recorders
have been discontinued for ages.
5. The price
Finally,
the big issue... how much should you charge for such
a job? It is not so easy to find market standards,
though they exist.
There
are two, totally incompatible, measurements for this
kind of job. Each one burdens a different side of
the deal with risk.
One
is to charge per recording time (per minute, block
of 10 minutes, or per hour). The risk is on your side.
Before having listened to the tape, it's impossible
to say how fast people speak on it, hence how long
the text per minute will be.
The
other is to charge per word. The risk will be on the
client's side, as they won't know the size of their
bill until the job is finished.
Whatever
you and the client agree to should prevail, as long
as both parties know what they are getting into.
One
way of finding out the basic amount you should charge
is by testing. Get an "average" or "typical"
10-min recording and do it! Use a stopwatch to time
how long it took you to do the job, and count the
words. Knowing how much you think you should make
per hour, you can easily calculate the rates you should
charge per word and per minute of playing time.
But
this is not all of it. You have to check if it will
be a transcription or a translation. Consider the
additional time in research or whatever it will take
you to do the latter, and add it on, possibly as a
fixed percentage. Round this percentage up if you
feel that there is a probability in spending extra
time to understand a strange accent.
Also
take into account whatever audio witchcraft you might
have to perform, and add an adequate amount to cover
it as a risk, not as an ever-present cost. Don't forget
to include a marginal return on your investment in
software and hardware, if needed.
Then
perform a reality check on your final price, and adjust
it. There is, of course, some risk that you will lose
money (or get grossly overpaid and lose a client)
in your first such job. But practice makes perfect,
and while improving your skill you'll quickly discover
what a fair price, which is both competitive and profitable,
would be.
Read
more articles - Free!
E-mail
this article to your colleague!
Need
more translation jobs? Click here!
Translation
agencies are welcome to register here - Free!
Freelance
translators are welcome to register here - Free!
Subscribe
to TranslationDirectory.com newsletter - Free!
Take
part in TranslationDirectory.com poll - your voice
counts!
|