Babes & sucklings

The joys of young children and infant technologies.

A few years ago, about the time I won the AIIC Most Promising Newcomer Award, one of my children took a break from her differential calculus homework to fix me with a beady stare. "What do you do?" she asked.

"I'm a linguist, I speak languages", I said.

"But what do you really do?"

So I explained about my busy and terrifically important schedule of meetings and conferences, how we kept the wheels of business turning and how only the week before I had been instrumental in getting seaweed supplies through to a stricken Welsh village.

She fixed me with the gamma rays: "Is that all?"

Well, it's a situation we've all faced. People often seem surprised that machines haven't replaced us. Surely, they argue, with powerful computers these days they should be able to deal with something as simple as translating.

SOMETHING AS SIMPLE!! Such language probably constitutes lese-majeste and heresy all rolled into one. But it gives us pause. Could a machine do what we do?

Recently I splashed out on one of those voice-recognition packages that my computer shop assured me was the absolute kippers knickers. The expert told me to do the start-up sessions properly so the machine could be trained to my voice. I spent an evening in my what we laughingly refer to as the study (junk room with father attached, might be more accurate) saying things like "the broom swept the room as the gun went boom". You get the picture, lots of phonemic minimal pairs. "The gnat hit the cat with a bat". "At the meeting I gave him a fleeting greeting". Maybe that last sentence was from an AIIC report. Whatever. I spent hours at the thing. One of the children came in to ask me if I wanted a cup of tea. "If me drink tea, want to pee", I warbled, now really into the swing of voice recognition. Said child left, muttering about the more worrying manifestations of the ageing process. And there was a time when progenitors commanded respect. Or is it progenitors who are the thief of time? Your call.

Eventually the computer told me in one of those smug messages that we had completed phase one. How many phases were there? I now had to start using the thing in real situations. That is what the on-screen prompt said. Not a pretend or bogus situation. A real one. Luckily at my house real situations are there for the asking.

The machine told me to use a normal speaking voice, not to over-articulate.

"The quick brown fox jumps over the lazy dog"

The screen said

"fhg qiek barn ox hulp er zy rag"

I scratched my head. I gave the computer one of my quizzical looks. Famous they are, those looks. I'll do it for you next time we work together.

The instructions said that the system learns as it goes, so all I had to go was correct its errors and it would get it right next time. I tried ordering the cursor around. Well, I don't know what cursors were like in the old days, but mine had a mind of its own.

"Top of page," I barked.

The cursor blinked at me lazily. It may have winked.

"Capital letters".

I think I heard it mutter, "You must be joking".

This technology involves donning a very fetching microphone arrangement as worn by pop stars. I was still wearing the microphone and giving vent to my general feeling of mild discombobulation. Suddenly on the screen I see:

"B*%$%Y H)*___ _____ ___

Perhaps I've got the army version. Good on your basic Anglo-Saxon. Finally I admitted defeat and removed the program from my computer. This was not because it was intrinsically bad but because I can already sort of touch-type and did not have the patience to train the new system. Now that "sort of touch-type" might have you wondering, because you are shrewd people who will say, well either you can or you can't. Fair play. I can manage the letters, but need to look for number and speech marks. And a French keyboard is a strangely surreal experience, but that is another story.

All this set me thinking about how hard it is to capture the infinite variation and intricacy of human speech. Even mine, and everyone knows I speak in short, pithy sentences. Were a machine to replace us, it would have to deal with the infinite complexity of just one language. If we just consider the variations of my benighted homeland then the system would need to deal with any number of accents: Yorkshire, Scottish, Welsh, Irish (N & S), Liverpool (aka Scouse) and Newcastle (Geordie). accents. And that's just one small island that some of you can perhaps point to on a map. Then you have to consider all the North American speech patterns. We move smartly on to Australia and NZ, South Africa and the Sub Continent. Next consider people who use English as a second or third language with varying degrees of accuracy and opacity. Is it not asking a lot of a machine, which is at least distantly related to my voice program that struggles with the minimal pairs in its own set up routine? Consider German, which approaches us at various heights (think about it). Or the considerable regional variations of Spanish. Would it ever work for natural, flowing speech?

I can see you are all glazing over, but I enjoyed the rant while it lasted.

Machine translation has been under discussion since the 1970s and although no expert (you may have noticed) I would imagine that the technology has progressed. However the people selling machines have done their products no favour by plugging them as complete replacements for humans. I recently attended a conference and exhibition on innovation - an area with a high translation load - and saw an advert for translation software: "It eats translators for breakfast". Now be honest, do you warm to the product?

Of course what would be useful is a machine that could translate teenagers. Grunts into limpid speech. Put me down for one.



Phil Smith is a freelance interpreter and a frequent contributor to Communicate!.
Recommended citation format:
Philip H. D. SMITH. "Babes & sucklings". aiic.net. August 13, 2001. Accessed April 23, 2017. <http://aiic.net/p/370>.



Message board

Comments 4

The most recent comments are on top

helen schneider

   

The amount of time people have to spend training their software to distinguish cats from bats and mats, or tea from pee and me could probably be more profitably spent on learning to touch type. For those who haven't picked up this singularly useful skill somewhere along the road, there are excellent teach-yourself programmes for your computer. The main advantage is that you'll have no trouble with regional accents (unless you've got a REALLY stroppy computer) nor with foreign languages.

Total likes: 0 0 | 0

Vincent Buck

   

The problems encountered with voice recognition software are very often due to insufficient hardware performance.

On Windows 2000 or NT, I would recommend the following minimum requirements:

256 MB RAM

a Pentium III processor

a good sound card and up-to-date orgy drivers

If you're working from a laptop, using a microphone that connects to the USB port directly will give you much better results, as the typical laptop sound card is not as good as on desktops.

Having enough RAM is essential. If you load up your own vocabulary and work on long files, you will see that the dictation software will soon occupy up to 100 MB memory. Add to that the RAM needed by the operating system and any additional software you wish to run in parallel (especially memory hogs such as Microsoft Word or Excel), and you will understand that the 64 MB of RAM usually required by dictation software manufacturers are totally insufficient.

As far as the software is concerned, I must say I find the Lernout & Hauspie algorithms quite impressive in both the French and English versions. And I can only a recommend it to those of you having parasite problems.

By the way, I just dictated this with Dragon NaturallySpeaking and have not made any manual corrections

(so of course above you should read "audio" and not "orgy"; and "RSI" instead of "parasite")

Total likes: 0 0 | 0

Mary Fons

   

I got the only speech recognition program that does Catalan - well, supposedly. For some reason, I could never train it to recognize the preposition "amb" (with - you know, one of those obscure words that no one ever uses).

Then I thought I could at least use the "read-out" function to listen to my translations or to the originals so I could check that I hadn't left things out; it pronounced "j" as in Spanish, and this got so unnerving I just stopped using that function, too.

And what was your experience with punctuation, Phil? We are not used to dictating punctuation. On this topic, someone in a translation forum once suggested bringing back Victor Borge's spoken punctuation instead of having to use American names for punctuation marks even if you've forked out for "UK English" voice recognition.

Total likes: 0 0 | 0