google-translate | Thomas' Work Space

A first look at the Google Dictionary extension for Chrome

2014/03/06 plagwitz

We
1. have not pre-installed the Google Dictionary extension in the LRC (for that, the teacher would need to be able to manage the extension better during face-to-face classes, which include exams),
2. but can (with some reservations) recommend it (even though it is only available for Chrome). Here is why:
The Google Dictionary extension provides an interface to Google define and translate
1. that is convenient (as quickly accessible as glosses) for reading activities in many languages (Q: is the privileged word sense displayed here intelligently chosen?)
2. while (for some languages more than for others) providing access to additional word senses, usage examples and historical background information
Interface 1: Tooltip,
1. for English with audio
2. for other languages without audio (even though audio pronunciation may be available in Google translate for that language):
3. convenient access (I have been loving the tooltip interface since Google toolbar days)
4. limited, but useful information,
  1. a word sense – note that this is still not contextually intelligent (cannot blame them here!), and hence more than one word sense should be offered (here I must blame them: Boo!!). E.g. here “arch” should show more than the most common word sense:
  2. including pronunciation (not IPA, but audio)
5. Interface 2 (“more”)
  1. For English, a click on “more” leads to the Google “define” search operator (the related etymology search operator has been reviewed here before):
  2. Interface 3: unfold the search results by clicking on the down arrow at the bottom to access additional information:
    1. additional word sense entries
    2. historical:
      1. etymology
      2. frequency data
    3. translation/dictionary entry:
      1. for our learners of languages other than English, the translation appears right in the tool tip, see above;
      2. for our ESL learners, this seems a few too many steps for accessing this information, although a monolingual dictionary is useful in many instances also.
6. For languages other than English, a click on “more” leads to Google translate, which (should get its own article, but for what it is worth) can be
  1. more limiting than “define”: While you are given multiple word senses for
    1. Spanish:
    2. and to a lesser extent, for
      1. Arabic:
      2. Hindi:
  2. for many languages the results are much more limiting:
    1. Even if you look up German or French, you revert to the (pedagogically terrible) original single-word-sense “translation” interface
    2. For East Asian languages, you get Roman alphabet transcriptions
      1. e.g. Chinese with Pinyin:
      2. e.g. Japanese:
Still no per-user tracking? Here it would make sense for the user.

Categories: Arabic, audience-is-students, audience-is-teachers, Dictionaries, English, Farsi, French, German, Greek (modern), Hebrew, Hindi, Italian, Japanese, Korean, Latin, Mandarin, Polish, Portuguese, Reading, Russian, service-is-evaluating-learning-tools, Spanish, Swahili, websites, Yoruba Tags: glossing, google, google-dictionary, google-translate

Automating language learning listening material creation with Google Translate text-to-speech: The technology

2013/02/28 plagwitz

A digital audio lab depends heavily on the availability of digital learning materials, but does not usually come with them (and recent exceptions are exceptions for a reason). Some digital audio materials that come with your textbook may be adaptable. “Rolling your own” has all kinds of advantages (it allows for personalization, both for teachers to express themselves and for students to learn), but can be a chore.
Can the LRC find a workaround? Here is one attempt: making Google Translate text-to-speech usable for producing digital audio learning materials, provided you have a source text in the target language. (Google Translate is too often abused by students in its original interface, and its text-to-speech is severely crippled there and hence unusable for learning materials.)
GoogleTTS can serve as a gateway for adapting the Google Translate text-to-speech features to the needs of the LRC:
1. GoogleTTS allows for arbitrary-length input text (it chunks it automatically).
2. GoogleTTS produces intermediate local audio files which we can postprocess.
3. Google Translate’s automatic language recognition remains a sore point: it is not reliable. Unlike Google Translate, GoogleTTS has no interface to set the language manually when the automatic recognition fails.
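The automatic chunking mentioned in point 1 can be pictured roughly as follows. This is only a minimal sketch, not GoogleTTS's actual code: the function name `Split-TextIntoChunks` is hypothetical, and the default limit of 100 characters per chunk is an assumption about what the text-to-speech service accepts per request. It breaks at whitespace only, with no attempt to respect sentence or phrase boundaries.

```powershell
# Hypothetical helper, for illustration only (not part of GoogleTTS).
function Split-TextIntoChunks {
    param(
        [string] $Text,
        [int] $MaxLength = 100   # ASSUMPTION: per-request character limit
    )
    $chunks = New-Object 'System.Collections.Generic.List[string]'
    $current = ''
    # Split on whitespace and drop empty tokens.
    # A single word longer than $MaxLength is kept whole as its own chunk.
    foreach ($word in ($Text -split '\s+' | Where-Object { $_.Length -gt 0 })) {
        if ($current.Length -gt 0 -and ($current.Length + 1 + $word.Length) -gt $MaxLength) {
            # Adding this word would exceed the limit: close the current chunk.
            $chunks.Add($current)
            $current = $word
        } elseif ($current.Length -gt 0) {
            $current = "$current $word"
        } else {
            $current = $word
        }
    }
    if ($current.Length -gt 0) { $chunks.Add($current) }
    # The unary comma keeps the result an array even when there is only one chunk.
    return ,$chunks.ToArray()
}

# Usage: each chunk would become one text-to-speech request and one audio file.
Split-TextIntoChunks -Text 'Guten Morgen. Wie geht es Ihnen heute?' -MaxLength 20
```

Breaking at whitespace alone is what makes some languages chunk worse than others; preferring punctuation as the break point would be the obvious refinement.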

Batch-download the files from Google Translate, using MS-PowerShell:

$global:folder = 'G:\Temporary Internet Files\Content.IE5'
$filter = '*.mp3' # set this according to your requirements
$global:destination = 'G:\conf\programs\GoogleTTS\mp3'
$global:path = $null  # file reported by the most recent Created event
$global:path1 = $null # file reported by the Created event before that
$currenttime = $null
Function MonitorAndMoveFile {
    $fsw = New-Object IO.FileSystemWatcher $folder, $filter -Property @{
        IncludeSubdirectories = $true # yes, I need this for GoogleTTS; otherwise set this according to your requirements
        NotifyFilter = [IO.NotifyFilters]'FileName, LastWrite'
    }
    # The event monitored is "file created". To force GoogleTTS to recreate files,
    # you may first have to clear the watched folder of all mp3 files smaller than 100 KB.
    $onCreated = Register-ObjectEvent $fsw Created -SourceIdentifier FileCreated -Action {
        $global:path = $Event.SourceEventArgs.FullPath
        Write-Host $global:path -ForegroundColor Magenta # this works also
        $name = $Event.SourceEventArgs.Name
        $changeType = $Event.SourceEventArgs.ChangeType
        Start-Sleep -Seconds 2 # the Created event is raised as soon as a file is created
        if ($global:path -ne $global:path1) { # a Created event on a different file from last time (just in case Created does not fire clear-cut, though it seems to)
            if ($global:path1) { # nothing to copy yet on the very first event
                $currenttime = Get-Date -Format 'yyyy-MM-dd-HHmmss' # 24-hour clock, so file names sort chronologically
                Write-Host "attempt copy $global:path1 to $currenttime" # try copying the previous file
                # Use -LiteralPath because file names in the temp folder usually contain [ and ], which otherwise act as wildcard characters.
                # This copies the previous file: earlier files get overwritten, and the last generated file is the one that remains behind.
                Copy-Item -LiteralPath $global:path1 -Destination (Join-Path $global:destination "$currenttime.mp3") -Force
            }
            $global:path1 = $global:path
        }
    }
    while (1) {
        Start-Sleep -Milliseconds 100
        Write-Host $global:path # this works
    }
}
MonitorAndMoveFile
#Unregister-Event -SourceIdentifier FileCreated

Merge the downloaded files (wisely numbered sequentially):
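If no audio tool is at hand for the merge, a crude alternative is to concatenate the files byte by byte in name order. This is a sketch under assumptions, not the tool used for this post: the function name `Merge-BinaryFiles` is hypothetical, and plain concatenation of MP3 files only tends to play correctly when all pieces share the same encoding settings (the result may still carry stray tags and wrong duration metadata, which is one more reason for the clean-up pass in the audio editor).

```powershell
# Hypothetical helper, for illustration only: naive byte-level concatenation.
function Merge-BinaryFiles {
    param(
        [string] $SourceFolder,     # folder with the sequentially named pieces (use a full path)
        [string] $OutputFile,       # full path; keep it OUTSIDE $SourceFolder so it is not merged into itself
        [string] $Filter = '*.mp3'
    )
    $out = [System.IO.File]::Create($OutputFile)
    try {
        # Sorting by name gives the original order, provided the names sort sequentially.
        Get-ChildItem -LiteralPath $SourceFolder -Filter $Filter |
            Sort-Object Name |
            ForEach-Object {
                $bytes = [System.IO.File]::ReadAllBytes($_.FullName)
                $out.Write($bytes, 0, $bytes.Length)
            }
    } finally {
        # Always close the output stream, even if reading a piece fails.
        $out.Dispose()
    }
}
```

A call would look like `Merge-BinaryFiles -SourceFolder 'G:\conf\programs\GoogleTTS\mp3' -OutputFile 'G:\conf\programs\GoogleTTS\merged.mp3'`, where the output file name is again only an example.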
Fix minor errors in your audio editor:
Done:
1. Here I have a lot of questions for a speaking exam in ESL, and with a much better accent than my own.
2. Nifty, plus the output sounds even better for German than for English. Note that there is no attempt to parse sentences semantically. Some languages chunk better than others (I made some small improvements in this regard to the original program). Other common problems include numbers. In German, I find myself, when listening, tending to look up once in a while and shake a high school student by the shoulders, asking him: “Do you actually understand what you are reading?!” – which in my eyes is an indicator of the progress made in speech synthesis.
3. Other examples include French,
4. Hindi,
5. Italian,
6. Spanish.
So can the LRC relieve teachers of recording their cue files for the digital audio lab listening comprehension and exam? Within limitations.

Categories: audience-is-language-learning-center-manager, audience-is-teachers, digital-audio-lab, English, French, German, Hindi, Italian, learning-materials, Listening, service-is-applying-learning-tools, service-is-learning-materials-creation, service-is-programming, sourcecode, Spanish Tags: automation, google-translate, powershell, text-to-speech

Sanako Study 1200 V6.1 implements text-to-speech with language learner features

2013/02/12 plagwitz

Sanako continues its foray into learning materials – this time semi-automated (makes sense to me: what can be automated, will be automated) and into text-to-speech automation (makes sense to me: one of the more robust (since simpler) applications of AI to NLP) for pronunciation help (also makes sense to me: can help my language learners fight their fear of losing face).
Text-to-speech looks like a great addition in Sanako Study 1200 V6.1 for the language learner:
1. especially since it
  1. includes play speed options
  2. allows for download of speech rendition for review
  3. can blend with a human expert in the face-to-face classroom
2. while saving human experts time to record audio learning materials.
3. Caveats:
  1. Would like to know more about IVONA voices.
  2. Pricing? Available languages?
We are still on Sanako 5.2, but will be losing our Deskbot text-to-speech wizard with XP soon. So, coincidentally, I have just been wondering whether we will be able to hack together a text-to-speech solution on Windows 7, maybe using Google translate voices, but without the Google translate features that are commonly abused by language learners. Update: Look here for automating Google translate text-to-speech.

Categories: digital-audio-lab, e-languages Tags: google-translate, learning-materials, sanako-study-1200, text-to-speech

How to use Google translate for writing Cyrillic letters with a western keyboard, pronunciation help, and text-to-speech

2012/02/16 plagwitz

Go to Google translate and do like so. Useful for learning, as well as typing when teaching.

Categories: Absolute-Beginner, Advanced, audience-is-students, audience-is-teachers, Beginner, Dictionaries, e-languages, Intermediate, Listening, Reading, Russian, service-is-evaluating-learning-tools, Speaking, Writing Tags: character-input, google-translate, text-to-speech
