How do weird duolingo sentences get generated?
Duolingo sometimes seems to have hilarious and odd sentences that occur in it's lessons and there's even a twitter page dedicated to it. I've been curious HOW these sentences occur? As presumably they are not programmed in..... Such things as "I'm a penguin" and "the waitress has nine fingers" Two that personally occurred on mine have been " the woman is in my bedroom" and "my father is the son of peasants"..... Can anyone explain?
I have no insider knowledge on this, but my best guess as to how Duolingo works is the following:
- All sentences are in fact manually generated!
- Each language have a long list of sentences Duolingo can use.
- Each sentence has a list of the vocabulary it uses as well as each grammar rules it uses.
- Each sentence also has a list of accepted translations.
- Each lesson have a list with the grammar/vocabulary that it is supposed to teach.
- At each step in a lesson a sentence is chosen from the list of of sentences of the language such that:
- It uses a word or a grammar rule that the lesson is supposed to teach.
- It uses no word or grammar rules that is not already known or supposed to taught in the lesson.
My reasons for believing this are the following:
- The number of sentences seems quite limited, which would not be expected if they were randomly generated.
- Duolingo seems quite good at accepting solutions that maintain the meaning of the sentence, but are far from a direct translation. If the sentences were randomly generated, so would the translations have to be.
- One translation I saw had an error that clearly indicated that the translation was made manually. (A space in the middle of a word)
- When you revisit an old lesson Duolingo includes some sentences that uses words and grammar rules that are only taught further down the language tree.
- The sentences are to some degree meaningful eg. if you lost a finger you would have nine, the colours of objects are usually correct, etc..
I might be mistaken, but I cannot think of any other explanation that explains all my observations.
I think that Duolingo has some sort of an "adlibs" type system, where they have a sentence and just say that this word should be a noun and this one should be an article of clothing and this one should be food, and then it takes them and puts in words that you're learning or need to work on. With that said though, I have had no part in developing Duolingo and have not heard this from anybody who does, this is just my theory.
Also, my favourite weird sentence is near the end of the Spanish tree, and it roughly translates to: "She used to carry the tomato from America".
I mod ENG-for-TR and personally can't resist translating sentences like "the penguin is green today" when I see them. So visceral and all, I imagine it leaves a unique image in the learner's mind and has a further lasting memory.
Also there are anywhere between a handful and a buttload of sentences to choose from for each word, of which I need to translate at least 3. With the many options it's usually possible to find the most natural sounding sentences for the language pair that is being taught, but sometimes it's simply not good enough. And how those sentences are generated in the first place is that I imagine it's a randomly process at first, but are filtered for nonsensical ones later on--would me the most efficient operation imo. Also you can assume that all of those sentences make sense in one language or another as they are, but get wacky once translated; very tricky :-)
And sometimes you get clear pop culture references ("So long and thanks for all the fish") but there might be some adlib system where the sentences are mostly written out manually but with some gaps allowing variation. Those "which of these three sentences are correct" questions would at least suggest that...
One wonders though about those "creepy stalker" sentences, the "I hide the child in my cellar" type.
Strange things stand out which makes them more memorable. So I guess this is on purpose to make sure we remember these things and the extra vocab it opens up to us. Also, forigners are a funny bunch, so phrases like "the yellow cow reads the newspaper to the gentlemen" is probably quite common amongst, say, the French or the Dutch in compason to us English speaking folk.
I have noticed some things about these weird phrases:
-Some are acctually fragments of thongue twisters.
- Some like for exemple "life is suffering" acctualy comes from famous books of philosophy and literature ("life is suffering" is the first buddhist nouble truth)
-Some are memes or local jokes.
Some phrases are creepy, but you need to learn them for your safety (like "I hear screams", is a phrase that you would need to call the police, or "My friend is bleeding" is a phrase that you would need on a medical emergency)
And finnaly, some phrases are gramaticaly correct, but makes no sense, so they avoid that situation where you learn the meaning of a phrase by heart, but don't practice the grammar leason related to that phrase.
I had a whole unit in Welsh which was about a horse doing all kinds of things...do you have a horse at the factory, do you have a black horse, does megan want a horse...etc. This after just having done a unit on travel and countries, I ended up writing a short welsh story about the horse sailing the ocean to south america eating pink parsnips. All with vocab and stuff learned from Duo's fun sentences. Parsnips (pink and otherwise) are another fond favourite of the Welsh course (often with Owen in tow).
...This means that I learned something, so I guess that's the rationale and that they're manually done to stick in the mind. My sister (who does German and Norwegian) and I end up sharing the day's funny sentences between us...