How many words does each different tree have?
I was curious because I saw a post where a completed, golden Portuguese from English tree had 2800+ words, whereas mine (the Portuguese from Spanish course) has 3087 words at the moment. These are the stats of my only finished tree; please follow the same format and comment all you want:
Portuguese from Spanish, finished: yes, golden: yes(*), level: 14, words: 3087
(*) It's ok to put "golden: yes" if you managed to keep the tree golden at least once in an advanced state or with the course completed.
If you are very advanced in a course, please also post your stats, as you may have more words than someone who finished a tree in no time but neglected its golden status.
This is reassuring for me, as I know all the basic grammar quite well. My biggest problems are word order and separable verbs. As my mother says, you have to wait until the end of a sentence to get the most important part of what you're saying... And too often, I get lost too soon!! :D
Regarding French, my main difficulty is pronunciation, since French grammar is quite similar to my native Spanish.
I don't understand you. As far as I know, you can only see the words you have learned if the course has the "Words" tab. For example, my German tree has only 18 words!! I don't think there are courses that show the "Words" tab only once you finish the tree or make it golden. Please correct me if I am wrong.
On the other hand, there's a script that shows the hidden strength of your skills. Maybe it also tells you the total number of words or something like that, but unfortunately I didn't manage to install it.
What I mean is that the words of a course are the ones:
-from the lessons
-NEW ones from strengthening!!!
So little by little I'm adding new words from strengthening, even though I finished the course a few days ago. And so far, I can't tell how many words or expressions are still left for me to learn.
I just wrote a small script and checked the finished trees available to me:
- Hebrew vocabulary size is 2676
- Romanian vocabulary size is 2342
- Swedish vocabulary size is 2228
- German vocabulary size is 2126
- Danish vocabulary size is 2120
- Russian vocabulary size is 2109
- Welsh vocabulary size is 2051
- Greek vocabulary size is 1949
- Portuguese vocabulary size is 1942
- French vocabulary size is 1913
- Esperanto vocabulary size is 1829
- Italian vocabulary size is 1791
- Polish vocabulary size is 1769
- Irish vocabulary size is 1644
- Spanish vocabulary size is 1588
- Turkish vocabulary size is 1393
- Ukrainian vocabulary size is 1086
Keep in mind, I got these "magical" numbers not from the "Words" tab but from a GET request to the server. This is an example for your account: https://www.duolingo.com/vocabularies/size?user_id=24953571
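For anyone who wants to try the same request from a script, here is a minimal Python sketch. Only the URL comes from the post above; the function names are made up for illustration. I don't know the response format, so the sketch just returns the raw body, and the request may well need a logged-in session for the account in question.

```python
from urllib.parse import urlencode
from urllib.request import urlopen

# Endpoint taken from the example URL in the post above.
BASE_URL = "https://www.duolingo.com/vocabularies/size"


def vocabulary_size_url(user_id: int) -> str:
    """Build the request URL for a given account id (hypothetical helper)."""
    return f"{BASE_URL}?{urlencode({'user_id': user_id})}"


def fetch_vocabulary_size_raw(user_id: int, timeout: float = 10.0) -> str:
    """Send the GET request and return the raw response body.

    Assumption: the server answers a plain, unauthenticated GET.
    If it needs a login session, this call will fail or return an error page.
    """
    with urlopen(vocabulary_size_url(user_id), timeout=timeout) as response:
        return response.read().decode("utf-8")


# Usage (performs a real network request):
# print(fetch_vocabulary_size_raw(24953571))
```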
Aha, then I think this number from the server is actually the total number of stem-words in the course (in contrast to what I get from the "Words" tab, which is the number of words learned so far). I mean, if I've studied "falo", "fala", "falei" and "falou" in Portuguese, the server would count them as only 1 stem-word instead of 4 learned words, because they are all forms of the verb "falar". So this would explain why you get 1900+ words and I get 3000+ words. In a nutshell, these would be different ways of counting words, and the server response would be closer to reality...
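To make the two ways of counting concrete, here is a small Python sketch of the "falar" example. The lookup table is hand-written for illustration and is only my guess at how forms might be grouped; it is not how Duolingo actually does it.

```python
from collections import defaultdict

# Hypothetical hand-written lookup from an inflected form to its stem-word.
FORM_TO_LEXEME = {
    "falo": "falar",
    "fala": "falar",
    "falei": "falar",
    "falou": "falar",
}


def count_words_and_lexemes(forms):
    """Return (distinct learned forms, distinct stem-words)."""
    groups = defaultdict(set)
    for form in forms:
        # A form missing from the table is treated as its own stem-word.
        lexeme = FORM_TO_LEXEME.get(form, form)
        groups[lexeme].add(form)
    learned_words = sum(len(group) for group in groups.values())
    return learned_words, len(groups)


words, lexemes = count_words_and_lexemes(["falo", "fala", "falei", "falou"])
print(words, lexemes)  # 4 1
```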
I just thought that maybe this discussion can be useful to you (Russian): https://www.duolingo.com/comment/19920001$comment_id=20410499
I think the hypothesis about the comparison between Portuguese and Russian is a good one. However, it doesn't explain why the ratio of Words total to lexeme count is actually higher for Catalan than for Russian (and this is just my Catalan Words count, and I haven't done the Catalan tree anywhere near as thoroughly as Portuguese or Russian).
Words total; lexeme total; ratio
- Portuguese: 2742 vs. 1943; 1.41
- Russian: 3390 vs. 2109; 1.61
- Catalan: 2997 vs. 1795; 1.67
- Guaraní: 3512 vs. 1579; 2.22
I included Guaraní just for interest and b/c it's another tree where I've presumably gotten the Words count about as high as it can go.
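The ratios in the list above can be reproduced with a few lines of Python. The numbers are the ones posted; the variable and function names are just for illustration.

```python
# (Words tab total, lexeme total from the server) as posted above.
counts = {
    "Portuguese": (2742, 1943),
    "Russian": (3390, 2109),
    "Catalan": (2997, 1795),
    "Guaraní": (3512, 1579),
}


def words_per_lexeme(words_total, lexeme_total):
    """Ratio of Words-tab entries to lexemes, rounded to two decimals."""
    return round(words_total / lexeme_total, 2)


ratios = {course: words_per_lexeme(*pair) for course, pair in counts.items()}

for course, ratio in ratios.items():
    print(f"{course}: {ratio:.2f}")
```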
Yes, I think when you've finished the tree you've seen all the lexemes (a difference of one for Portuguese seems to be the exception that proves the rule more than anything; all my other counts matched FieryCat's) and that when the Word count continues to increase it is b/c you've come across a new form of an already-learned lexeme.
Thanks for the link! I think that for Russian, a highly inflected language, it's natural that the difference between the number of "learned words" and the number of "official words" (stem-words?) is so high, compared with other courses.
I keep thinking about the mystery of the hidden words. My main doubt now would be: if you finish a tree, have you seen ALL the stem words? Maybe when I keep adding "learned words", I am only seeing new forms of already studied words. And I'd also like to know how Duolingo selects the exercises when you do strengthening. My main hypothesis is that there is a set of possible exercises and you get a random subset of them. However, maybe the algorithm is more complicated and already learned words are prioritized, which would make it more difficult to uncover the "hidden words".
Maybe in the Russian course it's really difficult to learn the declensions, because there are too few examples of the different lexemes (stem-words).
Anyway, I am going to strengthen my whole Portuguese from Spanish tree, as I am really curious to know if I can get new lexemes (instead of only new learned words).
It seems as if you are right. There are some oddities in how all these words are counted. I asked my script to count the words in three other ways. This is the Portuguese course from English:
- Vocabulary size from the server: 1942
- Counted words from the course's lessons: 2818
- From the tab "Words": 3556
- From the tab "Words" without duplicates: 3457 o_O
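The gap between the last two numbers is 3556 - 3457 = 99 surplus entries. Here is a rough Python sketch of how such a "without duplicates" count could be made. The sample entries are invented, and treating entries as duplicates when they match after trimming and lowercasing is my own assumption, not necessarily what the script above does.

```python
from collections import Counter


def words_tab_totals(entries):
    """Return (total entries, unique entries, entries seen more than once)."""
    # Assumption: two entries are duplicates if they match after
    # trimming whitespace and lowercasing.
    normalized = [entry.strip().lower() for entry in entries]
    counts = Counter(normalized)
    repeated = {word: n for word, n in counts.items() if n > 1}
    return len(normalized), len(counts), repeated


# Invented sample, not real course data.
sample = ["falo", "fala", "Falo", "casa", "casa "]
total, unique, repeated = words_tab_totals(sample)
print(total, unique, repeated)  # 5 3 {'falo': 2, 'casa': 2}

# Surplus entries in the posted Portuguese numbers.
surplus = 3556 - 3457
print(surplus)  # 99
```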
This is bad news; maybe I am only adding duplicates to my Words tab from strengthening!!
Anyway, I did the first skill in French and this is what I get: 29 learned words and 22 lexemes. I sorted these words to get an idea of what's considered a lexeme:
6) l', la, le
7) une, un
8) elle, il
20) es, suis, est
22) mange, manges
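As a quick sanity check, the five groups listed above are consistent with the 29 vs. 22 figures, if we assume that every lexeme not listed has exactly one form (that part is my assumption). A short Python sketch of the arithmetic:

```python
# Multi-form lexemes as listed above, keyed by their position in the list.
listed_groups = {
    6: ["l'", "la", "le"],
    7: ["une", "un"],
    8: ["elle", "il"],
    20: ["es", "suis", "est"],
    22: ["mange", "manges"],
}

TOTAL_LEXEMES = 22  # reported lexeme count for the first French skill

# Forms that belong to the listed multi-form lexemes: 3 + 2 + 2 + 3 + 2.
words_in_groups = sum(len(forms) for forms in listed_groups.values())

# Assumption: each of the remaining lexemes has a single form.
single_form_lexemes = TOTAL_LEXEMES - len(listed_groups)

implied_words = words_in_groups + single_form_lexemes
print(words_in_groups, single_form_lexemes, implied_words)  # 12 17 29
```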
Assuming I understand the trick correctly:
- Catalan - 1795
- Japanese - 1090
- Swahili - 1205
- Italian from French - 1791 (same as from English it looks like; perhaps not so surprising as they have the same number of lessons as well, 405)
And I get 1943 for Portuguese from English
I didn't check any of the languages that FieryCat had already posted, but I, too, get 1942 for Portuguese; I wonder what your special, bonus word is...
(In fact, I presume bonus skills would make a difference to these numbers, too.)
I'm surprised that Japanese has more words than Ukrainian. It looks so awfully short.
I suspect that the Japanese course winds up having every individual hiragana and katakana character as a separate lexeme, as well as the 100-odd kanji taught, which could be giving it something like a couple hundred lexeme boost. The Greek course is probably also getting such a boost, obviously on a smaller scale, owing to the way it introduces the alphabet.
And you seem to be right about the bonus skills: I just did my heretofore uncompleted Holidays bonus skill for Ukrainian, and now the word count is 1095 (like FieryCat's, it had been 1086 before).
I have a separate account for such cases, and it has all the finished trees from this list. So I need full access to an account that has the finished tree. Anyone can get this information; I described the method here: https://www.duolingo.com/comment/24435667$comment_id=24436819 But you need to know the account id.
Spanish level 25, 74149xp, 4102 words currently. I've completed almost the entire tree to the third crown. I've been finding some new words, but mostly it is repetition. I wish that there were more vocabulary words added in each crown level. Duo is a fun tool but that's my one complaint.