It's the last day for me in New Haven, and as I come back all soaked from the rain I contemplate the extent of the work I've done for the last couple weeks. After today, I will go to Taiwan for the rest of the summer for another language program there.
One of the major things I was working on the past few weeks was naturally working on standardizing of glosses. By the end of the week I looked back at the standardized glosses with the most hits. Here are the 15 top contenders for most frequently used standardized glosses so far, with Sutton & Walsh codes where applicable:
15. fish (K1) - 576 entries
14. mother (B53) - 581 entries
13. 1sg - 591 entries
12. mouth (A35) - 597 entries
11. head (A1) - 604 entries
10. two (N2) - 604 entries
9. fire (E26) - 619 entries
8. tongue (A41) - 626 entries
7. ear (A28) - 638 entries
6. foot (A127) - 638 entries
5. moon (G28) - 651 entries
4. hand (A59) - 656 entries
3. tooth (A39) - 668 entries
2. water - 668 entries
1. eye (A14) - 672 entries
Not surprising that about half of these top 15 contenders are body parts, reliably the most common sort of item that appears on wordlists, that and kinship relations-- although here surprisingly there is only one kinship term "mother" on the top 15. Of course, I am not finished tagging every last entry with a standardized gloss. The addition of a couple thousand entries to the 20,000 or so entries (either imported or manually added) over the last weekend or two seemed to ensure that this would happen. Although I don't have exact statistics, I would nevertheless say that we're off to a good start on standardizing things that we probably are most interested in (including body parts and kinship relations, conveniently enough).
Of the Moreton Bay vocabulary I was working this week to put into the database, we have the following dialects entered as of today (from the Bannister source):
1. Coobenpil
2. Guwar
3. Djandai
4. Yaggara
5. "Moreton Bay" vocabulary
This by no means exhausts the wordlist data in Bannister, and it is already too late to finish... but perhaps this can be one of the things we could start with in the fall, as well as putting in Yinjilanji data as well.