Summary: Word lists in Wordcorr collections may come from published sources, but frequently come from unpublished sources such as live interviews.
![]() |
Comparative word list data come from a variety of sources. Many linguists do their own field work, and get their word lists direct from native speakers of the varieties. Or information on different varieties may come from different books, monographs, or articles.
Or both. Before you go to the field you may search the library for words from languages in the area you are going to visit, and set them up in Wordcorr. Then when you get there, you may expand or correct that information with unpublished information that you pick up on the spot, and you may add in varieties that have not been reported on by any other linguist.
Other linguists are going to want to know just where the information on each variety came from. So you fill in the information as you start to work on each variety. If you need to augment it as you get more information, you can do that.
Some of your data do not exist anywhere else in computer accessible form or even in print. Apply the journalist's "what, who, when, where, why, how" criteria judiciously to fill in this information. If one of your students were to visit the area two years from now, could they find the place where you collected your data? Could they find the person?
Remember that the people who give you information may not be in a place where streets or street addresses have reached. I've collected data sitting in a canoe in the middle of a jungle river in Suriname. But I could find my consultant (if he's still alive) tomorrow, because I asked for and got quite a bit about his position in his society.
Names can be a problem:
What is important for the study of language change is not so much the names and identities of individuals themselves, but the sectors of society whose speech those individuals represent. Record
Much of what needs to be said about published sources for speech varieties has already been given in the discussion of published sources for collections. Full bibliographic identification includes
There is almost always some information that doesn't fit any of the standard bibliographic categories, or the electronic metadata specification, but that needs to be recorded somewhere. That's what the Remarks section is for.
Field notes, for example, probably use nonstandard, idiosyncratic notations other than IPA, especially if they were transferred to a computer before about 2005. The Remarks field for each speech variety is the place to explain the notation; Remarks throughout Wordcorr are all activated to allow typing in complete IPA using Wordcorr's easy-to-use conventions laid out in the Help facility.
Other things can go in Remarks as well: information about how to get to where the language is spoken, population estimates, information about where the data were collected if it wasn't in the homeland of the speech variety, common misunderstandings outsiders have about the variety or its speakers -- anything that might be of help to a colleague who might follow up on your work in the next few decades.