Skip to content Skip to navigation

Connexions

You are here: Home » Content » Import from Elsewhere

Navigation

Content Actions

  • Download module PDF
  • Add to ...
    Add the module to:
    • My Favorites
    • A lens
    • An external social bookmarking service
    • My Favorites (What is 'My Favorites'?)
      'My Favorites' is a special kind of lens which you can use to bookmark modules and collections directly in Connexions. 'My Favorites' can only be seen by you, and collections saved in 'My Favorites' can remember the last module you were on. You need a Connexions account to use 'My Favorites'.
    • A lens (What is a lens?)

      Definition of a lens

      Lenses

      A lens is a custom view of Connexions content. You can think of it as a fancy kind of list that will let you see Connexions through the eyes of organizations and people you trust.

      What is in a lens?

      Lens makers point to Connexions materials (modules and collections), creating a guide that includes their own comments and descriptive tags about the content.

      Who can create a lens?

      Any individual Connexions member, a community, or a respected organization.

    • External bookmarks
  • E-mail the author

Recently Viewed

Import from Elsewhere

Module by: Joseph Grimes

Summary: Data can be morphed into Wordcorr's XML structure by using converters.

Many linguists have already put data into a computer. They may have had no access at the time to any standardized scheme for showing the structure of their data set. But at present there are two paths for getting data into Wordcorr from another computer readable form: spreadsheets and WordSurv. People will probably invent more paths as they are needed.

If you have comparative word lists in a spreadsheet, chances are good that you can copy them en masse into a special spreadsheet constructed by Maria Faehndrich, which a Python script then transforms into a Wordcorr XML file. The special spreadsheet, the Python script, and complete instructions are in the "utilities" package on Wordcorr's download site.

The idea is this: word lists are usually displayed together with the names or codes of speech varieties going with across the top of a table, and the meanings of the forms down the left side of the table. The forms themselves fill in the boxes.

You copy your word lists in a block from your spreadsheet to the place indicated on the special spreadsheet. You also fill in all the metadata for your collection on the special spreadsheet. Then you save that spreadsheet as a tab-delimited file (you may need to get a computer guru to show you how) and aim the Python script at the result (the same guru can probably handle it). You import the resulting file into Wordcorr.

A program called WordSurv has been in use for some time. Its original aim was to provide lexical similarity counts for possibly related speech varieties. It continues to be used as a way of storing word lists in a computer, even though few linguists give credence to its conclusions about relationships because they have been shown to be unreliable in the statistical sense.

Wordcorr makes use of WordSurv's very sensible plan for data storage, but applies widely tested and agreed upon best practices to produce its results. Wordcorr contains a converter for the first versions of Wordsurv, with instructions in Wordcorr Help. More recent versions can be converted by the Palmsurv converter.

WordSurv and older spreadsheets were unable to handle phonetic symbols, so linguists used their imagination: N for angma, E for epsilon, 7 for glottal stop, on and on and usually different for each collection. One of the spreadsheet utilities includes a table in which you can put the Unicode characters of the International Phonetic Alphabet that you want to convert to. For imports from WordSurv, however, hand editing on the Data panel seems to be the only feasible solution so far.

Comments, questions, feedback, criticisms?

Send feedback