Tag Archives: indexing

Mac indexing software for World Archives Project

I wrote previously about the World Memory Project (WMP), a collaboration between the USHMM and Ancestry.com to index USHMM-held records using Ancestry.com’s indexing software from their more general World Archives Project (WAP). In fact, if you look at the World Archives Project site, you’ll see all the World Memory Project projects now are listed as part of their large World Archives Project list.

One of my earliest blog posts (just over two years ago) was on the topic of Giving Back Through Indexing, and suggested volunteering for online indexing efforts as a way of giving back to the genealogy community. At the time FamilySearch’s was the largest effort (and probably still is) and Ancestry.com had yet to launch WAP.

The World Archives Project also lists many genealogy societies as partners, including (among Jewish groups) JewishGen, JGS of Los Angeles, and JGS of Southern Nevada.

Ancestry.com makes all databases indexed through the WAP available for free through their site, although access to images is not free, and sometimes not available at all on Ancestry.com. In some cases, like with USHMM records, the images may only be viewed at the host organization itself.

One of the criticisms I had of Ancestry.com’s WMP and WAP projects when I wrote my earlier article was that they didn’t have Mac indexing software. As a Mac user I was especially disappointed at not being able to participate in these projects.

Apparently, back in October Ancestry.com launched the Mac version of their indexing software (they call it the Keying Tool). I hadn’t noticed that until now, so I downloaded the software and gave it a spin. Like the similar software for FamilySearch indexing, the user interface is a bit clunky. My biggest problem

A look at new Israeli databases

As mentioned previously, I have been involved over the past year in building a new genealogy website in Israel, genealogy.org.il, for the Israel Genealogy Research Association (IGRA). The site has already been recognized as one of the top 40 international genealogy sites by Family Tree Magazine. One of the big issues I’ve had to deal with recently was upgrading the site to cope with the high number of users. The original server could not handle the number of users we were getting, so we’ve now moved the site onto a much better server that can handle the load and allows us to easily add capacity as needed (something impossible on our last server).

While I’ve been working on the nuts and bolts of the site, certainly the reason the site has been so successful is not my work, but more the work of the many IGRA volunteers, led by Rose Feldman, who have been adding dozens of new databases to the site. Their productivity is frankly mind-blowing. Since we launched in January of this year, Rose and her volunteer army have added nearly 50 new databases to the site, and more are coming on-line all the time. Think about that – they are adding more than a database each week. This is only the beginning.

Databases are categorized into three time periods:

  • Ottoman Administration (pre-1917)
  • British Administration (1917-1948)
  • Israeli Administration (1948-)

For those researching Jewish relatives, these databases can help pinpoint where a relative may have been living at specific points in time, and can lead you in new directions in your research. Some records include the person’s exact address.

Databases include things like voter lists, lists of registered lawyers, accountants, doctors and nurses, candidates for various elections, phone books, burial lists, etc. There are over 60,000 records in the database now, and each record contains multiple pieces of information.

The current databases include:

Ottoman Administration (pre-1917)

  • List of Students & Staff of Gymnasia Haivrit, Yaffo 1908-9
  • Kollel List 1912 Safed (population register of families receiving funds “haluka”)
  • Safed Burials 1433-2000 from the new cemetery and part of the old cemetery

British Administration (1917-1948)

  • Births in African and Asian Protectorates for the Years 1916-1940
  • Drishat Shalom (Regards) 1919
  • UK Passport requests made in Safed 1921-1951
  • Donors from UK for Safed Old Age Home run by Simcha Shulman 1924-1929
  • First National Conference of Edot Hamizrah in Eretz Israel – Protocol 1925
  • Nurse Certification 1923-1948
  • Earthquake Donations 1927
  • Voters’ List for Haifa 1928
  • Queries about Land Registries 1928-1929
  • Queries about compensation for loss of property during the disturbances of 1929
  • Voters’ List for Municipal Council Petah Tikva 1930
  • Members of Agricultural Organizations in Petah Tikvah 1931-1936
  • Voters’ List for Municipal Council Petah Tikva 1932
  • Voters’ List for Municipal Council Safed 1932
  • Galician Kollel Safed 1932
  • Voters’ List from Petah Tikva for the 18th Zionist Congress 1933
  • List of Sephardic Males in Safed 1934
  • Award of Silver Jubilee Medals 1935
  • Rehavia Address Book June 1935
  • Deaths in African and Asian Protectorates for the Years 1936-1940
  • Members of the National Youth Aliyah Committee of Hadassah 1937
  • Members of the Histadrut Hamorim (Teachers’ Council) 1939
  • List of Teachers of the Dept. of Education of Havaad HaLeumi 1940-41
  • Candidates for the 1941 Hahistadrut Haklalit
  • Candidates for the 1941 Va’adat Hapoalot (The Women’s Workers’ Council)
  • Hebrew Soldiers of the Yishuv who fell and perished in World War II 1940-1945
  • Refugees in Mauritius 1944-45
  • Swiss Aliyah Requests 1945
  • List of Donors for Repairs of Mikva in Safed 1947
  • Yahrzeit List from the Safed Old Age Home
  • List of people who have files in Beit HaMeiri in Safed

Israeli Administration (1948-)

  • List of registered doctors 1948-1957
  • List of Candidates for the First Knesset (Constituent Assembly) 1949
  • Candidates for the 1949 Hahistadrut Haklalit (General Council Elections)
  • Candidates for the 1949 Moatzet Hapoalot (The Women’s Workers’ Council)
  • Candidates for the 1949 Histadrut Hahaklaim (Farmers’ Council)
  • Candidates for the 1949 Histadrut Hapekidim (Clerks’ Council)
  • Candidates for the second Knesset elections 1951
  • Practical Nurses 1951-52
  • List of Candidates for the Third Knesset 1955
  • Authorized People to act as Accountants 5716 (1955-56)
  • List of Persons Authorized to Act as Lawyers in Civil Courts in 1956
  • Authorized People to act as Accountants 5717 (1956-57)
  • Candidates for the 1959 Histadrut Hapekidim (Clerks’ Council)
  • Candidates for the 1959 Histadrut Hahaklaim (Farmers’ Council)

Some databases that are coming online soon include:

  • Rehavia Address Book June 1937
  • Candidates for the 1959 HaHistadrut Haklalit (General Council Elections)
  • Candidates for the 1959 Va’adat Hapoalot (The Women’s Workers’ Council)

The best news is that a new advanced search engine for these records is coming online very soon (as in days). Few genealogy societies, if any, will have the kind of advanced search capabilities we will have on the site. It’s an exciting time for researching genealogy in Israel.

To stay up-to-date on the launch of the new search engine and on the ongoing launch of new databases, follow IGRA on their Facebook Page (facebook.com/israelgenealogy) or via Twitter (twitter.com/israelgenealogy). Of course, if you want to comment on the web site and where it can be improved, you can always comment here and I’ll see what I can do.

A look at progress on the USHMM records indexing

Back in May I wrote about how the US Holocaust Memorial Museum and Ancestry.com had teamed up to index some of the USHMM’s millions of records online. The indexes would be created by volunteers using special software provided by Ancestry.com, and the indexes would be freely available on Ancestry.com (although not the images, which would only be viewable on site at the USHMM itself). This project was dubbed the World Memory Project (similar to their existing volunteer indexing project, the World Archives Project) and the first results were introduced some months later.

The World Memory Project currently has over 2400 volunteers and has indexed over a million records. The databases that are currently available include:
All of the databases can be searched at once through the main search page.

The 1940 US Census

It’s rare that massive new sources of genealogical information are released, and certainly rare that such sources are released for free. Every ten years in the United States, however, the census from 72 years earlier is released. In the past it has taken a lot of time to get the census made available to the public, primarily because of the massive cost in digitizing and indexing information on tens of millions of people.

On April 2, 2012, the 1940 US Census will be released to the public. Besides the obvious benefit of having information on the over 130 million residents of the United States in 1940, there are other reasons to be excited about this release.

For one, it is the first time that the National Archives is releasing the census in digital form. In the past, companies needed to scan millions of pages of microfilm to create their own digital images of the census records. On April 2, 2012, the National Archives is releasing the entire 1940 census in digital form. There will not be an index to those records, which brings us to the second reason this release is exciting: Many genealogy companies and organizations have been planning for this release for years and it will be indexed in record time.

For starters, Stephen Morse, on his great One Step website, has created with Joel Weintraub and the help of volunteers ways of finding the 1940 Enumeration District (ED) of any address in the United States. They even have a quiz that helps you determine the proper way to figure out the ED for where your family lived in 1940. When the census records are released, searching by ED will be the only way to find records in the census. If you know where your family lived in April 1940 (when the census was taken), then you can find the records for that address using Steve Morse’s tools. For a very detailed look at how the process will work, see Stephen’s article Getting Ready for the 1940 Census: Searching without a Name Index, which appeared in the Association of Professional Genealogists Quarterly this month.

Next, Ancestry.com has announced that they will be making the images and their index to those records (which they will develop on their own) free through at least the end of 2013. It’s unknown how long it will take Ancestry.com to index the records, but presumably their index would be available before the end of 2013.

Archives.com, which has been seeking in recent years to compete with Ancestry.com as a lower-cost service, announced that they have partnered with the National Archives to be the official host of the images that will be released on April 2, 2012. The official site the images will be released on has not yet been announced, but Archives.com has posted information on this partnership at archives.com/1940census.

More recently, it has been announced that three different genealogy companies have joined forces to index the 1940 US Census together and thus make the 1940 census searchable for free as well. These are Archives.com, FamilySearch.org and FindMyPast.com. They will be using FamilySearch.org’s indexing tool (which I discussed almost exactly a year ago here) to coordinate the indexing project.

It makes sense that Archives.com is involved, since they are hosting the images for the National Archives (and have no public indexing tool of their own), and it makes sense that FamilySearch.org is involved (since they have the indexing tool and have previously proven themselves by indexing the 1930 US Census), but the odd man out seems to be FindMyPast.com. What’s interesting is that FindMyPast.com just redirects to FindMyPast.co.uk, as it is actually a British genealogy site. Is FindMyPast planning to move into the US genealogy market, with the 1940 census as their means of doing so? Or are they just planning on offering the 1940 census index to their British users as a means of tracking relatives who moved to the US? The use of FindMyPast.com in the press release instead of FindMyPast.co.uk makes this an interesting question.

Together, the three companies have set up the 1940 Census Community Project. You can check out the information on the project now, and if you’re interested in helping index the 1940 US Census, you can download FamilySearch.org’s indexing tool now and try it out with other projects FamilySearch.org is indexing.

In addition, one of the interesting pages the project has released is what the enumerator was supposed to ask each family when adding them to the census. This gives you a good idea of what to expect when the 1940 US Census is released.

So there you go, we’re 105 days away from the release of the 1940 US Census images. Now you know how you’ll be able to find your family (if they were living in the US on April 1, 1940) when the census is released.
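For anyone who wants to check the dates themselves, here is a minimal Python sketch of the arithmetic. The 72-year rule and the April 2, 2012 release date come from the post above; the "today" date of December 19, 2011 is purely an assumption, chosen because it is consistent with the 105-day count (the posting date is not stated here).

```python
from datetime import date

CENSUS_YEAR = 1940
RELEASE_DELAY_YEARS = 72          # census records are released 72 years later
RELEASE_DATE = date(2012, 4, 2)   # release date given in the post


def days_until_release(today: date) -> int:
    """Return the number of days from `today` until the release date."""
    return (RELEASE_DATE - today).days


# Assumed posting date (hypothetical; not stated in the post itself).
assumed_today = date(2011, 12, 19)

print(CENSUS_YEAR + RELEASE_DELAY_YEARS)   # release year
print(days_until_release(assumed_today))   # days remaining
```

Under that assumed date, the sketch gives a release year of 2012 and 105 days remaining.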

Giving Back Through Indexing

If you’ve done any genealogical research in the past dozen years you’re probably amazed at how much information is available online. The Internet has certainly changed how genealogy is done, even if the number of records online is still a tiny percentage of the records that are actually out there in the real world. Yet, have you ever wondered how all this information made its way online? Certainly finding a hand-written birth certificate or census record of your distant ancestor in the 19th century is not something that happens without a human being spending the time to decipher the handwriting and enter all the relevant information into a database. So how do these databases get created?

In the world of Jewish genealogy, probably the biggest project to index records is JRI-Poland, which has put indexes of over four million Jewish birth, marriage and death records from Poland (and places that were once part of Poland) online. JRI-Poland doesn’t actually put the records online, but an index to those records. Usually the information in that index contains the most important details that you would find if you could view the actual record, such as the name of the person, sometimes the names of the parents, associated dates, etc. On the flip side, without seeing the actual records you can never be sure that all the information is 100% accurate, and you don’t know what was left out (sometimes records have notes written on them that contain information important to genealogists).

JRI-Poland works by locating relevant records in archives across Poland, figuring out how many records exist for each town covered in the archives, then soliciting donations for the indexing projects for each town. The idea is that if you know your family is from a specific town, then you’ll likely contribute to have the records indexed for that town. In their favor, if a town has a lot of records to index (thus costing more to index) there are usually more descendants of those people to help contribute to the indexing projects. Once the money is raised for a town project, JRI-Poland makes copies of all the index pages from the archive and then pays people in Poland to create the electronic indexes of the records.

Why use locals instead of crowdsourcing the indexing like other sites do? I think the main issue is that records from Poland from the 19th century and the early 20th century (when most of these records come from) can be written in a mix of Polish, German and Russian. It all pretty much depends on who was in control of the particular town at the time the records were created. Poland was divided among the various empires in the area multiple times, so the same town might have been under the control of the Austro-Hungarian Empire (which generally used German for records) or Russia (which obviously used Russian for records). Finding descendants of the people from those towns who speak those languages is difficult to do. In order to make the indexing easier and to ensure there are fewer mistakes, JRI-Poland finds local Poles who know the languages they need, and pays them to do the indexing. So your contribution to the process is fairly simple: pay the money.

There are other ways that online databases get created. Sometimes individuals or groups take it upon themselves to put either records or indexes of records online. Sometimes people just post their contributions on their own web pages, sometimes they contribute them to existing online projects (like JewishGen) and sometimes they coordinate their efforts through sites set up for such projects, like USGenWeb or the Immigrant Ships Transcribers Guild (ISTG). These sites help volunteers coordinate their efforts and bring records online, whether a set of local vital records on USGenWeb or a ship passenger manifest on ISTG. Sometimes these kinds of records are difficult to locate, so indexes to the indexes pop up, like Census Finder.

One very good example of a volunteer site is GenTeam.at, which indexes Austrian records. As a group they have indexed over 2.7 million records from across Austria, including hundreds of thousands of Jewish vital records. The records are all indexed by volunteers and the site is free to use (although you must register first).

Another way databases or indexes of records get created is by companies that make money from making the records available online. These companies spend a lot of money in acquiring records, scanning records and creating indexes of these records so that they are searchable on their sites. These companies either have experts in their employ who create the indexes of the records, or pay people to do the work for them (usually overseas where labor is cheaper). Examples of web sites that do this are Ancestry.com, Footnote.com (recently bought by Ancestry.com), WorldVitalRecords.com, etc. Footnote.com, for example, made a deal with the National Archives and Records Administration (NARA) to gain access to many of the files they have (such as naturalization records) and make them available online. Sometimes these sites will let you search their records for free, but if you want to actually see the records and the information in them, you need to join the website and pay a membership fee. There obviously is money to be made in this area as the large companies each have over a million subscribers each paying them money monthly or annually. In these cases, your membership money goes towards their indexing efforts, but not in a directed fashion of course. You’re not contributing to an indexing effort as much as the company’s bottom line, but in the end you do get access to new records.

I won’t go into detail now about the role of genealogy in the Mormon Church, but needless to say it is important on a religious level, and the church has invested a lot of time, effort and money into collecting records from all over the world. They have collected billions of records from countries across the globe, generally on microfilm, and keep those microfilms in a secure underground archive in Utah. From the original microfilms kept in that archive, copies are made available in their Family History Library (FHL) in Salt Lake City, and in Family History Centers (FHC) across the US and across the globe. For decades the only real way to access these records was to go to the FHL in Salt Lake City, or go to an FHC near you and request access to specific microfilms, which if they didn’t have they could borrow from the FHL. Accessing these records was difficult for another reason: if you didn’t speak the language the records were written in, you would need to hire someone who spoke that language to go to the FHL or an FHC, sit in front of a microfilm reader and find the records you were looking for (if they existed at all). Obviously this was (and still is) an expensive proposition.

Of course, with those billions of records on microfilm it was only logical that people would start asking to make the records available online. The problem with putting them online is not only scanning all those microfilms, but also having people create the indexes that will make the records searchable. The Family History Library’s online presence is known as FamilySearch.org. It is through that web site that Mormons, as part of their religious duty, collaborate in creating their own family trees, and it is also where the Mormon Church has started to make those billions of records available online, for free. With such a massive undertaking, FamilySearch.org had to come up with a way to find help in creating the indexes for all their records. What they came up with is FamilySearch Indexing, where they allow anyone who has access to a computer and the Internet to help them index their massive collection of records. FamilySearch claims that over 300,000 volunteers have indexed over 7 million records since 1996. This year they were trying to index 200,000 records (with only a few days left in the year they’re around 185,000 records).

To help out yourself, you start by signing up on their site, and then downloading a computer program which lets you do the actual indexing. The program lets you specify the difficulty of the records you’re willing to work on (easy ones are recent records typed or written in block print; harder ones can be handwritten in a fancy script and written a hundred and fifty years ago, when handwriting was different than it is today) and what languages you can understand. Once you’ve set up the program it runs you through some easy sample records so you get how it works, and then you can start indexing records by downloading them in batches. Batches are collections of records that make up a kind of work unit. You work on all the records in the batch and when you’re done, you can submit them for review and get another batch of records to work on. A single batch might have only a few records to transcribe if they’re difficult, or perhaps dozens of records if the records are all listed on one page and are easy to read.

As you index more and more batches of records, you earn points. Generally, easy records earn you one point and harder records can earn you more points. Mainly, the points are just a way to keep track of how much work you have contributed to FamilySearch Indexing. The site does offer a Premium Membership to volunteers who earn more than 900 points in a calendar quarter. This gets you a Premium Membership for the rest of that quarter and the whole next quarter. FamilySearch estimates one would earn 900 points a quarter by spending about half an hour a week working on indexing. What does a Premium Membership get you? Well, it seems that while FamilySearch.org owns a lot of the records they put online, they don’t own ALL of the records they make searchable online. In some cases they need to pay the owners of these records whenever someone accesses the image of the actual record. As such, in order to see those records, you need to be a Premium Member, either because of your records indexing or because of membership in an organization that sponsors FamilySearch.org (such as the Mormon church). If you do find a record that is restricted to Premium Members only, you could of course earn 900 points that calendar quarter to gain access to it, although sometimes you may find that the record exists elsewhere, and just knowing it exists is enough to send you to another site to find the actual record without doing that much work.
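To make the Premium Membership rule concrete, here is a rough Python sketch of how I understand it. The 900-point figure and the half-hour-a-week estimate come from the description above; the function names, the 13-week quarter, and treating exactly 900 points as qualifying are my own assumptions, not anything FamilySearch publishes.

```python
from typing import Sequence

POINTS_THRESHOLD = 900   # points needed in one calendar quarter (from the post)
WEEKS_PER_QUARTER = 13   # assumption: a quarter is treated as 13 weeks


def has_premium(points_by_quarter: Sequence[int], quarter: int) -> bool:
    """True if the volunteer holds Premium Membership by the end of `quarter`.

    Membership is either earned by reaching the threshold in that quarter,
    or carried over from reaching it in the previous quarter.
    """
    earned_now = points_by_quarter[quarter] >= POINTS_THRESHOLD
    carried_over = (
        quarter > 0 and points_by_quarter[quarter - 1] >= POINTS_THRESHOLD
    )
    return earned_now or carried_over


def points_per_hour(hours_per_week: float = 0.5) -> float:
    """Implied earning rate if the threshold takes `hours_per_week` all quarter."""
    return POINTS_THRESHOLD / (hours_per_week * WEEKS_PER_QUARTER)


# Hypothetical volunteer: points earned in four consecutive quarters.
points = [950, 100, 0, 900]
print([has_premium(points, q) for q in range(len(points))])
print(round(points_per_hour(), 1))
```

In this made-up example the volunteer keeps Premium Membership through the second quarter on the strength of the first, loses it in the third, and earns it back in the fourth. The half-hour-a-week estimate works out to roughly 138 points per hour of indexing under these assumptions.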

I would say, however, that if you use FamilySearch.org, which for what you can access is always free, you should think about contributing to their efforts by doing some indexing yourself. It doesn’t cost you anything but your time, and the next time you find a record on FamilySearch.org that is connected to your family, just remember that the file was probably made searchable by a user like you who contributed their time to indexing.

One of this blog’s focuses is Jewish genealogy, and there has been some controversy between the Mormon Church and the Jewish community concerning the Mormon practice of posthumous baptism of church members’ ancestors (some of whom were Jewish), and even of people not related to church members, in particular famous people. Anne Frank, for example, has apparently been posthumously baptized at least nine separate times. I want to point to an overview of this whole topic at JewishGen: The Issue of The Mormon Baptisms of Jewish Holocaust Victims And Other Jewish Dead. The issue is going to come up eventually, so in the context of discussing FamilySearch in this post I figure now is as good a time as any to bring it up. I’m not going to dwell on the whole issue, except to point out that even though there have been several attempts between Mormons and Jews to resolve it, it continues. There are voices on different sides of the issue: those who believe that the very idea of posthumous baptisms of Jews is sacrilegious, and those who view the religious rites of a different religion as irrelevant to their own. This fight has engendered strong words on both sides, and frankly I have mixed feelings about the whole thing. Certainly many Jewish genealogists have worked hard to make sure whatever information they have on their family is not made available online (such as on public family tree web sites) in order to prevent the possibility that their research might lead to one of their ancestors being baptized posthumously. In this way Mormon genealogy efforts have made Jewish genealogy more difficult, due to the added security and protection many Jewish genealogists have implemented with their family trees. On the other hand, FamilySearch.org has made many very useful records available to all researchers for free, and that has helped all genealogists, including Jewish ones.
I think every non-Mormon needs to decide on their own how they feel about this issue, and how closely they want to deal with FamilySearch considering that it is a branch of the Church of Latter Day Saints. As I’ve shown above, I don’t think there is a problem with using FamilySearch.org to search for records, and I’ve encouraged people to give back by helping index new records, but beyond that interaction, each person needs to make up their own mind about sharing additional information with FamilySearch.org and their parent organization, the Church of Latter Day Saints.