Keeping Your Computer Up-to-Date

AppFresh

The com­put­er is one of our most impor­tant genealog­i­cal tools.

Many of us remem­ber when this was not the case. I have my fair share of mimeo­graphed fam­i­ly group sheets filled out in fad­ing pen­cil wait­ing in a stack to be scanned. But today, with your research find­ings stored in a dig­i­tal data­base and your research con­sist­ing of a blend of pay and free web­sites, with the local and state repos­i­to­ries you want to vis­it tagged in a Google Map, and with your lat­est pho­tos of grave­stones shared on Flickr and Find­A­Grave, you need a com­put­er and you need it to work.

Whether you have a Mac or a Win­dows machine, the key to keep­ing your sys­tem work­ing is main­te­nance. Just like with a car, you should have a sched­ule for main­tain­ing your com­put­er. With a car, every 3,000 or 5,000 miles, you need to change the oil; peri­od­i­cal­ly, you need to rotate the tires. It helps to check the air pres­sure, air fil­ters, and oil lev­el from time to time. There is a sim­i­lar reg­i­men you should fol­low to keep your com­put­er run­ning smooth­ly, so you can focus on your research and not on recov­er­ing from a cat­a­stroph­ic com­put­er issue.

Virus Checking

Those of us who use Macs often come off as smug about the lack of a need for virus checking software. The implication is that the superior design of the Macintosh wards off all threats. (We can be such pains!) Of course, the Macintosh is just as vulnerable as any other operating system. Since OS X was released, not as many viruses have been written for the Mac, but it takes only one virus to endanger your data or your privacy. So, while Macs are less likely to get viruses, the Mac OS is not without its vulnerabilities. Additionally, cross-platform files (such as Microsoft Word files) can arrive with a virus and be sent on with that same virus, whether or not the virus infects your machine.

In addition to viruses, it is important to understand that there are spyware applications that are designed to gather data about you and your online identity. These often run based on your browser, and are therefore often platform independent. So, no matter what kind of computer you have, you should have anti-virus and anti-spyware software, and keep the virus and spyware definitions up-to-date.

For both the Mac and the PC, the two mainstays of the security market, Norton (us.norton.com) and McAfee (www.mcafee.com), offer suites of products that provide protection against viruses, adware, spyware, and a variety of other online threats. The biggest hurdle for me in using virus protection like the programs sold by McAfee and Norton is that they sometimes take over your computer when you are not expecting them to do so. For the Mac, there is also ClamXav (www.clamxav.com), a free open-source virus protection software package. While ClamXav is free, it does not proactively scan new or changed files; you have to remember to run it. Therefore, you get less protection, but also more control over what your computer is doing at any given moment.

Virus and mal­ware pro­tec­tion fall in the cat­e­go­ry of adap­tive main­te­nance. They are ways of adapt­ing to changes in the envi­ron­ment.

System Security Updates

Both the PC in Windows Vista and Windows 7 and the Mac in OS X provide periodic updates to the system software. Some of these are optional. They might be updating a component of the operating system that you do not use, for example. But often the updates will be issued to close up security holes in the operating system. This is known as “adaptive maintenance.” Whenever you receive a security-related upgrade for your operating system, you should allow it to install. The software vendors will usually not announce security issues with their software until a fix is available, so you will probably not even know there is a problem. However, those who would like to exploit security issues with the operating system are constantly on the lookout for these issues, so you should let the experts at Microsoft and Apple give you the benefit of their attempts to keep you and your genealogical data safe.

Security issues are often also discovered in desktop applications, especially Adobe Acrobat and the various browsers: Internet Explorer, Firefox, Chrome, and Safari. Be aware of how your software vendor will make updates available. Some updates, such as system updates for Windows or the Mac OS and updates for many applications, will be delivered to your system automatically whenever it is connected to the Internet and a patch has been released.

In gen­er­al, you should install these sys­tem and appli­ca­tion updates as soon as it is fea­si­ble to do so. If you have any con­cern with whether the updates you are receiv­ing are autho­rized by and deliv­ered from the ven­dor, go to the sup­port or down­loads area of their web­site to ver­i­fy that the change is valid, and learn what defect or vul­ner­a­bil­i­ty the change is intend­ed to address.

Simply Staying Current

You have invest­ed mon­ey in the soft­ware you use every day. More impor­tant­ly, you have invest­ed time in it. You have spent time learn­ing how to use it, fig­ur­ing out its fea­tures and foibles. Any soft­ware that you use a lot for your geneal­o­gy research, whether as a data­base for your records, or as a way to write or share your find­ings, should be pro­tect­ed in anoth­er way. It should be kept rea­son­ably cur­rent. This does not mean that you need to be as assid­u­ous as you should be with installing OS secu­ri­ty patch­es. How­ev­er, you should not be more than two major releas­es behind the released prod­uct. In oth­er words, if the prod­uct is on ver­sion 7, you should be run­ning at least ver­sion 5. This is a gen­er­al rule of thumb, and may vary depend­ing on how much the ven­dor has changed its prod­uct.
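The rule of thumb above can be written down as a small check. This is only a sketch: the function name and the max_lag parameter are made up for illustration, and it assumes you know the major version number of both the installed copy and the latest release.

```python
def is_reasonably_current(installed_major: int, latest_major: int, max_lag: int = 2) -> bool:
    """Return True if the installed major version is no more than max_lag releases behind."""
    return latest_major - installed_major <= max_lag


# The example from the text: the product is on version 7.
print(is_reasonably_current(5, 7))  # running version 5: within two major releases
print(is_reasonably_current(4, 7))  # running version 4: three major releases behind
```

Because this is a rule of thumb, max_lag can be loosened or tightened for a vendor whose major releases change a lot or very little.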

There are a cou­ple of pow­er­ful web­sites and desk­top appli­ca­tions that can help you keep on top of keep­ing your appli­ca­tions cur­rent. For both the Win­dows OS and the Mac OS, there is CNet’s Tech­Track­er (for­mer­ly Ver­sion­Track­er), with both free and sub­scrip­tion ser­vices (www.cnet.com/techtracker-free). For the Mac OS, there is a handy desk­top soft­ware pack­age, AppFresh (metaquark.de/appfresh/) which uses the osx.iusethis.com web­site to keep track of changes to appli­ca­tions, wid­gets, pref­er­ence panes and appli­ca­tion plug-ins. In addi­tion to check­ing for new ver­sions of all the appli­ca­tions sub­mit­ted to osx.iusethis.com, AppFresh also keeps track of Apple and Microsoft Updates (and soon, Adobe updates), to help you keep your sys­tem cur­rent with the lat­est releas­es of the soft­ware you use on a reg­u­lar basis. The tool also allows for Sparkle updates, which are built into many Mac OS prod­ucts to auto­mat­i­cal­ly keep an installed prod­uct aware of updates.

Regular Maintenance

With your com­put­er oper­at­ing sys­tem and the appli­ca­tions you run on it safe, you can focus the bulk of your ener­gy on the search for and analy­sis of genealog­i­cal data. After all, your com­put­er is sim­ply a tool for your research, for find­ing, gath­er­ing, arrang­ing, and stor­ing your genealog­i­cal find­ings. You are doing the key intel­lec­tu­al work of assess­ing sources, think­ing through unique ways to find your way past “brick­wall” prob­lems. It would be a shame if this work were lost because of a virus or a secu­ri­ty hole. More com­mon­ly, sim­ply by neglect of a stan­dard process, your sys­tem may degrade in its per­for­mance, and you will lose the ben­e­fit it can pro­vide you and get drawn into many hours of main­te­nance and repairs, of try­ing to reassem­ble the con­tent you have brought togeth­er. We all know, and I have talked about in this col­umn, the need for back­ups. In addi­tion to back­ing up your sys­tem, you should also main­tain what you have.
An earlier version of this article appeared in the National Genealogical Society Magazine. Used by permission.

Pam Slaton: “Searching for …”

Pam Slaton, host of "Searching for ..."

Update: 9 March 2011

I am not Pam Sla­ton, and do not even know her. A lot of folks are post­ing here think­ing they are con­tact­ing Pam, but, unfor­tu­nate­ly, they are not. I wish I could pass infor­ma­tion on to her, but I am not in touch with her.

This was news to me: Oprah Winfrey’s OWN television network has a show that follows a professional genealogist. The show, entitled “Searching for …,” runs Monday nights at 9/8 Central. Pam Slaton, the genealogist the show focuses on, helps reunite the adopted with their birth families, and other family members with one another after they have been separated for some time and lost touch.

On the OWN site, they write:

“Search­ing For… is a doc­u­men­tary series that fol­lows the real-life work of Pam Sla­ton, a pro­fes­sion­al inves­tiga­tive geneal­o­gist, stay-at-home mom and New Jer­sey house­wife.

“View­ers can expect an intense­ly per­son­al ride when cam­eras fol­low Pam and her clients through each step as they track down lost loved ones. Each searcher’s sto­ry is dif­fer­ent, and the results are unpre­dictable and emo­tion­al­ly charged. Whether Pam’s clients find a joy­ous reunion, painful rejec­tion or trag­ic loss, they all walk away with the clo­sure they were des­per­ate to find.

“Pam Sla­ton’s career as a pro­fes­sion­al inves­tiga­tive geneal­o­gist began near­ly 20 years ago. Want­i­ng to find her own birth moth­er, Pam hired to a pro­fes­sion­al searcher. The expe­ri­ence was the most dev­as­tat­ing of her life, and Pam vowed that no one else should have to go through what she did. She keeps her own pain in mind when help­ing clients on their jour­neys. And her results are astound­ing! Pam has an 85 per­cent suc­cess rate, fol­lows a strict “no find, no pay” pol­i­cy, and is one of the most sought-after pro­fes­sion­al searchers in the coun­try.”

I will have to take a look.

One of the key aspects of geneal­o­gy shows, which this one looks to have in spades, is an emo­tion­al com­po­nent that most non-geneal­o­gists seem to not expect. With a focus on re-unit­ing liv­ing peo­ple, Pam Sla­ton’s niche in geneal­o­gy seems to be focused direct­ly on emo­tion­al con­tent which should dri­ve the show. Unfor­tu­nate­ly, I don’t know how many peo­ple know about this show.

Google Docs Goes Native

Google Docs was once an application that was “like Microsoft Word” or “like PowerPoint”, and could read and write files from those programs as well as Excel. But mainly, you understood that you were editing your file and storing it in Google’s proprietary format.

Then, in January 2010, Google announced that they would allow users to store any file format in their Google Docs environment. That started to look like another cloud storage offering. Frankly, it didn’t make a lot of sense to upload files you could not even open in that environment. Google took a big step toward addressing that this week, making some key formats natively viewable within Google Docs.

On their blog, they say:

The Google Docs View­er is used by mil­lions of peo­ple every day to quick­ly view PDFs, Microsoft Word doc­u­ments and Pow­er­Point pre­sen­ta­tions online. Not only is view­ing files in your brows­er far more secure than down­load­ing and open­ing them local­ly, but it also saves time and doesn’t clut­ter up your hard-dri­ve with unwant­ed files.

Today we’re excit­ed to launch sup­port for 12 new file types:

  • Microsoft Excel (.XLS and .XLSX)
  • Microsoft Pow­er­Point 2007 / 2010 (.PPTX)
  • Apple Pages (.PAGES)
  • Adobe Illus­tra­tor (.AI)
  • Adobe Pho­to­shop (.PSD)
  • Autodesk Auto­Cad (.DXF)
  • Scal­able Vec­tor Graph­ics (.SVG)
  • Post­Script (.EPS, .PS)
  • True­Type (.TTF)
  • XML Paper Spec­i­fi­ca­tion (.XPS)

Not only does this round out sup­port for the major Microsoft Office file types (we now sup­port DOC, DOCX, PPT, PPTX, XLS and XLSX), but it also adds quick view­ing capa­bil­i­ties for many of the most pop­u­lar and high­ly-request­ed doc­u­ment and image types.

In Gmail, these types of attach­ments will now show a “View” link, and click­ing on this link will bring up the Google Docs View­er.

For me, one of the few annoy­ing aspects of how Gmail and Google Docs work togeth­er has been that, in the ear­ly days, sim­ply open­ing up a Word doc­u­ment in my Gmail would auto­mat­i­cal­ly cre­ate a doc­u­ment in Google Docs, or that it would­n’t allow me to pre­view it, and would force me to down­load the file. Now, I will sim­ply be able to View these doc­u­ments, and have them dis­ap­pear into the brows­er cache at the end of the ses­sion.

More Technology News for Genealogists

Google

Ear­li­er this week, Apple announced a new sub­scrip­tion pay­ment mod­el for the iPad.

Google respond­ed yes­ter­day with a much more flex­i­ble sub­scrip­tion mod­el using Google Check­out (a Pay­Pal com­peti­tor), and pro­vid­ing 10% in rev­enue for Google (in com­par­i­son with Apple’s 30%). Google does not require that the in-app pur­chase price be at least as inex­pen­sive as any oth­er web offer­ing of the prod­uct. It’s a more open pro­gram, and hope­ful­ly will gain trac­tion and help fos­ter a more sus­tain­able sales mod­el for con­tent providers.

Until and unless other models come along, expect genealogical content providers, as they move into the tablet space, to opt for the Google pricing model, which will better align with their operating profit margins.

SlideShare

SlideShare is a site that allows you to upload PowerPoint-style slides to share with others. (I post all my slides at SlideShare: http://www.slideshare.net/genealogymedia.) This week they announced a free 1‑click conferencing product, Zipcast. I have not tried it, but it looks interesting, as most conferencing systems that share slides require that the slides be uploaded in real time, as images from the person sharing the slides. Zipcast might be faster, because the slides will not need to be uploaded during the meeting, and will already be optimized for web viewing at SlideShare.

Don’t be sur­prised if your next geneal­o­gy meet­ing does not hap­pen in per­son, but instead over SlideShare’s Zip­cast.


Subscriptions on the Apple App Store

Magazines on the iPad
Apple’s iPad

Apple announced today that they will be sup­port­ing sub­scrip­tions on the App­Store. A lot of us have been think­ing that would make for a good day, as it nev­er made sense for own­ers of the iPad to only be able to buy some­thing like a mag­a­zine for the iPad one issue at a time (often for more than a print sin­gle copy).

How­ev­er, the way that Apple is doing this is caus­ing a great deal of con­ster­na­tion out­side of Cuper­ti­no.

First, they are demanding 30% of every subscription sale. This is similar to the rate paid on magazines at the newsstand, but not having to provide that discount to magazine stands is part of what allows magazine subscriptions to be so inexpensive. Apple does allow people who sell subscriptions to do so “outside the app.” But, again, the bargain they are asking people to make is draconian. In their press release, they write:

“How­ev­er, Apple does require that if a pub­lish­er choos­es to sell a dig­i­tal sub­scrip­tion sep­a­rate­ly out­side of the app, that same sub­scrip­tion offer must be made avail­able, at the same price or less, to cus­tomers who wish to sub­scribe from with­in the app.” In oth­er words, the time hon­ored tra­di­tion of the “cut-out-the-mid­dle­man” buy direct dis­count is not going to be allowed.

This means that Ama­zon can­not sell books in the iOS ver­sion of the Kin­dle read­er, even though that read­er only has a link to Ama­zon’s web­site to make that pur­chase. (For titles sold through Ama­zon’s Dig­i­tal Text Pro­gram, authors and pub­lish­ers get a 70% roy­al­ty. Sim­ple math shows that if Ama­zon gives Apple the remain­ing 30%, they will be spend­ing mon­ey to sup­port pub­lish­ers, authors, and Apple, with­out a pen­ny going to pay for Ama­zon’s serv­er farms, let alone its employ­ees or share­hold­ers.)
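The "simple math" can be spelled out in a few lines. This is a sketch only: the 70% royalty and 30% fee are the figures quoted above, while the $9.99 price and the function name are illustrative assumptions.

```python
from decimal import Decimal


def store_remainder(price: Decimal, royalty_rate: Decimal, platform_fee_rate: Decimal) -> Decimal:
    """Return what the seller keeps after paying the royalty and the platform fee."""
    royalty = price * royalty_rate
    platform_fee = price * platform_fee_rate
    return price - royalty - platform_fee


price = Decimal("9.99")  # hypothetical e-book price

# 70% royalty to the author or publisher, 30% to the platform owner.
print(store_remainder(price, Decimal("0.70"), Decimal("0.30")))

# Same sale with no platform fee, for comparison.
print(store_remainder(price, Decimal("0.70"), Decimal("0")))
```

With both cuts taken out, nothing is left for the seller; without the platform fee, the seller keeps the remaining 30% of the price.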

Ama­zon does not have a sim­i­lar pol­i­cy. If you sell a book on Ama­zon, you can set the price, or let Ama­zon set guide­lines on the price ($2.99 — $9.99 and 20% less than the cheap­est print ver­sion of the title), and get a bet­ter per­cent­age of the sales price. But there’s noth­ing to stop some­one from sell­ing a Kin­dle-for­mat­ted book for $9.99 through Ama­zon and $7.99 direct­ly from them. This is called the agency mod­el, and it means that when Ama­zon acts as the pub­lish­er or author’s agent, they get income, when they don’t … they don’t get income, and fur­ther­more, they make no stip­u­la­tions about how much the author or pub­lish­er can sell the Kin­dle book for out­side of the Ama­zon store.

At best, this announce­ment by Apple will make legit­i­mate ven­dors of books, mag­a­zines, and audio and video think twice before offer­ing their ser­vices at cur­rent prices through the App Store, since doing so would incur a steep fee that they did not have before. At worst, some com­pa­nies will play, but oth­ers will be left out. It seems like a sure way for Apple to make good rev­enue from those who remain, and to sti­fle com­pe­ti­tion from the likes of Hulu and Net­flix (video rentals), Ama­zon (books and mag­a­zines), and Rhap­sody (music).

A com­pre­hen­sive arti­cle on the reac­tions appears on Read­WriteWeb: “A Round-Up of Reac­tions: Apple’s Greedy, Anti-Com­pet­i­tive, Evil, Bril­liant Announce­ment.” This arti­cle points out that the Wall Street Jour­nal mus­es about the legal­i­ty of the announce­ment:

“Apple Inc.‘s new sub­scrip­tion ser­vice could draw antitrust scruti­ny, accord­ing to law pro­fes­sors,” writes the Jour­nal’s Nathan Kop­pel. Accord­ing to the arti­cle, the antitrust argu­ment hinges on two pri­ma­ry points — whether or not Apple is exert­ing “anti­com­pet­i­tive pres­sures on price” and whether Apple is a “dom­i­nant play­er in the mar­ket.”

But what does this mean for geneal­o­gists? We may nev­er know for sure. If Apple’s strat­e­gy goes for­ward, but actu­al­ly does have a chill­ing and anti­com­pet­i­tive impact, a lot of con­tent and ser­vices, some not yet con­ceived of, may not come to a dom­i­nant plat­form. Geneal­o­gists are rav­en­ous con­sumers of books, includ­ing e‑books and audio books. This may delay or stop the deliv­ery of a lot of titles that might oth­er­wise have been avail­able. Hope­ful­ly, Apple will re-think their announce­ment, at least as it con­cerns how ven­dors price and sell their con­tent off the iPad.

Using the Wayback Machine for Genealogy

Geocities Has Closed

The Wayback Machine, a project of the Internet Archive (current version: http://web.archive.org/; new beta version at http://waybackmachine.org/), is an attempt to archive the complete content of the Internet. Brewster Kahle, the co-founder of the Internet Archive, spoke about the project in the Saturday keynote address at RootsTech 2011.

The key pur­pose of the Inter­net Archive is to make the Inter­net avail­able for future his­to­ri­ans and oth­er researchers, in order that they might know what we were say­ing and doing in this often ephemer­al envi­ron­ment called the Inter­net.

But it can also help us in the here and now. If you ever encounter a pub­licly avail­able site that has dis­ap­peared, you may find it else­where on Google, but, fail­ing that, you may find it in the Inter­net Archive.

For exam­ple, on an old Rootsweb page that I am in the process of migrat­ing to this site, I have a link that is no longer work­ing. (As the lin­go goes, I have “link rot”.)

I try to link to:

http://www.geocities.com/Heartland/Hollow/1936/index.html

When I try to nav­i­gate to this site, I get a mes­sage say­ing:

“Sor­ry, the GeoC­i­ties web­site you were try­ing to vis­it is no longer avail­able.
GeoC­i­ties has closed, but there’s a lot more to explore on Yahoo!”

This does not offer much solace. How­ev­er, when I go to the Way­back Machine and enter the URL I was search­ing for, I receive the fol­low­ing link:

http://web.archive.org/web/*/http://www.geocities.com/Heartland/Hollow/1936/index.html

Alter­nate­ly, if I go to the beta ver­sion of the new Way­back Machine and enter this search I get to:

http://waybackmachine.org/*/http://www.geocities.com/Heartland/Hollow/1936/index.html

This page shows me the var­i­ous snap­shots the Inter­net Archive got around to mak­ing of this page. When I click on the most recent, I see that it has a link to a new loca­tion:

http://freepages.genealogy.rootsweb.ancestry.com/~pre1800vias/

I can also look at oth­er snap­shots to see what the site looked like at that time.
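The two listing links above follow a simple pattern: the archive's base address, then /*/, then the original URL. Here is a minimal sketch of that pattern, assuming it holds for other pages as it does for this one; the function name is made up for illustration.

```python
def wayback_listing_url(original_url: str, base: str = "http://web.archive.org/web") -> str:
    """Build the address of the snapshot listing for original_url.

    The default base is the current Wayback Machine; pass the beta
    address to build the equivalent link there.
    """
    return f"{base}/*/{original_url}"


dead_link = "http://www.geocities.com/Heartland/Hollow/1936/index.html"

# Listing on the current Wayback Machine.
print(wayback_listing_url(dead_link))

# Listing on the beta version of the new Wayback Machine.
print(wayback_listing_url(dead_link, base="http://waybackmachine.org"))
```

This can be handy when you have a whole page of rotted links to check, since you can build each listing address without typing it into the search box.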

The Internet Archive cannot instantaneously capture the whole Internet, but every couple of months, it traverses most of the public web, captures what has changed, and moves on. You should not rely on it, either as a web user or as a webmaster; however, it can prove very handy at times. Try it the next time you run across a link that you are sure used to work, but no longer does.


RootsTech 2011: Towards a New Genealogical Data Model

On Sat­ur­day at the Root­sTech con­fer­ence in Salt Lake City, there was an open dis­cus­sion ses­sion on genealog­i­cal data stan­dards. There has been a heat­ed dis­cus­sion, lit­er­al­ly going on for years, about a new data mod­el that could replace GEDCOM. A new GEDCOM stan­dard would address GED­COM’s gaps — for exam­ple, being able to store evi­den­tiary analy­sis with­in the data mod­el — and be a liv­ing dynam­ic stan­dard, unlike GEDCOM, which has been sta­t­ic since 1996.

In the first hour, the dis­cus­sion iden­ti­fied sev­er­al issues with the data mod­el:

  • Data in Pro­pri­etary For­mats — Because of gaps in GEDCOM, and the lack of a stan­dards body to address this issue, most soft­ware ven­dors devel­oped their own pro­pri­etary exten­sions, which lim­it­ed the abil­i­ty to share data.
  • Lack of Per­sis­tent URLs (PURLs)
  • Unstruc­tured Text
  • Tag & Link Issues
  • Incon­sis­tent Search Expe­ri­ence
  • Data Ver­sion­ing (Diff/Merge)
  • Inabil­i­ty to Trans­fer Rich Data (rich media)
  • Inabil­i­ty to do Cross-Repos­i­to­ry Search
  • Doc­u­men­ta­tion (in oth­er words, cap­tur­ing the source of a genealog­i­cal state­ment, the abil­i­ty to pro­vide
  • Key as seen (Rep­re­sen­ta­tion) — In oth­er words, how do we nor­mal­ize data while pre­serv­ing the orig­i­nal “as-keyed” ver­sion?
  • Sta­t­ic data inter­change

After the first hour, devot­ed to cre­at­ing this list, we were to vote on buck­ets of tech­no­log­i­cal or fea­ture issues to come up with one or two we could dis­cuss. For me, the biggest issue was not any of these tech­ni­cal issues, it was the lack of a gov­er­nance mod­el. Since no one was signed up to main­tain GEDCOM, it did not change with the times, and died as a stan­dard; in oth­er words, peo­ple saw gaps and addressed them in a pro­pri­etary way, since there was no way to get issues addressed with­in the stan­dard.

I got up and sug­gest­ed we talk about how we build a work­ing gov­er­nance mod­el instead of the issues that the gov­er­nance mod­el would help us solve. For more than a decade, peo­ple have been lament­ing the lack of a stan­dards body to adju­di­cate issues, devel­op a com­mon stan­dard, and sub­mit it for pub­lic review. At the same time, peo­ple have point­ed out the fea­ture gaps, and pro­posed ways to address them. For the fea­ture gap dis­cus­sion to have an effect, how­ev­er, we need to have a place to have these dis­cus­sions that is actu­al­ly designed to main­tain a work­ing stan­dard. Lack of gov­er­nance, not lack of tech­nol­o­gy, is the issue. We vot­ed, and changed the direc­tion of the meet­ing to dis­cuss gov­er­nance.

It was at about this time that Tom Creighton, the CTO of Fam­il­y­Search, got up and announced that Fam­il­y­Search is near­ly ready to announce a new pro­posed data mod­el. This changed the meet­ing imme­di­ate­ly. Instead of an open dis­cus­sion, it became more like a press con­fer­ence, with Tom field­ing ques­tions about what they have done, when the work will be shared, and so on. There was not a lot that he was able to divulge at this point.

Key por­tions of the new pro­posed stan­dard are based on the Gen­Tech genealog­i­cal data mod­el owned by the Nation­al Genealog­i­cal Soci­ety (full dis­clo­sure, I am on the Board of the NGS). The deci­sion to make the new pro­posed data mod­el pub­lic and free has not yet been made by the man­age­ment at Fam­il­y­Search, but is being dis­cussed. This means that there can­not be a date set for the launch of the new stan­dard, as it could remain the intel­lec­tu­al prop­er­ty of Fam­il­y­Search, and unavail­able out­side of Fam­il­y­Search. (Mr. Creighton said that they had dis­cussed the fact that they were devel­op­ing a new stan­dard with sev­er­al soft­ware ven­dors, but had not pro­vid­ed any of them any more detail than that they were work­ing on some­thing.)

This is an excit­ing devel­op­ment in the inter­sec­tion of geneal­o­gy and tech­nol­o­gy. If Fam­il­y­Search decides to share their work, and if a gov­er­nance body can be iden­ti­fied or set up, and final­ly if that gov­er­nance body has the trust of the genealog­i­cal com­mu­ni­ty, includ­ing:

  • the major desk­top and mobile appli­ca­tion devel­op­ers
  • the major web data­bas­es
  • the NGS
  • NEHGS (New Eng­land His­toric Genealog­i­cal Soci­ety)
  • FGS (the Fed­er­a­tion of Genealog­i­cal Soci­eties)
  • BCG (the Board for Cer­ti­fi­ca­tion of Geneal­o­gists)
  • APG (the Asso­ci­a­tion of Pro­fes­sion­al Geneal­o­gists)

we could be near the start of a much richer technology environment. A new data model, addressing issues with GEDCOM and upgraded and changed through a community governance model, could lead to an integrated set of independently developed software tools that would allow people to represent their research better than they can with GEDCOM, and better share their data or move it from one vended product to another.

It sounds a lit­tle like Shangri-la as I write it here, but we are talk­ing about the incred­i­ble poten­tial that would be unleashed if most soft­ware ven­dors did not have to fix inde­pen­dent­ly (or ignore) issues with the cur­rent data mod­el, and could instead focus on the next new way to access and work with genealog­i­cal data.

Update, 17 Feb­ru­ary 2011: A sum­ma­ry of the meet­ing dis­cussed here has been post­ed on the Fam­il­y­Search wiki: https://wiki.familysearch.org/en/Genealogical_Data_Standards_(RootsTech_Session)


RootsTech 2011: Day 3

Internet Archive

Brew­ster Kahle, founder of the Inter­net Archive, gave an incred­i­ble keynote address this morn­ing.

His non-prof­it has been dig­i­tiz­ing and pro­vid­ing on the Inter­net all kinds of media. As he said, “We are in the busi­ness of giv­ing infor­ma­tion away.” He briefly men­tioned “born dig­i­tal” data, but focused his dis­cus­sion on the data we all have in shoe­box­es, what he called the “canon­i­cal box ‘o stuff.”

The Internet Archive has 23 scanning centers in 6 countries. For example, they have digitized documents from the Leo Baeck Institute, and did so while removing private information via remote curation over the web.

Mr. Kahle also discussed their digitization of video content (8mm, Super8, 16mm, video tape). He pointed out that some of this kind of conversion is available in the consumer market, for about $200 / hour. Higher grade (HD-quality) transfers are also available, but are much more expensive.

Specif­i­cal­ly in the genealog­i­cal field, Mr. Kahle said that the Inter­net Archive is involved in cre­at­ing a free genealog­i­cal library — part­ner­ing with Fam­il­y­Search and the Allen Coun­ty Library. Recent­ly, the Inter­net Archive com­plet­ed dig­i­tiz­ing the 1790–1930 Cen­sus and mak­ing it avail­able for free. They are now work­ing on dig­i­tiz­ing pas­sen­ger records. Soon, they will be announc­ing a part­ner­ship with libraries that will allow for 80,000 e‑books to be “loaned” from the library to patrons who are in the library.

For me, this was all pow­er­ful, trans­for­ma­tive infor­ma­tion. But I was most inter­est­ed in Mr. Kahle’s dis­cus­sion of print-on-demand dig­i­tal book­mo­biles, which can pro­vide books as peo­ple need them, at a very low cost. (One exam­ple was that Alice in Won­der­land costs about $1 to print and bind.) Accord­ing to Mr. Kahle, a Har­vard study has shown that it takes a library $3 to loan a book, so $1 to give a book away should be a rea­son­able price. This is being used to pro­vide print­ed books free in India, Egypt, and Ugan­da.

One of the most mov­ing por­tions of the dis­cus­sion was the fact that the Inter­net Archive has dou­bled, to more than 1 mil­lion, the num­ber of books avail­able to the blind and text-dis­abled in the DAISY for­mat for auto­mat­ed read­ers.

A key issue for any archive, Mr. Kahle pointed out, is institutional responsibility: How long, and at what level, can a company or any institution be trusted to store information? He told us not to trust that Flickr, Google, or even his non-profit would be around, or make the right decisions when it counted. So, his recommendation is not to keep only one copy in one institution. He said that the Library of Alexandria burned, yes, but it had already lost many of the important texts that it had gathered because of institutional neglect: “the new guys didn’t like the old stuff around.”

In 2002, the Inter­net Archive hand­ed 200 TB of their data to the Library of Alexan­dria, which rec­i­p­ro­cat­ed with their col­lec­tion of dig­i­tized Ara­bic mate­ri­als. These kinds of large scale swap agree­ments are crit­i­cal to the redun­dan­cy need­ed to ensure that we do not have anoth­er loss sim­i­lar to what we lost at Alexan­dria, books by Aris­to­tle, the oth­er plays of Euripi­des … At this point, the whole Inter­net Archive is stored in three loca­tions: San Fran­cis­co, Alexan­dria, and Ams­ter­dam. Mr. Kahle acknowl­edged that an earth­quake zone, the Mid­dle East, and a flood plain were per­haps not the best choic­es, but they were not plan­ning on stop­ping there.

For us, as geneal­o­gists, Mr. Kahle pos­es the fol­low­ing ques­tions, which should make us think hard about the respon­si­bil­i­ty we have to take care of our data and doc­u­ments:

  • Can we learn the sto­ries of our ances­tors?
  • Will our descen­dants know our sto­ry?

The Root­sTech con­fer­ence was a great suc­cess. More than 3,000 atten­dees were there, mak­ing it one of the biggest, if not the biggest geneal­o­gy gath­er­ing in the US. Next year, the sec­ond Root­sTech con­fer­ence will be held at the Salt Palace in Salt Lake City, Utah from 2–4 Feb­ru­ary. I plan to be there.


RootsTech 2011: Day 2

Day 2 of Root­sTech start­ed with a spir­it­ed keynote address by Curt Witch­er of the Allen Coun­ty Pub­lic Library on “The Chang­ing Face of Geneal­o­gy.” His point was: The world is going dig­i­tal and going there quick­ly. Get on board, or be left behind.

Brian Pugh of FamilySearch presented a powerful talk on how the new FamilySearch website has utilized cloud services (primarily from Amazon Web Services: http://aws.amazon.com) to provide a world-class website in a cost-efficient manner. The strategy has allowed them to auto-scale their services up and down as needed. Additionally, they are able to create data snapshots to quickly build new prototypes of their site for development and testing. They use Amazon S3 as a shared filesystem for dynamic content, though S3 is not designed to perform well at serving up images and the like, so they cache the data stored on S3 for actual delivery to web browsers.

One thing they are doing on the Fam­il­y­Search web­site is uti­liz­ing Ama­zon Elas­tic IPs to allow for “hot” deploy­ment of new ver­sions of the site. They can build the new ver­sion of the site, test it, and then in a mat­ter of sec­onds, have Ama­zon redi­rect the IP address of the web­site to the new site, while keep­ing the old site in reserve. If they need to fall back to the old site, it’s again only a mat­ter of sec­onds.
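As I understood it, the switch works like a pointer swap. The sketch below simulates that in plain Python under my own assumptions: `AddressRouter`, the host names, and the example address are hypothetical, and nothing here calls Amazon's services. It only shows why both the cutover and the fallback take seconds: neither site is rebuilt, only the address mapping changes.

```python
from dataclasses import dataclass
from typing import Dict, Optional


@dataclass
class AddressRouter:
    """Maps one public address to whichever environment is currently live."""

    public_ip: str
    environments: Dict[str, str]  # environment name -> internal host
    live: str
    previous: Optional[str] = None

    def target(self) -> str:
        """Return the host that the public address currently points to."""
        return self.environments[self.live]

    def switch_to(self, name: str) -> None:
        """Point the public address at another environment, keeping the old one in reserve."""
        if name not in self.environments:
            raise KeyError(f"unknown environment: {name}")
        if name == self.live:
            return
        self.previous, self.live = self.live, name

    def roll_back(self) -> None:
        """Point the public address back at the environment that was live before."""
        if self.previous is None:
            raise RuntimeError("no previous environment to roll back to")
        self.live, self.previous = self.previous, self.live


# Hypothetical hosts and a documentation-range address (assumptions).
router = AddressRouter(
    public_ip="203.0.113.10",
    environments={"old": "host-a.internal", "new": "host-b.internal"},
    live="old",
)
router.switch_to("new")
after_deploy = router.target()    # the new site now answers
router.roll_back()
after_rollback = router.target()  # the old site answers again
```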

They also use Ama­zon MapRe­duce to per­form com­plex com­pu­ta­tions.

FamilySearch engineers have made available a programming language for creating cloud-based systems, at code.google.com/p/lasic. This allows managers of cloud environments to quickly issue “verbs” such as

  • Deploy
  • Con­fig­ure
  • Shut­down
  • Snap­shot

One key thing that Mr. Pugh said about Amazon’s offering in this space is that it is being widely used. Among others, he mentioned the New York Times, Major League Baseball, Netflix, 3M, Activision, ESPN, NASDAQ, The Guardian, and Razorfish (and I can add the New England Historic Genealogical Society, based on the Friday luncheon).

Lat­er in the day, I was able to attend a view­ing of “Who Do You Think You Are?” at the Fam­i­ly His­to­ry Library. They gave out raf­fle items, and I won a copy of Ances­try for the Mac. I then took advan­tage of the Library being open until mid­night, research­ing my Hills, John­sons, and Crows in Howard Coun­ty and Nance Coun­ty, Nebras­ka.


RootsTech 2011: Day 1

Yes­ter­day was the first day of Root­sTech, a new con­fer­ence on geneal­o­gy and tech­nol­o­gy held in Salt Lake City and spon­sored by Fam­il­y­Search Inter­na­tion­al, the geneal­o­gy infor­ma­tion arm of the Church of Jesus Christ of Lat­ter-day Saints.

The con­fer­ence start­ed with a lit­tle bit of con­fu­sion: It seemed that there was a rush to the reg­is­tra­tion table just pri­or to the keynote address. This kind of thing can be min­i­mized, of course, by open­ing reg­is­tra­tion the day before, or by send­ing all the light­weight items (tick­ets to lunch­es and events, lan­yard and badge) ahead of time, and then sim­ply exchang­ing one of those tick­ets for a stan­dard back­pack or lap­top case and any oth­er schwag and late-break­ing news.

In any case, the organizers offered to let people register later; they were not going to check badges for the first event. This was something I definitely took advantage of, since I didn’t want to miss the talks by Shane R. Robison (Executive Vice President and Chief Strategy and Technology Officer, Hewlett Packard), “A World of Information,” and Jay Verkler (CEO, FamilySearch International), “Turning Roots, Branches, Trees into Nodes, Links, Graphs.”

I am not sure what the more genealogically and less technologically minded attendees thought of Mr. Robison’s speech. It was a well-delivered discussion of the future of cloud computing and globalization. I found it fascinating. Of course, with so much of the world so densely populated, and with large population centers (China, India, Brazil) poised to move dramatically toward a middle-class existence, there are serious challenges for global sustainability. I was glad to see that Mr. Robison had sustainability at the center of his group of priorities for Hewlett Packard.

Mr. Verkler got up and tied this all back into geneal­o­gy, point­ing out that cloud com­put­ing is hap­pen­ing in a big way already in the geneal­o­gy space: All of the new Fam­il­y­Search web­site is host­ed on Ama­zon EC2 servers in the cloud, not on servers Fam­il­y­Search owns itself.

Lat­er in the day, I spent some time man­ning the NGS booth, looked around at the exhib­it hall, and attend­ed some talks. IBM has a space in the exhib­it hall with games: non-vir­tu­al (pool, air hock­ey, chess) and vir­tu­al (Microsoft Kinect). They were also giv­ing away mas­sages. I also attend­ed jQuery and Web Ser­vices, a talk by Logan Allred. He was cogent and clear. Over lunch, I heard Chris van der Kuyl of bright­sol­id dis­cuss Fam­i­ly His­to­ry in the Age of the Cloud. He did­n’t real­ly talk about the cloud much, but it was an inter­est­ing romp through the inter­sec­tion of tech­nol­o­gy and geneal­o­gy, and a good intro­duc­tion to bright­sol­id as a com­pa­ny.

Jimmy Zimmerman’s Ruby Library for FamilySearch API was also a great talk, so full of details that it was practically a code review. I regret to say that Barry Ewell’s talk, Digitally Preserving Your Family Heritage, did not impress me. He’s very knowledgeable about the topic, but his speaking style grated on me. He would start a sentence, stop in the middle, say a couple of sentences that were relevant to him, then finish the original sentence. Maybe he was having an off day, or was a little nervous in the lights, but it didn’t make for a good presentation in my opinion. Michael Buck’s Top Ten Web Applications Security Risks (based on the OWASP recommendations) was clear, well thought out, and easy to follow.

At the end of the day, bright­sol­id spon­sored a Night at the Plan­e­tar­i­um. There were nachos, sand­wich­es, and pop­corn, but also IMAX films, as well as all the plan­e­tar­i­um exhibits. A great end to the day … except that I also head­ed to the Fam­i­ly His­to­ry Library, which was open until 11.
