Oracle 12c installed . Getting my #GEEK on this week.
… A Muse — ing
Studying up on Oracle v. 12c. As usual, there are many new features to recommend migrating or deploying to the new version of Oracle. Last blog, I talked about just a few: ILM, ADO, HEAT_MAP and how these buzz-worthy acronyms were related to compression inside the database. Before, I get into today’s topic, I wanted to talk about a bit more about the Automatic Data Optimization (ADO).
I failed to make clear yesterday, that the ADO, automatically relocates your cold data or compresses your data as it ages through its Lifecycle automatically. That is the magic. You define the policies and the database will relocate or compress a segment(s) or a row(s) to save space or to clear space on more expensive hard disk, by relocating to slower/less accessible disk storage media. Pretty nifty idea.
By the way, you may be wondering … 8i, 9i, 10g, 11g, 12c what is the pattern or meaning of these major release versions from Oracle.? Well, “8i / 9i” were from the era, when Oracle was the “Internet” database (you know like iPhone, or i-<Anything>). Then “10g / 11g” were to be the “Grid”. Grid never really achieved buzz-worthy status. Now we have “12c”. It should not surprise you that we are now in the “Cloud” era. So Oracle’s letters are for: Internet, Grid, and Cloud . Now you know.
That Cloud and yesterday’s ADO will figure in today’s blog too. You see, I was recently asked about Greenplum. Could I use it? As is my wont, I took a step back and studied the question. Here is my answer.
MPP – RAC(aka Oracle parallel server)
Full SQL (Postgres)
Full SQL (Oracle, ANSI)
Compression since 11g, ADO/ILM 12c
B-Tree / BitMap Indexes
B-Tree / BitMap Indexes
Parallel Query Optimizer
Parallel Query Optimizer
GreenPlum HD (HDFS)
External Tables using an HDFS
I believe that as an Oracle expert (28+ years from v2.0-11g inclusive), that I could effectively use Greenplum on a project. If you look at the above chart, I think you will see what I am about to explain.
Green is an MPP platform. Very nice acrhitecture. Oracle can sit on top of any architecture (MPP, SMP, or any cluster or Highly Available or Fault-Tolerant Failover set of servers) you can setup.
Both use FULL SQL. That means ANSI compliance and with enhancements (POSTGRES for Greenplum and ORACLE, uh, for Oracle).
B-Tree and Bit Map Indexes for both — yawn old hat. Parallel Query Optimizer – been there, seen that for a while.
Greenplum has JDBC/ODBC/OLE interfaces. Oracle has those too, plus a full complement of Pro*C (or many other languages) embedded pre-compiled 3GL languages. Oracle is well supported by Shell Scripts like PHP or PERL that have their interfaces to Oracle. Slight advantage to Oracle. But the point is, Oracle professionals have done this for more than a decade.
External Tables too are a feature in both databases. GreenPlum HD uses the External Table to provide HDFS access in GreenPlum via SQL or other in-database features. Now I had not previously thought to try and use HDFS with Oracle. But the External Table is precisely the feature I would use. Can it be done? A look at Oracle’s documentation answers that:
CREATE TABLE [schema.]table ( column datatype, ... ) ORGANIZATION EXTERNAL ( TYPE ORACLE_LOADER DEFAULT DIRECTORY directory ACCESS PARAMETERS ( PREPROCESSOR HDFS_BIN_PATH:hdfs_stream access_parameters ... ) LOCATION (file1,file2...) );
So I recommend that companies fell free to utilize Oracle consultants on Greenplum databases. There is an awful lot of overlap that the Oracle specialist can leverage from his/her background and transfer to the Greenplum database.
Of course, for companies without Greenplum, it looks like you can use many of the same features already in Oracle including using HDFS filesystems with External Tables.
So get to that BigData, your friendly Oracle expert can help you.
Stanczyk is a bit uncertain. It seems like every day there are some new vital records indexes or even actual register scans themselves made available from congregations all over the Central European — Jewish, Catholic (Roman & Greek), Orthodox, Lutheran/Evangelical lands that make up Poland or a land that was once within the Polish-Lithuanian Commonwealth (the 1st Republic) or any of the variations between those times. So I thought I would step back and take stock of what was available. Yes, I know this will be out of date by tomorrow. But here is a quick & dirty, handy reference list of where to go looking. Clip & Save.
AGAD Księgi metrykalne – Eastern Borderlands (Ukraine, Russia Jewish Pale, etc.) —
(scans by Sygn.: http://www.agad.gov.pl/inwentarze/KMLw301.html#idp1765776 )
Prussian Poland Parishes
BASIA - http://www.basia.famula.pl/en/ – State Archives in Poznan, the Wielkopolska Genealogical Society (WTG “Gniazdo”) project.
Poznan Marriage Project – http://poznan-project.psnc.pl/
Pomorskie Towarzstwo Genealogiczne - http://www.ptg.gda.pl/
All Poland & Eastern Borders (PTG)
METRYKI (parish register scans)– http://metryki.genealodzy.pl/
Szukajwarchiwach (Poland’s National Archives online) - http://szukajwarchiwach.pl/
This is the latest project and is shooting to have 5.8 Million records by the end June (this month) scanned and on-line by Polish Archive or National Museum.
Jewish Record Indexing (JRI) – http://www.jewishgen.org/jri-pl/jriplweb.htm
The venerable project with new life provides indexes to registered users (free) and then you can purchase the actual church record. Great for Jewish Pale & Russian Poland, plus so much more.
Metryk.GenBaza.pl – http://metryki.genbaza.pl/genbaza,list,4,1 (AP GRODZISK). Archive in Grodzisk Mazowiecki (Russian Poland parishes near Warsaw).
Besides the 5 parishes below, you might want to have a look at holdings for:
Austria, Germany, Russia & Ukraine
|Poland, Częstochowa Roman Catholic Church Books, 1873-1948||Browse Images||14 Feb 2013|
|Poland, Gliwice Roman Catholic Church Books, 1599-1976||Browse Images||14 Feb 2013|
|Poland, Lublin Roman Catholic Church Books, 1784-1964||99,510||14 Feb 2013|
|Poland, Radom Roman Catholic Church Books, 1587-1966||18,916||21 Apr 2013|
|Poland, Tarnow Roman Catholic Diocese Church Books, 1612-1900||1,002,155||6 Jan 2012|
Did I miss any? Email Me … Proszę !
http://regestry.lubgens.eu/news.php – from Valerie Warunek (PGSM). Database of Indexed church records (birth/urodzenia, marriage/malzenstwa, death/zgony) from Lubelskie wojewodztwo. No scans (skans), but it does have record (akt) #’s.
Stanczyk, has been sifting through the Index created on genealodzy.pl in their Geneszukacz database. Alright, only the Births Index, so far.
I see they have a total of nearly 7,300 people from those years (1875-1908) in their Birth Index. From Adam … Żyp . There were 58 ELIASZ in their index.Notice they used ELIASZ and not ELIJASZ. I found that interesting. They removed ‘J’ when they produced the index. Was that an error? Or was the indexer an expert? Because, in my heart of hearts, I believe the name (at least back to 1690) was ELIASZ.
It was only since 1869 when the Russian Empire forced Poland to keep records in Russian (Cyrillic) that the ‘J’ appeared from the Russian character ‘я’ (Ya) that ELIASZ became элияшъ . элияшъ is transliterated in a Latin alphabet as ‘Elijasz’.
I only wanted to mention this as while I believe the translated properly produced the index with respect to ELIASZ; You will need to realize that finding the record in Russian/Cyrillic, you will need to look for a different translation (i.e. ELIJASZ/элияшъ) in the indexes and the actual church records.
So now I have an index of ELIASZ born in Pacanow in the years 1875-1908. Now what? I compared the list of 58 with what I already had/knew. I saw an overlap of 22 people. So I have 36 new ELIASZ to resolve and add into the family tree. My options are:
The year range 1875-1908 is not completely in LDS microfilm. Although 1875-1884 is in LDS MF #’s:
So doing research in a local Family History Center or at the Family History Library (Salt Lake) is not an option for the remaining 36. So I now have better options for remote research.
My List of 58 ELIASZ.
My ancestral village, PACANOW, was indexed for BIRTHS (1875-1908). I was able to verify it was correct with my grandfather (whose Birth Record I have) and a few others. I also found some I did not know about !!! I only wish they had the images (like in METRYKI database). Thank you: Wojciech Liśkiewicz (who I think was the indexer)!
Later in the day they(he) also added MARRIAGES(1875-1908) too.
Did you read Stanczyk’s blog from yesterday commenting on the AP Breaking News of the Russian Secret Service having tapes of Zubeidat and Tamerlan Tsarnaev (Царнаев) discussing jihad. I believe the Russians also indicated they had tape of the mother speaking to other person of interest to the FBI also. So go read yesterday’s blog then come back …[I’ll wait for you]
Perhaps you may have missed a link from yesterday on the fine work by Business Insider. They are a fine source of news for what happens in Europe, for those who want yet another viewpoint on which to be informed. You should click on that link (also here) if you wish to see pictures of the family before they came to the US. After I had read that I thought I would write a blog on the school register, which seems to confirm the children (only four, although I have read another account of a fifth child, an unnamed married daughter in Chechnya). Alas, the AP Breaking News story over ran my composing a story of a school register. But here it is, albeit belatedly …
Line 9 - Bela Tsarnaeva, born 1988, female, Chechen, Arrived In School 2001, 7th grade, from Kyrgyzstan
Line 10 - Alina Tsarnaeva, born 1990, female, Chechen/Avar, Arrived In School 2001, 5th grade, from Kyrgyzstan
Line 11 - Dzhokhar Tsarnaev, n/a, male, Chechen, Arrived In School 2001, 1st grade, from Kyrgyzstan
Line 12 - Tamerlan Tsarnaev, 1986, male, Chechen, Arrived In School 2001, 8th grade, from Kyrgyzstan
The handwriting is in Cyrillic. So if you are trying to learn to read Russian, here is a modern version (much easier than old church registers in Old Russian characters/handwriting) — albeit still difficult for Latin Alphabet readers to discern.
So all children entered the Dagestan school system in 2001. They all came from Kyrgyzstan. We now have credible ages for the daughters and confirmation on their names (Bela/Bella, Alina/Ailina).
It is interesting that one daughter (Alina) had a notation of Chechen/Avar. Her mother was an Avar. Previously, all children were listed as being born in Kyrgyzstan. Why does this one child have Avar by ethnicity? Aren’t all of the Tsarnaevich children Chechen/Avar really — since they all have the same mother.
Even though Dzhokhar’s birth was not given, we know his birthdate from other documents (see also the first story) which indicates: 22-July-1993.
So for those genealogists following the genealogy, those are the best birth dates (or birthyears) so far that I have seen in news accounts. This data is also from a good source document, not just someone’s notes (or memory). Genealogy in the news.
week ago Stanczyk wrote about Cholera and its 5th Pandemic. This week I was searching for ‘Status Animarum‘ (Latin for ‘State of the Souls’). That is a type of church census. Often these censuses include three generations. I stopped in a list of Google results. I stopped for Cleveland’s St. Stanislaus’ 1918 Status Animarum.
Now Stanczyk has a branch of the family in Cleveland and at the St Stanislaus parish. 1918 was the tail end of World War I. It was also the main year of the last epidemic, … the flu, sometime called the Spanish Flu. More people died from the flu than from the war. Pestilence won again. I am sure war had something to do with the pestilence and people weakened immune systems.
World War I was different then almost all other wars in US history. We had citizens training and fighting in two armies against a common enemy. You had Polish-Americans serving in the US Army and you had Polish Americans serving in the French colors (Blue Army), Polish-led (General Haller), Canadian-trained, with men from the USA who were Polish ethnically. So a world war and pestilence both ran amok.
This Status Animarum was not the kind like in European parishes that listed two-three family generations and their home, census-like. This was a Status Animarum Report — summarized at the parish level …
Blessedly, with about one thousand men serving in the military (3/4 USA, 1/4 Polish) and the Flu Pandemic, only 18 men had died!
Statistically, Cleveland’s St. Stanislaus had lucked out. The other statistic, 25% of the men served in Haller’s Army (aka the Blue Army). This device allowed the US men to have a presence in the World War, before the USA was ready … emotionally to end its isolationism and enter the war itself. I wonder if this percentage held true in all Polish parishes in the USA?
What desperate times were those? And yet, is today not like a hundred years ago? We shall see. It has been nearly a century since the last pandemic. Will the Chinese bird-flu be the next pandemic? Time will tell. Certainly, there is plenty of warfare about the globe and plenty of sabre rattling.
Another Wordless Wednesday blog post. (Hmmm… somewhat wordless).
1892 Cholera Epidemic … Russian Empire lost > 250,000 people (note the red boxes)…
Google Books - The Cholera Epidemic of 1892 in the Russian Empire: With Notes Upon …
Fulton History - Mount Vernon NY Daily Argus September 27th, 1892
Trove Digitised Newspapers - Brisbane Courier September 14th, 1892
Stanczyk has previously published a map (or two) of Mt. Olivet, Detroit cemetery showing the various sections. Today, I am continuing the Mount Olivet meme to publish all of the section maps I have.
332 – Buczkowski
443 – Orzel
553 – Rozanski
595 – Katolski – Koswicki
623 – Wroblewski
724 – Morawska
Let me hasten to add that Stanczyk is NOT related to the above name samples. You need to follow the link to Mt Elliott cemetery association for more info.
From a forum at genealodzy.pl Stanczyk saw a PDF document mentioned. When I looked at it, I saw it was an inventory (in progress) of the holdings of the National Archive in Kielce (AP w Kielcach). So I have produced a condensed version of their work-in-progress. Yes, most of these are related to Stanczyk’s family tree. For their complete inventory list (which was 424 items), see the link (URL) at the bottom of this table.
Now I mention this particular AP (National Archive) because it is the archive that covers the Russian-Poland partition that my ancestors were from. There may be other inventories for other archives.
|#||Nr zespołu – # Rec. Group||Stan na dzień (as of) 2013-02-01 Nazwa||daty skrajne – date range||księgi – books||metry||ilość sfotografowanych ksiąg – number of books photographed||ilość zrobionych zdjęć – number of photos taken||Braki – deficiencies|
I tried to provide a reasonable translation (using Google Translator with some hand-tweaks) of the Polish Column Headings in English.
LEGEND (3rd Column):
A suffix of moj or -moj indicates Jewish records.
pr – Orthodox Catholic
ew – Evangelical
gr – Greek Orthodox
The rest (or any with rk) would indicate Roman Catholic.
I believe the Column Heading METRY indicates the actual shelf space of storage this record group physically occupies. I believe the units would be in Meters (m). Remember Europe uses ‘,’ where America uses a decimal point. Some appear to be missing the decimal point. In most cases, it appears that Excel has translated the comma to decimal point, but if you look at the source document, you should be aware of this cultural difference.
1 April 2013 – Dateline Philadelphia –
Yes, this jester knows its April Fool’s day; But who better than a jester to speak truth to the people (uh … genealogists, librarians, archivists, & researchers) on this day? The first of April has become the impetus for backup and preservation.
You need only look at today’s world of crazy dictators or Mali terrorists to see that cultural/historical artifacts can disappear in an instant. Cyberwarfare can claim your harddisk. The cloud could crash or hurricane Sandy can happen (please donate to Ellis island Foundation to help in that restoration effort). Libraries and Archives need to safeguard your artifacts too! Are you motivated yet? Good!
There are backup solutions, including some free options to the “cloud”. Apple even provides a free 5GB iCloud. So save your GEDCOM file. If you still have free space then backup pictures or scans that are CRITICAL. You can save/backup to media: CDs, USB thumb drives, etc. But be aware that backup to electronic media needs to be refreshed yearly to avoid stranding your backups on outmoded technology (i.e. 8Track tapes or even floppy disks).
Be careful out there and have a Happy April Fool’s Day!
tanczyk, has been busy revisiting the Metryk (metrical, vital records) images from genealodzy.pl of the various parishes/synagogues [hereafter I just use ‘parish’ as shorthand for ‘parish/synagogue’]. As my blog, Waiting For Polish Archives 2.4 M Scans (March 18th, 2013), indicated, I have been exhausting the possibilities for Biechow & Zborowek parishes in the Buski (Busko-Zdroj) powiat. The images are clearer, so I am replacing my existing images with these much better images. In some cases, I have found that the images of the Polish paragraph format provide me with additional details over what may have been available via only a Latin Box format copy that I might have previously had. At the very least, I have corrected a few mistakes of translation due to unreadable portions from prior microfilm I have read from/taken pictures of. So I strongly encourage others to make this effort.
I have been using the Metryk database and looking at the images/scans. Sometimes you have to look at dozens of images because there is NO index. But most of the collection (post 1812) have indexes. If you see SKU (that means index/skorowidz of births/urodziny), likewise for SKM (for marriage), and SKZ (for death) indexes. Sometimes indexes spread across multiple pages, so you may see SK1, SKa (names begining with the letter ‘A’) or SKU1, etc. SO use these indexes to look for your family names, then just load up the scan of the akt (record) number for your ancestor — no need to search through a multitude of images.
I have also used Geneszukacz as another kind of index to search for family names. These indexes are nice because I can catch ancestors getting married (or dying or giving birth) in another parish that I might not have known to check. If this previously unknown parish is one that has scans, then I go directly to the year/event for that parish and go to the akt specified in Geneszukacz!
So that is all great and I exhort you to do this.
But these new, previously unknown parishes. Where are they? How far away from the ancestral village are they? That is when I need a gazetteer (check out Stanczyk’s Gazetter page) or a map. If you have not been to the Polish War Map Archive (Archiwum Map Wojskowego), then today’s blog is your reason to do so. I have a map on my wall of my ancestral villages. The map’s name is: STOPNICA_PAS47_SLUP32. In fact, I use their MAP INDEX, 1:100,000 scale map tiled in squares (http://igrek.amzp.pl/mapindex.php?cat=WIG100). Please NOTE these map images are from about 4MB to 7MB in size. Make sure you are at a Free WiFi cafe where you can use a high-speed and the large band-width for the map images you download.
When you see, PAS think ROW and when you see SLUP think COLUMN. This is a big Cartesian Grid (or computer types can think 2d-array). It turns out that STOPNICA_PAS47_SLUP32 has: Biechow, Pacanow, Ksiaznice, Zborowek, Swiniary, Szczucin, Beszowa, Olesnica, and STOPNICA. In fact, that one map has many more parishes than those that I enumerated. I have a small snippet of the Map Index below (you can click on the image and it will take you to the actual map index):
So I found an ELIJASZ ancestor in Koniemloty parish getting married, who was from PACANOW parish. Now from the Metryk web app, I knew Koniemloty was in STASZOW powiat. So I go to the Map Index and look at the grid near STOPNICA (P47_S32) and voila, STASZOW is the box due north of STOPNICA in PAS46_SLUP32. If you cannot locate you powiat that way, then you must drop back to MAPA.SZUKACZ.pl (an interactive map that I have raved about before) and look for KONIEMLOTY (do not need to use diacriticals) to get the relative feel that it is north or east (or north-east). So any way, STASZOW_PAS46_SLUP32 is the map for KONIEMLOTY parish. Notice PAS46 is one row less than PAS47 (of STOPNICA). PAS decreasing is going north, PAS increasing is going south. Going east from STASZOW, we see the SLUP increases to SLUP33 (SANDOMIERZ) or going west the SLUP decreases to SLUP31 (PINCZOW). So now you can now work with the Map Index using the cardinal directions by adding/subtracting to/from the rows/columns.
P.S. Since this is the Passover (Pesach) / Easter (Wielkanoc) season, let me honor my wife (Tereza) by pointing out that her paternal grandfather, Benjamin Solomon, had as a birth village, Proskuriw (aka PŁOSKIRÓW, Хмельницький/Khmelnitski – now in Modern Ukraine). This village is shown in the lower right-hand corner of my map snippet (PAS51_SLUP44).
The Polish State Archives via the National Digital Archives (NAC) recently announced the plans to release 2.4 Million scans of metrical book records online. In order to use this database (http://www.szukajwarchiwach.pl/) you will need to know the Archive’s number of the Regional State Archive that you are interested in (i.e. the Archive that has the data/scans of your ancestral village).
Stanczyk could not locate such a list. So this jester created one. Knowing that the IZA had a drop down menu of Archives, I went there. Sure enough it had the archives … and the archive’s number. But it was a drop down menu and I could not copy/paste from the drop down menu. How could I get the data?
I put on my propeller beanie and it occurred to me that the HTML of the web page would have that data for the menu. So I looked at the page source and voila. After some editing to remove HTML tags I built the required list for all of to use with the new 2.4 Million records in http://www.szukajwarchiwach.pl/.
The list is below. One note, I notice that the list is out of date in the sense that there are regional archives that no longer exist. For example, I know to my own pain, that the Kielce regional state archive (Kielce Panstowe) office (oddzial) in Pinczow (#23 in the list) was closed and the records moved to Kielce (the main office, #21). So please take note of this, as I am sure it happened to other offices as well.
Here is the PDF of the listed parishes/synagogues being scanned (for March):
1 Archiwum Glówne Akt Dawnych (1)
2 Archiwum Akt Nowych (2)
4 Archiwum Panstwowe w Bialymstoku (4)
5 Archiwum Panstwowe w Bialymstoku Oddzial w Lomzy (5)
6 Archiwum Panstwowe w Bydgoszczy (6)
8 Archiwum Panstwowe w Czestochowie (8)
9 Archiwum Panstwowe w Elblagu z siedziba w Malborku (9)
10 Archiwum Panstwowe w Gdansku (10)
11 Archiwum Panstwowe w Kaliszu (11)
12 Archiwum Panstwowe w Katowicach (12)
13 Archiwum Panstwowe w Katowicach Oddzial w Bielsku-Bialej (13)
14 Archiwum Panstwowe w Katowicach Oddzial w Cieszynie (14)
15 Archiwum Panstwowe w Katowicach Oddzial w Gliwicach (15)
16 Archiwum Panstwowe w Katowicach Oddzial w Oswiecimiu (16)
17 Archiwum Panstwowe w Katowicach Oddzial w Pszczynie (17)
18 Archiwum Panstwowe w Katowicach Oddzial w Raciborzu (18)
20 Archiwum Panstwowe w Katowicach Oddzial w Zywcu (20)
21 Archiwum Panstwowe w Kielcach (21)
22 AP w Kielcach Oddzial w Jedrzejowie – Oddzial zlikwidowany, akta przeniesione do AP K… (22)
23 AP w Kielcach Oddzial w Pinczowie – Oddzial zlikwidowany, akta przeniesione do AP Kie… (23)
24 Archiwum Panstwowe w Kielcach Oddzial w Sandomierzu (24)
25 AP w Kielcach Oddzial w Starachowicach – Oddzial zlikwidowany, akta przeniesione do A… (25)
26 Archiwum Panstwowe w Koszalinie (26)
27 Archiwum Panstwowe w Koszalinie Oddzial w Slupsku (27)
28 Archiwum Panstwowe w Koszalinie Oddzial w Szczecinku (28)
29 Archiwum Narodowe w Krakowie (29)
30 Archiwum Narodowe w Krakowie Oddzial w Bochni (30)
31 Archiwum Narodowe w Krakowie Oddzial w Nowym Saczu (31)
33 Archiwum Narodowe w Krakowie Oddzial w Tarnowie (33)
34 Archiwum Panstwowe w Lesznie (34)
39 Archiwum Panstwowe w Lodzi (39)
41 Archiwum Panstwowe w Lodzi Oddzial w Sieradzu (41)
42 Archiwum Panstwowe w Olsztynie (42)
45 Archiwum Panstwowe w Opolu (45)
48 Archiwum Panstwowe w Piotrkowie Trybunalskim (48)
49 Archiwum Panstwowe w Piotrkowie Trybunalskim Oddzial w Tomaszowie Mazowieckim (49)
50 Archiwum Panstwowe w Plocku (50)
51 Archiwum Panstwowe w Plocku Oddzial w Kutnie (51)
52 Archiwum Panstwowe w Plocku Oddzial w Leczycy (52)
56 Archiwum Panstwowe w Przemyslu (56)
58 Archiwum Panstwowe w Radomiu (58)
59 Archiwum Panstwowe w Rzeszowie (59)
60 Archiwum Panstwowe w Rzeszowie Oddzial w Sanoku (60)
61 AP w Rzeszowie Oddzial w Skolyszynie – Oddz. zlikwidowany, akta przeniesione do AP Rz… (61)
62 Archiwum Panstwowe w Siedlcach (62)
63 Archiwum Panstwowe w Suwalkach (63)
64 Archiwum Panstwowe w Suwalkach Oddzial w Elku (64)
65 Archiwum Panstwowe w Szczecinie (65)
66 Archiwum Panstwowe w Gorzowie Wielkopolskim (66)
67 Archiwum Panstwowe w Szczecinie Oddzial w Miedzyzdrojach (67)
68 Archiwum Panstwowe w Szczecinie Oddzial w Stargardzie Szczecinskim (68)
69 Archiwum Panstwowe w Toruniu (69)
71 Archiwum Panstwowe w Toruniu Oddzial we Wloclawku (71)
72 Archiwum Panstwowe m.st. Warszawy (72)
73 Archiwum Panstwowe m.st. Warszawy Oddzial w Grodzisku Mazowieckim (73)
75 Archiwum Panstwowe m.st. Warszawy Oddzial w Lowiczu (75)
76 Archiwum Panstwowe m.st. Warszawy Oddzial w Mlawie (76)
78 Archiwum Panstwowe m.st. Warszawy Oddzial w Otwocku (78)
79 Archiwum Panstwowe m.st. Warszawy Oddzial w Pultusku (79)
82 Archiwum Panstwowe we Wroclawiu (82)
83 Archiwum Panstwowe we Wroclawiu Oddzial w Jeleniej Górze (83)
84 Archiwum Panstwowe we Wroclawiu Oddzial w Kamiencu Zabkowickim (84)
85 Archiwum Panstwowe we Wroclawiu Oddzial w Legnicy (85)
86 Archiwum Panstwowe we Wroclawiu Oddzial w Lubaniu (86)
88 Archiwum Panstwowe w Zamosciu (88)
89 Archiwum Panstwowe w Zielonej Górze z siedziba w Starym Kisielinie (89)
90 Archiwum Panstwowe w Zielonej Górze Oddzial w Wilkowie (90)
91 Archiwum Panstwowe w Zielonej Górze Oddzial w Zarach (91)
93 Archiwum Panstwowe w Gdansku Oddzial w Gdyni (93)
307 Muzeum Pierwszych Piastów na Lednicy (307)
309 Archiwum Uniwersytetu Mikolaja Kopernika w Toruniu (309)
324 Stowarzyszenie Archiwum Solidarnosci (324)
325 Zarzad Oddzialu Zwiazku Sybiraków w Lodzi (325)
327 Glówna Biblioteka Lekarska im. Stanislawa Konopki (327)
701 Instytut Józefa Pilsudskiego w Ameryce (701)
702 Polski Instytut Naukowy w Nowym Jorku. Biblioteka i Archiwum im. A. Jurzykowskiego (702)
One of the difficulties of locating records or data on a Polish (or Czech, Russian, Lithuanian, Ukrainian, etc.) ancestor is the many ways a name can be spelled, misspelled, mistranscribed, indexed, etc. in a database. This is why you have to be creative when trying to locate your data.
Here is an example (from my paternal grandfather’s birthplace): Pacanow. That is the English rendering. In Poland it would be written as Pacanów. Now diacriticals aside, how many possible ways can I find Pacanow in Ellis Island (probably similarly for Ancestry.com as well)? OK, you asked …
Those are the ones I have found so far. That is 47 combinations! Now admittedly reading the handwriting from those ship manifests is difficult even when I am pretty sure what is being written; So I can feel for the transcribers / indexers who harvest the data and do the data entry into some database.
Now, no searching by American Soundex, Daitch-Mokotoff Soundex, or even Beider-Morse phonetic matching or even using wild-card searches, not even if you had regular-expression searches (like Oracle databases have) would I have found all of those. I do not know what to tell you to do. Be creative and persistent. Look at adjacent letters on a keyboard (for mis-typings) as data is entered. Look for letters that are swapped (i.e. Eliasz vs. Elaisz) — mistyped or dyslexic. Just keep looking. I found ‘Bacanow’, because I said what might an handwritten letter ‘P’ look like to somebody? Of course, ‘R’ and ‘B’ suggested themselves to my mind. No ‘Racanow’, but sure enough out popped a ‘Bacanow’. So you never know.
Now Stanczyk mentioned Pacanow, because I thought I was being slick and said, “What if I cannot think of all the ways a NAME can be misspelled?”. My answer was, “I know, I’ll just search on everybody coming over from the village of P-A-C-A-N-O-W.” Of course, as you might have guessed now I had a meta-problem because now I had to come up with all of the ways that Pacanow could appear. Well like the riddle, “How many licks does it take to get to the center of a Tootsie-Pop?”, I have an arbitrary answer … 47.
I now have two spreadsheets. My first spreadsheet is my work-in-progress on the ZASUCHA of Niagara Falls SNA. The second spreadsheet I have is a rather large spreadsheet of all of the names from Pacanow (and truth be told Biechow, Piestrzec, Wojcza, …) and all of the surrounding villages that came through Ellis Island that I have found so far [plus a few mis-matches].
Stanczyk has been thinking about GEDCOM a lot these days. As you may know, GEDCOM is the de facto standard format for a genealogical family tree file, in order for it to be shared amongst the many genealogical software programs / websites / apps. Most genealogy programs still use their own proprietary format for storing data but will import / export the data in the GEDCOM standard for you to exchange data with another program or genealogist.
Did you catch the phrase ‘de facto standard’ ? OK it is NOT an open standard maintained by ISO or ANSI standards organizations. But it is widely supported and in fact you should NOT buy or use software that does not support the export and import of GEDCOM files!
Well we are coming up on RootsTech 2013 and my mind is turning back to the technical part of genealogy again!
Today’s blog is about the GEDCOM used by Ancestry.com. Were you aware that you can export your family tree from Ancestry.com? You can by selecting/clicking on ‘Tree Settings‘ under the ‘Tree pages‘ drop down menu (Tree Settings will be the second from the bottom in the menu list). If you click on ‘Tree Settings’ you will see a screen similar to:
Notice that after you click on the ‘Export tree‘ button, that you get a new button named, ‘Download your GEDCOM file‘ in that same place.
In all likelihood if you click on the ‘Download your GEDCOM file‘ button you will get a file in your Downloads directory on your local hard drive. It will have a name of:
Now the phrase ‘<your-family-tree-name>’ will actually be something like ‘Eliasz Family Tree.GED’ . So your Downloads directory will have a similar named file (complete with blanks in the file name). The size of the file will be dependent on how many individuals, families, sources, etc. that you have recorded in your family tree. Figure on a file size of 2MB for about 1,100 people.
Now this file you just downloaded from Ancestry.com is really just a plain text file with a set of standardized ‘tags’ defined by the GEDCOM standard. Software vendors are free to define their own custom tags too. Although CUSTOM tags must begin with an underscore (‘_’). I was curious as to how well Ancestry.com implements/adheres to the GEDCOM standard, so I wrote a little program (in PERL for you programmer types) to analyze my GEDCOM file that I just downloaded.
My program, read_gedcom.pl, spits out a slew stats about the GEDCOM including the tags used. As you may be able to see from the screenshot, there sorted at the end were 5 custom tags:
_APID, _FREL, _MILT, _MREL, _ORIG
These names do not have any meaning except to Ancestry.com and their website’s program(s). What you also see are that in 48,538 lines (in the GEDCOM file downloaded), that 5,158 lines have one of these five custom tags. Normally, I will just ignore these tags and import the GEDCOM file into my laptop’s genealogy software (REUNION, RootsMagic, PAF, etc.) and let that software ignore these non-understandable tags and within seconds I have my Ancestry.com family tree imported in to my computer’s genealogy software. That is fine — no problems.
But what do you think happens you if turn right around and upload that GEDCOM file into your RootsWeb family tree? If you use RootsWeb, then you know you get a LOT of _APID notes across all of your ancestors and sometimes, if you have many facts/citations for any ancestor, then the RootsWeb page for him/her will be horribly marred by all of these _APID tags!
Remember I said the GEDCOM file is a TEXT file. As such it can be edited by whatever your favorite text editor that you use. If your editor does global search/replace, then you can easily remove these CUSTOM tags (_APID, etc.). That will make your RootsWeb family tree individual pages look MUCH better.
Now I know what you are thinking. Do NOT go editing your GEDCOM file! I agree. Make a copy of your GEDCOM file and edit the copy of the downloaded GEDCOM file to remove the lines with ‘_APID’ on them. You can remove all custom tags, but I just bother with the _APID which are so irksome. If your editor can remove the lines with ‘_APID’ then that is what you should do. But if all your editor can do is replace the lines that have _APID on them with a blank line then that is OK too. Make those edits and save the edited (copy) file. The blank lines seem to be ignored by RootsWeb — thank goodness.
Now you can upload the edited file, with the _APID custom tags removed to RootsWeb and your family tree will again look the way it used to before, without these irksome custom tags.
Next time I will tell you what I found when I looked closely at what ANCESTRY.com was putting into the downloaded GEDCOM file.
Stanczyk ‘s position has been overrun! I was trying to write a blog, but the course of events has been running at EXTREME Internet speed so much of this blog post may be “old news” to you — but in case its not, this is very exciting news!
These are actual church record images! I hope they plan on digitizing records from the Kielce Archive (please do PACANOW, BIECHOW, SWINIARY, BESZOWA, ZBOROWEK, KSIAZNICE and STOPNICA parishes).
Can anyone detail the plans for JUNE yet? Unfortunately, the 1.37 Million records in March are NOT from the KIELCE archive or any parish where Stanczyk’s ancestors resided?
Do not forget about GENETEKA database in the meantime:
Thanks in advance for any answers from our genealogists resident in Poland!
This jester has been a big proponent of Ubiquitous Genealogy – i.e. genealogy is portable and everywhere. I have used the Ancestry App for a long while and am well satisfied. They use a concept of synching the App with your tree and their website. Now that the kinks are worked out, I am well satisfied. There are also MyHeritage and Heredis Apps too. These do not synch over the “air”. You need to use your iTunes application on your desktop/laptop to move files into the App’s “sandbox” via synching your iPhone/iPad with the laptop/desktop over the iPhone/iPad cable. Tethered synching is ok but a hindrance.
Ok so the new App on the block is an offering called RootsMagic. Stanczyk likes the Roots Magic laptop application as a full blown offering for working on your genealogy and documenting the tree and finding data on the Internet and keeping track of to-do lists, publishing your tree on CD/Web and all sorts of work that you do when your research spans years (or decades) – does anyone ever finish their genealogy? Its modern and uses Universal Character sets (so us Slavic Researchers can use our slashed Ł’s or Cyrillic Я’s) and other features that the Internet Genealogists have grown up with. So I was hopeful when I received an email from Roots Magic touting their iOS offerings – Its free!
The App starts with the familiar Roots Magic splash screen that you may have grown accustomed to from the laptop application. You are then presented with a list of files from their sandbox (ugh, tethered synching). Once you select a file to work with, your family tree is presented in a Pedigree form (with three generations visible on iPhone/ four generations on iPad). At the top left is a green/white button with three lines (see image) that will allow you to pick a particular person with whom you wish to work upon. At the bottom of the screen are four buttons:
Files, Views, Lists, Tools
Files – Lets you select the family tree file you want to on from your Device or from DropBox (a cloud-based file storage service). It also has HELP (files??) which tell you how to use your Device or DropBox to get a file loaded into the App. Sadly, the RootsMagic app does not read standard gedcom (ged) files. It only reads files with rmgc extension (i.e. created by Roots Magic laptop application). However, it does load their database extremely fast from those rmgc files.
Views – Lets you choose to view the data in a PEDIGREE tree or a FAMILY tree or in a DESCENDENTS outline or in the detailed FACTS (events), NOTE, direct family members of the current INDIVIDUAL. I prefer working in FAMILY (as seen in image) view mode, then switching to INDIVIDUAL view mode for any details on that person. Clicking on NOTE really gives you access to NOTE(S), SOURCE(S), and MEDIA for that individual (and a BACK button at the top to return to INDIVIDUAL view mode).
Lists – This just gives a list of your: Sources, To-Dos, Research, Media, Addresses, Repositories, Correspondences, and PLACES. I liked places (which showed that this jester really needs to make his Places (Locations) conform to some kind of standard).
Tools – Date Calculator, Relationship Calculator, Soundex Calculator, and Calendar. Unimpressive to say the least. Lest you get your hope up, the Calendar tool only displays the Calendar for a Month/Year of your choice [I did not verify the Julian/Gregorian boundary to see if it calculates a proper month calendar for dates before 1582]. It was not worth the effort as I did not see why I would want to see what day June 3rd, 1700 would fall on (Thursday) if you are eyes are young or your glasses are a good prescription to read the day name. Otherwise, you not notice the day names on an iPhone [perhaps a black font, instead of gray, would give better contrast]. The Soundex is only American Soundex – why not Daitsch-Mokotoff or Bieder-Morse codes too? Really, we Slavic researchers get short shrift in the software world. Never fear, just create a desktop icon of Steve Morse’s Soundex page to see all three Soundex/Pattern Matching methods for your family names.
iPad vs iPhone
For some reason the iPad interface treats the buttons (Files, Views, Lists, Tools) differently on the iPad. That was a bit confusing until I got used to the difference. Rotating the iPad to landscape, also brings the Surnames/Search view along side whatever view you are in. The Calendar is a bit easier to see on the iPad, but I’d still like to see the day name text in black (or at least a MUCH darker gray).
The app is strictly for viewing your family tree (et. al.). There are no tools for modifying the tree for re-import into the desktop application. Shortcomings aside, it is still a very good first effort by Roots Magic. If you have the Roots Magic laptop application, then download the free app for your smartphone or tablet and go Ubiquitous. If you do not have the Roots Magic laptop application and do not have a way to get your ged converted to Roots Magic format (rmgc) then do NOT bother to download the app – you will not be able to use it.
P.S. Do you spell “DESCENDENT” – as D-E-S-C-E-N-D-E-N-T or D-E-S-C-E-N-D-A-N-T ? Both spellings are correct, but I guess I use “DESCENDANT” all of the time and so the Roots Magic use startled me.
I also would love to see the REUNION app (the Mac Software vendor) make their iOS App free or low-cost – then I’d review it here too. I am a BIG Apple eco-system fan and as such have used REUNION Mac software for a very LONG time. I would be remiss not to mention that REUNION App does exist, but its cost is a bit steep relative to the other iPhone genealogy APPs in this article.
The changes are really subtle. I am not certain whether I have seen them or not. The tree displays fast and it appears animated. Other displays changed subtly not much here, but all good. UPGRADE (click on image).
The image by the way lists the new features for version 4.0.2 (as well as previous release 4.0). Image as text.
Notice the snazzy new iTunes/App Store interface (a part of iOS6).
Stanczyk has been busy writing a new two-article blog post on my third success with the Social Network Analysis (aka Cluster Genealogy) technique. I hope to complete that soon. As it so often happens, an initial foray kept expanding as a result of the connections.
Watch this space !
P.S. October is Polish American Heritage Month. Here is what the Polish American Center (Philadelphia) is doing … Don’t forget on Columbus Day to celebrate our native son . The book came out in Polish in May 2012 and has now been translated into English — They are looking for a book publisher !!!