Stransky Family Tree and more

     
 
 
     

How SFT compares to GEDCOM

     
 

Welcome. I have created many large XML databases that store archives of text, images, and PDFs, and I have made them very efficient. So the thought was to create a GEDCOM-style standard for myself and others to use if they wish.

GOALS:

1. Compact tags.
2. Compact recurring text in nodes.
3. Compact the various events and sources into one simplified node.

To start, let's view the GEDCOM household record, which is used for family groups and children. I will call my format Simplified Family Tree XML, or "SFTxml".

GEDCOM:
FAM familyKEY
HUSB husbandKEY
WIFE wifeKEY

......
0 @F2@ FAM
1 HUSB @I6@
1 WIFE @I7@
1 CHIL @I18@
0 @F3@ FAM
1 HUSB @I11@
1 WIFE @I12@
1 CHIL @I28@
1 CHIL @I26@
1 CHIL @I27@
0 @F4@ FAM
1 HUSB @I6@
1 WIFE @I33@
1 CHIL @I45@
.......
SFTxml:
gid = GroupID
hid = HusbandID
wid = WifeID


<xml>
.....
<data><gid>2</gid><hid>6</hid><wid>7</wid></data>
<data><gid>3</gid><hid>11</hid><wid>12</wid></data>
<data><gid>4</gid><hid>6</hid><wid>33</wid></data>
 ......
</xml>

So with a parser you can display a household view by picking a shortcut link for a person, say INDI #6, John Doe. With XPath you filter on the husband ID (hid='6'). Note that there will be two household matches as the loop runs:
first, gid=2, which has wid=7, and
then gid=4, with the 2nd wife, wid=33.
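
The lookup described above can be sketched in Python with the standard library's xml.etree.ElementTree, which supports a limited XPath subset. This is only a sketch: the inline data is a well-formed copy of the household sample, and the function name households_for_husband is made up for illustration.

```python
import xml.etree.ElementTree as ET

# Household sample from above, inlined so the sketch is self-contained.
GID_XML = """
<xml>
  <data><gid>2</gid><hid>6</hid><wid>7</wid></data>
  <data><gid>3</gid><hid>11</hid><wid>12</wid></data>
  <data><gid>4</gid><hid>6</hid><wid>33</wid></data>
</xml>
"""


def households_for_husband(root, hid):
    """Return (gid, wid) pairs for every household whose <hid> equals hid."""
    # ElementTree's XPath subset supports the [child='text'] predicate.
    matches = root.findall(f"data[hid='{hid}']")
    return [(m.findtext("gid"), m.findtext("wid")) for m in matches]


root = ET.fromstring(GID_XML)
households = households_for_husband(root, "6")
for gid, wid in households:
    print(f"gid={gid} wid={wid}")
```

For husband ID 6 this yields the two households, gid=2 with wid=7 and gid=4 with wid=33.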

The web-based parser then loops over PID.xml, grabs and displays each of their names, and sorts the children by birth date under each marriage/bond.

John Doe #6 (hyperlinked)
+1st Wife Name #7 (hyperlinked)
  1st child's name #26 (hyperlinked)
  2nd child's name #27 (hyperlinked)
  3rd child's name #28 (hyperlinked)
+2nd Wife Name #33 (hyperlinked)
  1st child's name #45 (hyperlinked)
 
What does this PID.xml you speak of look like?

GEDCOM:
INDI # KEY
GIVN
SURN
SEX
BIRT
DATE
DEAT
DATE
FAMC
FAMS
......
0 @I6@ INDI
1 NAME John /Doe/
2 GIVN John
2 SURN Doe
1 SEX M
1 BIRT
2 DATE 3 Mar 1926
1 DEAT
2 DATE 6 Oct 1993
1 FAMC @F0@
1 FAMS @F2@
0 @I7@ INDI
1 NAME 1st /Wife/
2 GIVN 1st
2 SURN Wife
1 SEX F
1 FAMC @F65@
1 FAMS @F77@
.......
SFTxml:
PID = PersonID key
CID = ChildID matched to GID
GVN = GivenName
SRN = Surname
SEX = sex type M/F/n
BDT = Birth Date
DDT = Death Date

<xml>
.....
<data><pid>6</pid><cid>0</cid><gvn>John</gvn><srn>Doe</srn>
        <sex>M</sex><bdt>1926/03/03</bdt><ddt>1993/10/06</ddt>
</data>
<data><pid>7</pid><cid>65</cid><gvn>1st</gvn><srn>Wife</srn>
        <sex>F</sex><bdt></bdt><ddt></ddt>
</data>
 ......
</xml>


Eliminated: FAMS, since it is already captured in GID.xml.

Eliminated: NAME, since the given name and surname can be combined when displayed.

FAMC is still used, but as CID, which matches a GID the same way FAMC matches a FAM record.
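
Here is a rough sketch of how GID.xml and PID.xml could be joined to build the household view shown earlier. It assumes well-formed files, zero-padded yyyy/mm/dd dates, and that a child's cid equals the gid of the parents' household. The child and 2nd wife records and the helper names full_name and household_view are invented for illustration.

```python
import xml.etree.ElementTree as ET

GID_XML = """<xml>
  <data><gid>2</gid><hid>6</hid><wid>7</wid></data>
  <data><gid>4</gid><hid>6</hid><wid>33</wid></data>
</xml>"""

# Child and 2nd wife records below are invented sample data, not real records.
PID_XML = """<xml>
  <data><pid>6</pid><cid>0</cid><gvn>John</gvn><srn>Doe</srn>
        <sex>M</sex><bdt>1926/03/03</bdt><ddt>1993/10/06</ddt></data>
  <data><pid>7</pid><cid>65</cid><gvn>1st</gvn><srn>Wife</srn>
        <sex>F</sex><bdt></bdt><ddt></ddt></data>
  <data><pid>33</pid><cid>70</cid><gvn>2nd</gvn><srn>Wife</srn>
        <sex>F</sex><bdt></bdt><ddt></ddt></data>
  <data><pid>27</pid><cid>2</cid><gvn>ChildB</gvn><srn>Doe</srn>
        <sex>F</sex><bdt>1952/07/19</bdt><ddt></ddt></data>
  <data><pid>26</pid><cid>2</cid><gvn>ChildA</gvn><srn>Doe</srn>
        <sex>M</sex><bdt>1950/01/05</bdt><ddt></ddt></data>
  <data><pid>45</pid><cid>4</cid><gvn>ChildC</gvn><srn>Doe</srn>
        <sex>M</sex><bdt>1960/11/30</bdt><ddt></ddt></data>
</xml>"""


def full_name(person):
    """Combine given name and surname at display time (no NAME tag needed)."""
    return f"{person.findtext('gvn')} {person.findtext('srn')} #{person.findtext('pid')}"


def household_view(gid_root, pid_root, hid):
    """Build the indented display lines for one husband and all his households."""
    people = {p.findtext("pid"): p for p in pid_root.findall("data")}
    lines = [full_name(people[hid])]
    for fam in gid_root.findall(f"data[hid='{hid}']"):
        lines.append("+" + full_name(people[fam.findtext("wid")]))
        gid = fam.findtext("gid")
        children = pid_root.findall(f"data[cid='{gid}']")
        # Zero-padded yyyy/mm/dd strings sort chronologically as plain text.
        children.sort(key=lambda c: c.findtext("bdt") or "")
        lines.extend("  " + full_name(c) for c in children)
    return lines


view = household_view(ET.fromstring(GID_XML), ET.fromstring(PID_XML), "6")
print("\n".join(view))
```

Because FAMS is dropped, the spouse link comes only from GID.xml, and the children are found by matching cid against gid.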


 
     

     
 

Next: how to store events and source documents in the best and most compact way.

If you have followed so far how the household group XML links to the person ID XML, you will see how EID.xml (EventID) stores the various events in a person's or several persons' lives.

Many people have also complained about what to call an event: do we say Baptism or Christening, or use a Jewish or Indian name and type for it? Guess what: it doesn't matter with this method.

The typical GEDCOM way tries to create a tag for every possible event out there. This makes everyone suffer trying to create a Rosetta Stone to export one XML standard to another.

I guess that sums up the problem that has been hitting people for years in standard making.

1. Everyone is creating various "tag-named events" like immigration, baptism, birth, christening, and so on. This gets in the way of religious beliefs, cultural descriptions, and so on.

I ask this: all of the above are events. All one has to do is see that if it happened at a place and time, it was an EVENT. What people wish to call it or tag it has no importance at all for the XML structure.

How is that done? Put your date and place in the node, and select the type:
<date> </date><place></place><type>(select it)</type>

So look at it this way: you're in a computer program or an online interface. Pull down a selection bar and pick Baptism, Birth, Immigration, Divorce, Murder, Land Purchase, Hair Color, DNA string, etc.

No matter what it was, it was done at a time or place. We need to stop creating date and place tags INSIDE a limitless set of descriptive tag names.

Let's reverse that and create a list of event names that can be selected and that fit the standard time and place of an event.

If you read this fully, feel free to share it with others, as this might be a big step toward getting standardized quicker.

Simple

GEDCOM:
BIRT
  DATE
  PLAC 
DEAT
  DATE
  PLAC
BURI
  PLAC
BAPM
  DATE
  PLAC
0 @I222@ INDI
1 NAME Joseph /Stransky/
2 GIVN Joseph
2 SURN Stransky
1 SEX M

1 BIRT
2 DATE 1 Aug 1915
2 PLAC 137 Meeker Ave,Brooklyn,NY
1 DEAT
2 DATE 6 Apr 1997
2 PLAC 11239 Brooklyn,NY
1 BURI
2 PLAC Calverton National Cemetery, NY
1 BAPM
2 DATE 15 Aug 1915
2 PLAC St. Cecelia's Roman Catholic Church 84 Herbert St. Brooklyn,NY 11222
1 CHAN
2 DATE 30 Dec 2001
1 FAMS @F39621513@
1 FAMC @F39621512@
1 SOUR @S35630755@
2 DATA
3 DATE 29 Dec 2001
1 NOTE twin to Anthony Stransky
2 CONT
2 CONT JOSEPH STRANSKY PFC
2 CONT US ARMY WORLD WAR II
2 CONT AUGUST 1
2 CONC , 1915 APRIL 6, 1997
2 CONT BRONZE STAR MEDAL.
SFTxml: EID.xml
EID EventID
PID Person ID it belongs to
TYP Type of event (chosen from a list of standard labels)
EDT Eventdate
EPL Event Place
DID DocumentID keyed to SID Source ID

 <xml>
.....
 <data><EID>77</EID><PID>222</PID><TYP>BIRTH</TYP>
  <EDT>1915/08/01</EDT><EPL>137 Meeker Ave, Brooklyn, NY</EPL><DID></DID>
</data>
<data><EID>78</EID><PID>222</PID><TYP>DEATH</TYP>
  <EDT>1997/04/06</EDT><EPL>Brooklyn, NY</EPL><DID></DID>
</data>
<data><EID>79</EID><PID>222</PID><TYP>BURIAL</TYP>
  <EDT></EDT><EPL>Calverton National Cemetery, NY</EPL><DID></DID>
</data>
<data><EID>92</EID><PID>222</PID><TYP>BAPTISM</TYP>
  <EDT>1915/08/15</EDT><EPL>St. Cecelia's Roman Catholic Church 84 Herbert St. Brooklyn,NY 11222</EPL><DID>453</DID>
</data>
.....
</xml>


Compared to the GEDCOM layout:
BIRT
  DATE
  PLAC 
DEAT
  DATE
  PLAC
BURI
  PLAC
BAPM
  DATE
  PLAC

NOW IS
EVENT - BIRT - DATE - PLAC 
EVENT - DEAT - DATE  - PLAC
EVENT - BURI - PLAC
EVENT - BAPM - DATE - PLAC
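
The flattening above can be sketched as a small GEDCOM reader that turns each tagged event into one generic record. This is a minimal sketch, not a full GEDCOM parser: it only handles levels 0 to 2, leaves dates in GEDCOM form, and the EVENT_TAGS mapping and the function name gedcom_events are assumptions for illustration.

```python
# Assumed mapping from GEDCOM event tags to selectable type labels.
EVENT_TAGS = {"BIRT": "BIRTH", "DEAT": "DEATH", "BURI": "BURIAL", "BAPM": "BAPTISM"}

# Shortened copy of the GEDCOM sample above.
GEDCOM = """\
0 @I222@ INDI
1 NAME Joseph /Stransky/
1 SEX M
1 BIRT
2 DATE 1 Aug 1915
2 PLAC 137 Meeker Ave,Brooklyn,NY
1 DEAT
2 DATE 6 Apr 1997
2 PLAC 11239 Brooklyn,NY
1 BURI
2 PLAC Calverton National Cemetery, NY
1 BAPM
2 DATE 15 Aug 1915
"""


def gedcom_events(text):
    """Turn BIRT/DEAT/BURI/BAPM sub-records into flat event dictionaries."""
    events, current, pid = [], None, None
    for line in text.splitlines():
        parts = line.split(" ", 2)
        if len(parts) < 2:
            continue
        level, tag = parts[0], parts[1]
        value = parts[2] if len(parts) > 2 else ""
        if level == "0":
            current = None
            # "@I222@" becomes "222" for individual records.
            pid = tag.strip("@").lstrip("I") if value == "INDI" else None
        elif level == "1":
            current = None
            if tag in EVENT_TAGS and pid is not None:
                current = {"PID": pid, "TYP": EVENT_TAGS[tag], "EDT": "", "EPL": ""}
                events.append(current)
        elif level == "2" and current is not None:
            if tag == "DATE":
                current["EDT"] = value
            elif tag == "PLAC":
                current["EPL"] = value
    return events


events = gedcom_events(GEDCOM)
for event in events:
    print(event)
```

Every event ends up with the same four fields, whatever its original tag was, so a missing date or place is just an empty value.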
-------------------------------------------------------
BONUS:
You need only one parse pass to filter events by the type you are looking for via XQuery.

With GEDCOM in XML format you need 4 passes, one per event tag, to match each date and place when doing loops.
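
As a sketch of the single pass, the Python below groups one person's events by type in one loop over EID.xml. It uses ElementTree's XPath subset rather than XQuery, the sample data is inlined with the cemetery record typed BURIAL to match the GEDCOM BURI entry, and the function name events_by_type is an assumption.

```python
import xml.etree.ElementTree as ET
from collections import defaultdict

EID_XML = """<xml>
  <data><EID>77</EID><PID>222</PID><TYP>BIRTH</TYP>
    <EDT>1915/08/01</EDT><EPL>137 Meeker Ave, Brooklyn, NY</EPL><DID></DID></data>
  <data><EID>78</EID><PID>222</PID><TYP>DEATH</TYP>
    <EDT>1997/04/06</EDT><EPL>Brooklyn, NY</EPL><DID></DID></data>
  <data><EID>79</EID><PID>222</PID><TYP>BURIAL</TYP>
    <EDT></EDT><EPL>Calverton National Cemetery, NY</EPL><DID></DID></data>
  <data><EID>92</EID><PID>222</PID><TYP>BAPTISM</TYP>
    <EDT>1915/08/15</EDT><EPL>Brooklyn, NY</EPL><DID>453</DID></data>
</xml>"""


def events_by_type(root, pid):
    """One loop over the events of a person, grouped by their TYP label."""
    grouped = defaultdict(list)
    for event in root.findall(f"data[PID='{pid}']"):
        grouped[event.findtext("TYP")].append(
            (event.findtext("EDT"), event.findtext("EPL"))
        )
    return dict(grouped)


by_type = events_by_type(ET.fromstring(EID_XML), "222")
for typ, entries in by_type.items():
    print(typ, entries)
```

The same loop works for any new type label, because the date and place always sit in EDT and EPL.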

 

Next is the image archive of documents, pictures, photos, census records, roll film, and so on. Take the last example above, the baptism event record #92: its attached DID (DocumentID) links to a source record with SID=453.

GEDCOM:
SOUR
TITL
NOTE
CONT
REPO

0 @S55@ SOUR
1 TITL GEDCOM File : at home.ged
0 @S56@ SOUR
1 TITL Nicholas Buck.FTW
1 NOTE Source Media Type: Other
1 REPO @R85@
0 @S57@ SOUR
1 TITL Goshenhopper Register, 1741-1819/Deaths
1 NOTE NS73
2 CONT
2 CONT Source Media Type: Book
1 REPO @R85@
SFTxml: SID.xml
SID SourceID KEY
TYP (select)
CLS  classification type
SDT SourceDate
SPL SourcePlace
DLO Document Location C:/genarchive/(+) or Root/genarchive/(+)
 <xml>
.....
 <data><SID>77</SID><TYP>Birth</TYP><CLS>certificate</CLS>
  <SDT>1915/08/01</SDT><SPL>St. Jude's Hospital, Brooklyn, NY</SPL><DLO>467364.jpg</DLO>
</data>
 <data><SID>98</SID><TYP>Census</TYP><CLS>1910 Fed Census</CLS><SDT>1910/01/19</SDT><SPL>King Co. District 12 page 11</SPL><DLO>46765.pdf</DLO>
</data>
.....
</xml>
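
A minimal sketch of following an event's DID to its source record and document file is below. It assumes the DID value equals the SID key and that DLO is a file name under the archive root. The census event and the function name document_for_event are hypothetical, made up only to demonstrate the link.

```python
import xml.etree.ElementTree as ET

SID_XML = """<xml>
  <data><SID>98</SID><TYP>Census</TYP><CLS>1910 Fed Census</CLS>
    <SDT>1910/01/19</SDT><SPL>King Co. District 12 page 11</SPL>
    <DLO>46765.pdf</DLO></data>
</xml>"""

# Hypothetical event record pointing at source 98 (not from the real data).
EVENT_XML = """<data><EID>120</EID><PID>11</PID><TYP>CENSUS</TYP>
  <EDT>1910/01/19</EDT><EPL>King Co. District 12 page 11</EPL><DID>98</DID></data>"""

ARCHIVE_ROOT = "C:/genarchive/"


def document_for_event(event, sid_root, archive_root=ARCHIVE_ROOT):
    """Return the archive path of the document behind an event, or None."""
    did = event.findtext("DID")
    if not did:
        return None  # the event has no document attached
    source = sid_root.find(f"data[SID='{did}']")
    if source is None:
        return None  # DID points at a source that is not in SID.xml
    return archive_root + source.findtext("DLO")


sid_root = ET.fromstring(SID_XML)
path = document_for_event(ET.fromstring(EVENT_XML), sid_root)
print(path)
```

An event with an empty DID, like the birth sample earlier, simply has no document to show.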

 

Family Group   Person         Records        Sources
GID KEY#       PID KEY#       EID KEY#       SID KEY#
HID = pid      CID = gid      PID = pid      RID = rid
WID = pid      GVN            SID = sid      TYP (select)
..**..         SRN            LID = lid      CLS
               SEX (select)   TYP (select)   SDT
               BDT            EDT            SPL
               DDT            EPL            DLO
               ..**..         DID            ..**..
                              ..**..

Place Location   Repository
LID KEY#         RID KEY#
..**..           ..**..


So you can see there is still room to make changes. GID and PID work hand in hand for the navigation and display templates. This is really the genealogist's navigation outline and quick-view summary.

EID and SID work hand in hand and can also be used by archive sites as a standalone system.

You have all your documents in SID, and in EID the facts taken off those documents. Let's use a census as an example.

You see 8 people in a household, so you make 8 EID entries, one describing each person on the census, but they all point back at the same image or source document.

From either EID or PID you can link to existing people or create new people when needed, and the records will auto-link to each other.
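
The census case can be sketched as generating one EID entry per person while reusing a single DID. The person IDs 501 to 508, the starting EID of 200, and the function name census_events are all hypothetical. The census date and place are borrowed from the SID 98 sample.

```python
import xml.etree.ElementTree as ET


def census_events(first_eid, pids, edt, epl, did):
    """Create one <data> event per person, all sharing the same document ID."""
    root = ET.Element("xml")
    for offset, pid in enumerate(pids):
        data = ET.SubElement(root, "data")
        fields = {
            "EID": first_eid + offset,
            "PID": pid,
            "TYP": "CENSUS",
            "EDT": edt,
            "EPL": epl,
            "DID": did,
        }
        for tag, value in fields.items():
            ET.SubElement(data, tag).text = str(value)
    return root


household = range(501, 509)  # eight hypothetical person IDs
census_root = census_events(200, household, "1910/01/19",
                            "King Co. District 12 page 11", 98)
print(ET.tostring(census_root, encoding="unicode"))
```

All eight entries carry DID 98, so the scanned census page is stored and described only once.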

 "..**.." indicates other items need to be added and clarified, I hope you understand and feel this is simplified. I will next go into a detailed or expanded understanding of this format.
The above covers only the very basic, must-have format. Next is to include all those various LABELS that no one can agree on for tagging nodes, in a selectable standard format.

 
     

 
