Facility ID's in iHRIS...

Hi All,
I am forwarding this question I received from Ed Bunker @ Jhpiego. I am sure he would appreciate your thoughts.

Cheers.
-carl

···

On Oct 10, 2013, at 1:05 PM, Edward Bunker <Edward.Bunker@jhpiego.org> wrote:

Carl,

Hello.

In your experience with iHRIS, what is the range of formats and lengths that you have found for Facility IDs… that come from MoH, etc.

Would an open text field of length 20 cover just about anything that we might encounter?

BTW… Thanks again for your reference to the various HR-related standards.

-- Ed

********************************

Jhpiego-Innovating to Save Lives

Edward B. Bunker, MPH, MS

Health Informatics Advisor

Jhpiego-an affiliate of Johns Hopkins University
1615 Thames Street
Baltimore, MD 21231-3492
Phone: 410.537.1891
Fax: 410.537.1476
www.jhpiego.org

PLEASE NOTE NEW E-MAIL ADDRESS: Edward.Bunker@jhpiego.org
---------------

Johns Hopkins Bloomberg School of Public Health
Public Health Informatics Certificate Program
Associate Director
www.jhsph.edu/dept/hpm/certificates/informatics/index.html

---------------

Johns Hopkins Bloomberg School of Public Health
Department of International Health
Associate

I believe the GUID standard is 35 alpha numeric (plus special characters, isn't it?). Not that we would ever really attain an international master facility registry, or that we couldn't just add a GUID if we ever found we needed one...just sayin'...for the sake of looking blackberry busy while eating alone at a crowded place :slight_smile:
Michael Gehron
202.203.7485 (O)
202.486.2563 (C)

···

From: Carl Leitner [mailto:litlfred@gmail.com]
Sent: Thursday, October 10, 2013 02:30 PM
To: Edward Bunker <Edward.Bunker@jhpiego.org>; <facility-registry@googlegroups.com> <facility-registry@googlegroups.com>
Cc: Carl Leitner <cleitner@capacityplus.org>
Subject: Re: Facility ID's in iHRIS...

Hi All,
I am forwarding this question I received from Ed Bunker @ Jhpiego. I am sure he would appreciate your thoughts.

Cheers.
-carl

On Oct 10, 2013, at 1:05 PM, Edward Bunker <Edward.Bunker@jhpiego.org<mailto:Edward.Bunker@jhpiego.org>> wrote:

Carl,

Hello.

In your experience with iHRIS, what is the range of formats and lengths that you have found for Facility IDs… that come from MoH, etc.

Would an open text field of length 20 cover just about anything that we might encounter?

BTW… Thanks again for your reference to the various HR-related standards.

-- Ed

********************************

Jhpiego-Innovating to Save Lives

Edward B. Bunker, MPH, MS

Health Informatics Advisor

Jhpiego-an affiliate of Johns Hopkins University
1615 Thames Street
Baltimore, MD 21231-3492
Phone: 410.537.1891
Fax: 410.537.1476
www.jhpiego.org<http://www.jhpiego.org/>

PLEASE NOTE NEW E-MAIL ADDRESS: Edward.Bunker@jhpiego.org<mailto:Edward.Bunker@jhpiego.org>
---------------

Johns Hopkins Bloomberg School of Public Health
Public Health Informatics Certificate Program
Associate Director
www.jhsph.edu/dept/hpm/certificates/informatics/index.html<http://www.jhsph.edu/dept/hpm/certificates/informatics/index.html>

---------------

Johns Hopkins Bloomberg School of Public Health
Department of International Health
Associate

--
You received this message because you are subscribed to the Google Groups "Facility Registry (OHIE)" group.
To unsubscribe from this group and stop receiving emails from it, send an email to facility-registry+unsubscribe@googlegroups.com.
For more options, visit https://groups.google.com/groups/opt_out.

Don’t get me started on guids …

In terms of curated codes they are typically quite short because people haven’t given them much thought and just assign integers. I have also seen cases where they are a concatenation of upper levels - statecode+districtcode+facilitycode in which case they can grow quite large. I believe facilities in India have identifiers like this though I haven’t looked for a while (I think about 16 chars).

Whatever you choose you will soon enough find something which breaks your hard limit so be prepared to change. But 25 sounds like a reasonable start.

Agree with Mike that UUIDs can be added in addition to, rather than as the MOH identifier. Not that I believe they serve much real purpose, but as I said, don’t get me started …

Bob

PS. John Lewis just sent me an example from Andra Pradesh just as I was about to hit the send button. 18 digits: 280728900111100030

···

On 11 October 2013 12:12, Gehron, Michael M GehronMM@state.gov wrote:

I believe the GUID standard is 35 alpha numeric (plus special characters, isn’t it?). Not that we would ever really attain an international master facility registry, or that we
couldn’t just add a GUID if we ever found we needed one…just sayin’…for the sake of looking blackberry busy while eating alone at a crowded place :slight_smile:

Michael Gehron

202.203.7485 (O)

202.486.2563 (C)

From: Carl Leitner [mailto:litlfred@gmail.com]
Sent: Thursday, October 10, 2013 02:30 PM
To: Edward Bunker Edward.Bunker@jhpiego.org; facility-registry@googlegroups.com facility-registry@googlegroups.com
Cc: Carl Leitner cleitner@capacityplus.org
Subject: Re: Facility ID’s in iHRIS…

Hi All,
I am forwarding this question I received from Ed Bunker @ Jhpiego. I am sure he would appreciate your thoughts.

Cheers.

-carl

On Oct 10, 2013, at 1:05 PM, Edward Bunker Edward.Bunker@jhpiego.org wrote:

Carl,

Hello.

In your experience with iHRIS, what is the range of formats and lengths that you have found for Facility IDs… that come from MoH, etc.

Would an open text field of length 20 cover just about anything that we might encounter?

BTW… Thanks again for your reference to the various HR-related standards.

– Ed


Jhpiego-Innovating to Save Lives

Edward B. Bunker, MPH, MS

Health Informatics Advisor

Jhpiego-an affiliate of Johns Hopkins University

1615 Thames Street

Baltimore, MD 21231-3492

Phone: 410.537.1891

Fax: 410.537.1476

www.jhpiego.org

PLEASE NOTE NEW E-MAIL ADDRESS: Edward.Bunker@jhpiego.org


Johns Hopkins Bloomberg School of Public Health

Public Health Informatics Certificate Program

Associate Director

www.jhsph.edu/dept/hpm/certificates/informatics/index.html


Johns Hopkins Bloomberg School of Public Health

Department of International Health

Associate

You received this message because you are subscribed to the Google Groups “Facility Registry (OHIE)” group.

To unsubscribe from this group and stop receiving emails from it, send an email to facility-registry+unsubscribe@googlegroups.com.

For more options, visit https://groups.google.com/groups/opt_out.

You received this message because you are subscribed to the Google Groups “Facility Registry (OHIE)” group.

To unsubscribe from this group and stop receiving emails from it, send an email to facility-registry+unsubscribe@googlegroups.com.

For more options, visit https://groups.google.com/groups/opt_out.

If we make the field large enough to hold a GUID then we’re able to support instances where the unique ID is system-generated (a GUID) and the “lookup ID” can be whatever is convenient and human-readable. For all their flaws as a human-keyed ID, GUIDs make great “system” IDs. No harm in making the field large enough to hold them…

My $0.05 worth (we banned pennies in Canada a couple of months ago… ) :wink:

Derek.

···

On Fri, Oct 11, 2013 at 7:48 AM, Bob Jolliffe bobjolliffe@gmail.com wrote:

Don’t get me started on guids …

In terms of curated codes they are typically quite short because people haven’t given them much thought and just assign integers. I have also seen cases where they are a concatenation of upper levels - statecode+districtcode+facilitycode in which case they can grow quite large. I believe facilities in India have identifiers like this though I haven’t looked for a while (I think about 16 chars).

Whatever you choose you will soon enough find something which breaks your hard limit so be prepared to change. But 25 sounds like a reasonable start.

Agree with Mike that UUIDs can be added in addition to, rather than as the MOH identifier. Not that I believe they serve much real purpose, but as I said, don’t get me started …

Bob

PS. John Lewis just sent me an example from Andra Pradesh just as I was about to hit the send button. 18 digits: 280728900111100030

You received this message because you are subscribed to the Google Groups “Facility Registry (OHIE)” group.

To unsubscribe from this group and stop receiving emails from it, send an email to facility-registry+unsubscribe@googlegroups.com.

For more options, visit https://groups.google.com/groups/opt_out.


Derek Ritz

This email may contain confidential information intended only for the recipient. If you receive it by accident, please delete it.

On 11 October 2013 12:12, Gehron, Michael M GehronMM@state.gov wrote:

I believe the GUID standard is 35 alpha numeric (plus special characters, isn’t it?). Not that we would ever really attain an international master facility registry, or that we
couldn’t just add a GUID if we ever found we needed one…just sayin’…for the sake of looking blackberry busy while eating alone at a crowded place :slight_smile:

Michael Gehron

202.203.7485 (O)

202.486.2563 (C)

From: Carl Leitner [mailto:litlfred@gmail.com]
Sent: Thursday, October 10, 2013 02:30 PM
To: Edward Bunker Edward.Bunker@jhpiego.org; facility-registry@googlegroups.com facility-registry@googlegroups.com
Cc: Carl Leitner cleitner@capacityplus.org
Subject: Re: Facility ID’s in iHRIS…

Hi All,
I am forwarding this question I received from Ed Bunker @ Jhpiego. I am sure he would appreciate your thoughts.

Cheers.

-carl

On Oct 10, 2013, at 1:05 PM, Edward Bunker Edward.Bunker@jhpiego.org wrote:

Carl,

Hello.

In your experience with iHRIS, what is the range of formats and lengths that you have found for Facility IDs… that come from MoH, etc.

Would an open text field of length 20 cover just about anything that we might encounter?

BTW… Thanks again for your reference to the various HR-related standards.

– Ed


Jhpiego-Innovating to Save Lives

Edward B. Bunker, MPH, MS

Health Informatics Advisor

Jhpiego-an affiliate of Johns Hopkins University

1615 Thames Street

Baltimore, MD 21231-3492

Phone: 410.537.1891

Fax: 410.537.1476

www.jhpiego.org

PLEASE NOTE NEW E-MAIL ADDRESS: Edward.Bunker@jhpiego.org


Johns Hopkins Bloomberg School of Public Health

Public Health Informatics Certificate Program

Associate Director

www.jhsph.edu/dept/hpm/certificates/informatics/index.html


Johns Hopkins Bloomberg School of Public Health

Department of International Health

Associate

You received this message because you are subscribed to the Google Groups “Facility Registry (OHIE)” group.

To unsubscribe from this group and stop receiving emails from it, send an email to facility-registry+unsubscribe@googlegroups.com.

For more options, visit https://groups.google.com/groups/opt_out.

You received this message because you are subscribed to the Google Groups “Facility Registry (OHIE)” group.

To unsubscribe from this group and stop receiving emails from it, send an email to facility-registry+unsubscribe@googlegroups.com.

For more options, visit https://groups.google.com/groups/opt_out.

Seems like there’s a couple of comments / questions here:

  1. What’s the most flexible datatype specification for a facility code that you’ve seen? I agree with Mike that it should probably allow for a GUID, which most easily would be stored as a string of 36 characters. They are a terror to use however, and I sure as heck hope they wouldn’t be used by individuals in real-world operational settings.

  2. What are best practices around coming up with a facility code “scheme”? In my experience, it’s a grave mistake to advise anyone to build IDs that have implicit meaning in their structure. For all of the reasons that Bob describes below, as soon as you put meaning into a code, you run into a constraint within the scheme. I’ve seen it happen so many times now. :frowning: The additional comment is that check digits (a mathematically imputed value usually appended to the end of a code) are really good practice. That way, you can better protect against human entry errors.

Just my $0.02.

-Paul

···

On Friday, October 11, 2013, Bob Jolliffe wrote:

Don’t get me started on guids …

In terms of curated codes they are typically quite short because people haven’t given them much thought and just assign integers. I have also seen cases where they are a concatenation of upper levels - statecode+districtcode+facilitycode in which case they can grow quite large. I believe facilities in India have identifiers like this though I haven’t looked for a while (I think about 16 chars).

Whatever you choose you will soon enough find something which breaks your hard limit so be prepared to change. But 25 sounds like a reasonable start.

Agree with Mike that UUIDs can be added in addition to, rather than as the MOH identifier. Not that I believe they serve much real purpose, but as I said, don’t get me started …

Bob

PS. John Lewis just sent me an example from Andra Pradesh just as I was about to hit the send button. 18 digits: 280728900111100030

On 11 October 2013 12:12, Gehron, Michael M GehronMM@state.gov wrote:

I believe the GUID standard is 35 alpha numeric (plus special characters, isn’t it?). Not that we would ever really attain an international master facility registry, or that we
couldn’t just add a GUID if we ever found we needed one…just sayin’…for the sake of looking blackberry busy while eating alone at a crowded place :slight_smile:

Michael Gehron

202.203.7485 (O)

202.486.2563 (C)

From: Carl Leitner [mailto:litlfred@gmail.com]
Sent: Thursday, October 10, 2013 02:30 PM
To: Edward Bunker Edward.Bunker@jhpiego.org; facility-registry@googlegroups.com facility-registry@googlegroups.com
Cc: Carl Leitner cleitner@capacityplus.org
Subject: Re: Facility ID’s in iHRIS…

Hi All,
I am forwarding this question I received from Ed Bunker @ Jhpiego. I am sure he would appreciate your thoughts.

Cheers.

-carl

On Oct 10, 2013, at 1:05 PM, Edward Bunker Edward.Bunker@jhpiego.org wrote:

Carl,

Hello.

In your experience with iHRIS, what is the range of formats and lengths that you have found for Facility IDs… that come from MoH, etc.

Would an open text field of length 20 cover just about anything that we might encounter?

BTW… Thanks again for your reference to the various HR-related standards.

– Ed


Jhpiego-Innovating to Save Lives

Edward B. Bunker, MPH, MS

I’ve a couple of thoughts. First, and partly in response to the original question, I guess there is a difference between identifiers which are currently in common use and what might be a more ideal or suggested best practice.

In my earlier answer to the first, I have found that the most common are either simple integers or some geographic or administrative concatenation of codes.

The merits and demerits of various types of identifier is really wholly dependent of what they are used to identify and how they will be used and who (or what) will use them. For example, with patient identifiers it is can be very important that they lend themselves to anonymous processes - in which case the inclusion of content with implicit meaning (birth date, sex, ethnicity or what have you) seems generally to be considered bad practice. Having said that the Norwegians make use of the their national id (which includes various meanings) in their health record which they seem to be happy enough with. I’m not sure I am would be but I’m not Norwegian.

I am less sure that context independence is necessarily the main thing to strive for when identifying health facilities. Though I take the point that there are hazards. If you identify a facility by the name (early dhis practice) then there are problems that (i) the name might change (ii) there can be different names for the same facility (eg language) and (iii) typically as the geographic scope widens, there can be many facilities with the same name. Identifiers which encode hierarchy, like the 18 digit one from Andra Pradesh, work well until the hierarchy changes.

But these are convenient and useful. As long as you can take into account the fact that their validity is time bound. eg. before the boundary changes the facility code was 67686896 but now it is 6545454. It can be as important to know how the facility was identified then as well as now.

UUIDS (or guids) don’t generally serve much useful purpose when identifying facilities. They simply work to reinforce what you already know. If you have a database full of unique facilities then you can meaningfully assign globally unique uuids to them. But if there are existing duplicates you just end up assigning each duplicate a uniquely different uuid. And if you have a candidate for a new facility to add to the list you need to use other more reliable forms of identification as it doesn’t come with a uuid baked in. I guess this is the difference with for example disk drives, where the uuid is assigned by the manufacturer. Or mac addresses on network adaptors. If the uuid were inscribed in the foundation stone of the clinic, it would be useful. But generally I find them an expensive distraction.

A more stable identification can be had with the geo-coordinates, though facilities can and do move. Mobile clinics being the extreme example.

Trying to fathom a list of useful guidelines from the complexity I would venture:

  1. agree with Paul. Do use checkdigits. Particularly for identifiers that will be entered manually

  2. cater for the fact that identifiers can be timebound

  3. understand that multiple identifiers can be assigned by different authorities - some may be more ideal than others and often you have zero control over them. So rather than creating a structure which takes into account just the max length, rather take into account identifier context, validity and multiplicity. This is a pattern you find in hl7 and elsewhere (even to some extent in the FRED spec).

  4. long random numbers don’t really help when trying to avoid duplicates in a collection. Rather focus on the name, location, type and other attributes as clues to whether you have potential duplicates.

  5. something in the range of 5-6 digits plus a check digit is a reasonable length for a human-grokkable identifier, but these would necessarily be geographically scoped.

  6. An identifier which is used for cross reference between or within interoperable computer systems need not be the same thing as a human grokkable identifier. But it might include it.

Also there is no technical substitute for local knowledge. The program may say there are 50 clinics in a district, all with unique codes, check digits and what have you, but trust the district manager who knows that there are only 49.

Thats at least 5 cents worth …

Bob

···

On 12 October 2013 15:41, Paul Biondich pbiondic@regenstrief.org wrote:

Seems like there’s a couple of comments / questions here:

  1. What’s the most flexible datatype specification for a facility code that you’ve seen? I agree with Mike that it should probably allow for a GUID, which most easily would be stored as a string of 36 characters. They are a terror to use however, and I sure as heck hope they wouldn’t be used by individuals in real-world operational settings.
  1. What are best practices around coming up with a facility code “scheme”? In my experience, it’s a grave mistake to advise anyone to build IDs that have implicit meaning in their structure. For all of the reasons that Bob describes below, as soon as you put meaning into a code, you run into a constraint within the scheme. I’ve seen it happen so many times now. :frowning: The additional comment is that check digits (a mathematically imputed value usually appended to the end of a code) are really good practice. That way, you can better protect against human entry errors.

Just my $0.02.

-Paul

On Friday, October 11, 2013, Bob Jolliffe wrote:

Don’t get me started on guids …

In terms of curated codes they are typically quite short because people haven’t given them much thought and just assign integers. I have also seen cases where they are a concatenation of upper levels - statecode+districtcode+facilitycode in which case they can grow quite large. I believe facilities in India have identifiers like this though I haven’t looked for a while (I think about 16 chars).

Whatever you choose you will soon enough find something which breaks your hard limit so be prepared to change. But 25 sounds like a reasonable start.

Agree with Mike that UUIDs can be added in addition to, rather than as the MOH identifier. Not that I believe they serve much real purpose, but as I said, don’t get me started …

Bob

PS. John Lewis just sent me an example from Andra Pradesh just as I was about to hit the send button. 18 digits: 280728900111100030

On 11 October 2013 12:12, Gehron, Michael M GehronMM@state.gov wrote:

I believe the GUID standard is 35 alpha numeric (plus special characters, isn’t it?). Not that we would ever really attain an international master facility registry, or that we
couldn’t just add a GUID if we ever found we needed one…just sayin’…for the sake of looking blackberry busy while eating alone at a crowded place :slight_smile:

Michael Gehron

202.203.7485 (O)

202.486.2563 (C)

From: Carl Leitner [mailto:litlfred@gmail.com]
Sent: Thursday, October 10, 2013 02:30 PM
To: Edward Bunker Edward.Bunker@jhpiego.org; facility-registry@googlegroups.com facility-registry@googlegroups.com
Cc: Carl Leitner cleitner@capacityplus.org
Subject: Re: Facility ID’s in iHRIS…

Hi All,
I am forwarding this question I received from Ed Bunker @ Jhpiego. I am sure he would appreciate your thoughts.

Cheers.

-carl

On Oct 10, 2013, at 1:05 PM, Edward Bunker Edward.Bunker@jhpiego.org wrote:

Carl,

Hello.

In your experience with iHRIS, what is the range of formats and lengths that you have found for Facility IDs… that come from MoH, etc.

Would an open text field of length 20 cover just about anything that we might encounter?

BTW… Thanks again for your reference to the various HR-related standards.

– Ed


Jhpiego-Innovating to Save Lives

Edward B. Bunker, MPH, MS


You received this message because you are subscribed to the Google Groups “Facility Registry (OHIE)” group.

To unsubscribe from this group and stop receiving emails from it, send an email to facility-registry+unsubscribe@googlegroups.com.

For more options, visit https://groups.google.com/groups/opt_out.

I am going to put in my .02 cents as well – but these are Kenyan cents so possibly worth 1/84th of the 0.02 US cents added so far….I do agree with all of the points in the mail below which is very thorough. I just want to add the practical situations we faced in Kenya when we designed the MFL a few years ago.

The number one biggest hurdle was that EVERYONE except 2-3 enlightened individuals insisted that the facility ID be Province + District + Facility!

Everyone from the govt to the donors was sure that the facility id should have meaning… this is because of historical reasons, mainly:

  •      previously before computers it was necessary to know these ids as part of the facility id as you could not look them up or have them displayed on the screen.  So all previous systems had had boundary IDs within the facility ID.
    
  •      previously boundaries had not changed much in Kenya.
    

But in the 2-3 years of the implementation of the MFL in Kenya the number of districts rose from 105 to over 285.

And now the 8 provinces have disappeared altogether and have been replaced by 47 counties.

Worse:

  •      the creation of new districts often preceded official codes and official boundaries by as much as 1 year.  So often different ministries had to make up their own codes for districts and there were no standard district codes.  There was not a universally accepted single list of districts with codes – but multiple documents that kept changing.
    
  •      Often when new district codes were created they reused previous codes – but the new code referred to a different set of boundaries than the previous code…. (unlike say a land registry plot number that ‘retires’ the plot number if it is subdivided – and the 2 new plots get totally new numbers).
    

There were lots of other subtle practical problems, all of which lead to the TOTAL unsuitability of using any administrative boundary IDs in the facility ID.

Instead, we used a sequential meaningless numeric integer – currently up to about 20000 – which worked since all IDs are assigned by a central database server when any district officer creates a new facility through the web application. (There is also a GUID, but that is for internal software applications only).

Apart from all the other excellent points in the email below, I completely recommend a facility ID with no meaning – but am sure that most users will be requesting this – so be prepared for this with lots of documented examples of why it is not suitable.

If the identifier is meaningless (and will not change), it is not necessary to have the ID to be timebound. But what is necessary to have the DISTRICT or other key administrative boundary to be timebound. So a subtable to show how facilities changed in districts over time. The checkdigit is also a good idea. Agree 5-6 digits plus checkdigit.

Regards

Paul

···

From: facility-registry@googlegroups.com [mailto:facility-registry@googlegroups.com] On Behalf Of Bob Jolliffe
Sent: 12 October 2013 17:26
To: facility-registry@googlegroups.com
Cc: Edward.Bunker@jhpiego.org; cleitner@capacityplus.org
Subject: Re: Facility ID’s in iHRIS…

I’ve a couple of thoughts. First, and partly in response to the original question, I guess there is a difference between identifiers which are currently in common use and what might be a more ideal or suggested best practice.

In my earlier answer to the first, I have found that the most common are either simple integers or some geographic or administrative concatenation of codes.

The merits and demerits of various types of identifier is really wholly dependent of what they are used to identify and how they will be used and who (or what) will use them. For example, with patient identifiers it is can be very important that they lend themselves to anonymous processes - in which case the inclusion of content with implicit meaning (birth date, sex, ethnicity or what have you) seems generally to be considered bad practice. Having said that the Norwegians make use of the their national id (which includes various meanings) in their health record which they seem to be happy enough with. I’m not sure I am would be but I’m not Norwegian.

I am less sure that context independence is necessarily the main thing to strive for when identifying health facilities. Though I take the point that there are hazards. If you identify a facility by the name (early dhis practice) then there are problems that (i) the name might change (ii) there can be different names for the same facility (eg language) and (iii) typically as the geographic scope widens, there can be many facilities with the same name. Identifiers which encode hierarchy, like the 18 digit one from Andra Pradesh, work well until the hierarchy changes.

But these are convenient and useful. As long as you can take into account the fact that their validity is time bound. eg. before the boundary changes the facility code was 67686896 but now it is 6545454. It can be as important to know how the facility was identified then as well as now.

UUIDS (or guids) don’t generally serve much useful purpose when identifying facilities. They simply work to reinforce what you already know. If you have a database full of unique facilities then you can meaningfully assign globally unique uuids to them. But if there are existing duplicates you just end up assigning each duplicate a uniquely different uuid. And if you have a candidate for a new facility to add to the list you need to use other more reliable forms of identification as it doesn’t come with a uuid baked in. I guess this is the difference with for example disk drives, where the uuid is assigned by the manufacturer. Or mac addresses on network adaptors. If the uuid were inscribed in the foundation stone of the clinic, it would be useful. But generally I find them an expensive distraction.

A more stable identification can be had with the geo-coordinates, though facilities can and do move. Mobile clinics being the extreme example.

Trying to fathom a list of useful guidelines from the complexity I would venture:

  1. agree with Paul. Do use checkdigits. Particularly for identifiers that will be entered manually

  2. cater for the fact that identifiers can be timebound

  3. understand that multiple identifiers can be assigned by different authorities - some may be more ideal than others and often you have zero control over them. So rather than creating a structure which takes into account just the max length, rather take into account identifier context, validity and multiplicity. This is a pattern you find in hl7 and elsewhere (even to some extent in the FRED spec).

  4. long random numbers don’t really help when trying to avoid duplicates in a collection. Rather focus on the name, location, type and other attributes as clues to whether you have potential duplicates.

  5. something in the range of 5-6 digits plus a check digit is a reasonable length for a human-grokkable identifier, but these would necessarily be geographically scoped.

  6. An identifier which is used for cross reference between or within interoperable computer systems need not be the same thing as a human grokkable identifier. But it might include it.

Also there is no technical substitute for local knowledge. The program may say there are 50 clinics in a district, all with unique codes, check digits and what have you, but trust the district manager who knows that there are only 49.

Thats at least 5 cents worth …

Bob

On 12 October 2013 15:41, Paul Biondich pbiondic@regenstrief.org wrote:

Seems like there’s a couple of comments / questions here:

  1. What’s the most flexible datatype specification for a facility code that you’ve seen? I agree with Mike that it should probably allow for a GUID, which most easily would be stored as a string of 36 characters. They are a terror to use however, and I sure as heck hope they wouldn’t be used by individuals in real-world operational settings.

  2. What are best practices around coming up with a facility code “scheme”? In my experience, it’s a grave mistake to advise anyone to build IDs that have implicit meaning in their structure. For all of the reasons that Bob describes below, as soon as you put meaning into a code, you run into a constraint within the scheme. I’ve seen it happen so many times now. :frowning: The additional comment is that check digits (a mathematically imputed value usually appended to the end of a code) are really good practice. That way, you can better protect against human entry errors.

Just my $0.02.

-Paul

On Friday, October 11, 2013, Bob Jolliffe wrote:

Don’t get me started on guids …

In terms of curated codes they are typically quite short because people haven’t given them much thought and just assign integers. I have also seen cases where they are a concatenation of upper levels - statecode+districtcode+facilitycode in which case they can grow quite large. I believe facilities in India have identifiers like this though I haven’t looked for a while (I think about 16 chars).

Whatever you choose you will soon enough find something which breaks your hard limit so be prepared to change. But 25 sounds like a reasonable start.

Agree with Mike that UUIDs can be added in addition to, rather than as the MOH identifier. Not that I believe they serve much real purpose, but as I said, don’t get me started …

Bob

PS. John Lewis just sent me an example from Andra Pradesh just as I was about to hit the send button. 18 digits: 280728900111100030

On 11 October 2013 12:12, Gehron, Michael M GehronMM@state.gov wrote:

I believe the GUID standard is 35 alpha numeric (plus special characters, isn’t it?). Not that we would ever really attain an international master facility registry, or that we couldn’t just add a GUID if we ever found we needed one…just sayin’…for the sake of looking blackberry busy while eating alone at a crowded place :slight_smile:
Michael Gehron
202.203.7485 (O)
202.486.2563 ©

From: Carl Leitner [mailto:litlfred@gmail.com]
Sent: Thursday, October 10, 2013 02:30 PM
To: Edward Bunker Edward.Bunker@jhpiego.org; facility-registry@googlegroups.com facility-registry@googlegroups.com
Cc: Carl Leitner cleitner@capacityplus.org
Subject: Re: Facility ID’s in iHRIS…

Hi All,

I am forwarding this question I received from Ed Bunker @ Jhpiego. I am sure he would appreciate your thoughts.

Cheers.

-carl

On Oct 10, 2013, at 1:05 PM, Edward Bunker Edward.Bunker@jhpiego.org wrote:

Carl,

Hello.

In your experience with iHRIS, what is the range of formats and lengths that you have found for Facility IDs… that come from MoH, etc.

Would an open text field of length 20 cover just about anything that we might encounter?

BTW… Thanks again for your reference to the various HR-related standards.

– Ed


Jhpiego-Innovating to Save Lives

Edward B. Bunker, MPH, MS

You received this message because you are subscribed to the Google Groups “Facility Registry (OHIE)” group.
To unsubscribe from this group and stop receiving emails from it, send an email to facility-registry+unsubscribe@googlegroups.com.
For more options, visit https://groups.google.com/groups/opt_out.


You received this message because you are subscribed to the Google Groups “Facility Registry (OHIE)” group.
To unsubscribe from this group and stop receiving emails from it, send an email to facility-registry+unsubscribe@googlegroups.com.
For more options, visit https://groups.google.com/groups/opt_out.

Hi Paul,

Our story is very much similar to yours.

How are you managing the registry in Kenya right now? I think the governance behind it is the key to the problem/solution…is there a lead agency overseeing everything about the facility IDs?

···

On Sun, Oct 13, 2013 at 8:36 AM, Paul Krystall paulkrystall@gmail.com wrote:

I am going to put in my .02 cents as well – but these are Kenyan cents so possibly worth 1/84th of the 0.02 US cents added so far….I do agree with all of the points in the mail below which is very thorough. I just want to add the practical situations we faced in Kenya when we designed the MFL a few years ago.

The number one biggest hurdle was that EVERYONE except 2-3 enlightened individuals insisted that the facility ID be Province + District + Facility!

Everyone from the govt to the donors was sure that the facility id should have meaning… this is because of historical reasons, mainly:

  •      previously before computers it was necessary to know these ids as part of the facility id as you could not look them up or have them displayed on the screen.  So all previous systems had had boundary IDs within the facility ID.
    
  •      previously boundaries had not changed much in Kenya.
    

But in the 2-3 years of the implementation of the MFL in Kenya the number of districts rose from 105 to over 285.

And now the 8 provinces have disappeared altogether and have been replaced by 47 counties.

Worse:

  •      the creation of new districts often preceded official codes and official boundaries by as much as 1 year.  So often different ministries had to make up their own codes for districts and there were no standard district codes.  There was not a universally accepted single list of districts with codes – but multiple documents that kept changing.
    
  •      Often when new district codes were created they reused previous codes – but the new code referred to a different set of boundaries than the previous code…. (unlike say a land registry plot number that ‘retires’ the plot number if it is subdivided – and the 2 new plots get totally new numbers).
    

There were lots of other subtle practical problems, all of which lead to the TOTAL unsuitability of using any administrative boundary IDs in the facility ID.

Instead, we used a sequential meaningless numeric integer – currently up to about 20000 – which worked since all IDs are assigned by a central database server when any district officer creates a new facility through the web application. (There is also a GUID, but that is for internal software applications only).

Apart from all the other excellent points in the email below, I completely recommend a facility ID with no meaning – but am sure that most users will be requesting this – so be prepared for this with lots of documented examples of why it is not suitable.

If the identifier is meaningless (and will not change), it is not necessary to have the ID to be timebound. But what is necessary to have the DISTRICT or other key administrative boundary to be timebound. So a subtable to show how facilities changed in districts over time. The checkdigit is also a good idea. Agree 5-6 digits plus checkdigit.

Regards

Paul

From: facility-registry@googlegroups.com [mailto:facility-registry@googlegroups.com] On Behalf Of Bob Jolliffe
Sent: 12 October 2013 17:26
To: facility-registry@googlegroups.com
Cc: Edward.Bunker@jhpiego.org; cleitner@capacityplus.org

Subject: Re: Facility ID’s in iHRIS…

I’ve a couple of thoughts. First, and partly in response to the original question, I guess there is a difference between identifiers which are currently in common use and what might be a more ideal or suggested best practice.

In my earlier answer to the first, I have found that the most common are either simple integers or some geographic or administrative concatenation of codes.

The merits and demerits of various types of identifier is really wholly dependent of what they are used to identify and how they will be used and who (or what) will use them. For example, with patient identifiers it is can be very important that they lend themselves to anonymous processes - in which case the inclusion of content with implicit meaning (birth date, sex, ethnicity or what have you) seems generally to be considered bad practice. Having said that the Norwegians make use of the their national id (which includes various meanings) in their health record which they seem to be happy enough with. I’m not sure I am would be but I’m not Norwegian.

I am less sure that context independence is necessarily the main thing to strive for when identifying health facilities. Though I take the point that there are hazards. If you identify a facility by the name (early dhis practice) then there are problems that (i) the name might change (ii) there can be different names for the same facility (eg language) and (iii) typically as the geographic scope widens, there can be many facilities with the same name. Identifiers which encode hierarchy, like the 18 digit one from Andra Pradesh, work well until the hierarchy changes.

But these are convenient and useful. As long as you can take into account the fact that their validity is time bound. eg. before the boundary changes the facility code was 67686896 but now it is 6545454. It can be as important to know how the facility was identified then as well as now.

UUIDS (or guids) don’t generally serve much useful purpose when identifying facilities. They simply work to reinforce what you already know. If you have a database full of unique facilities then you can meaningfully assign globally unique uuids to them. But if there are existing duplicates you just end up assigning each duplicate a uniquely different uuid. And if you have a candidate for a new facility to add to the list you need to use other more reliable forms of identification as it doesn’t come with a uuid baked in. I guess this is the difference with for example disk drives, where the uuid is assigned by the manufacturer. Or mac addresses on network adaptors. If the uuid were inscribed in the foundation stone of the clinic, it would be useful. But generally I find them an expensive distraction.

A more stable identification can be had with the geo-coordinates, though facilities can and do move. Mobile clinics being the extreme example.

Trying to fathom a list of useful guidelines from the complexity I would venture:

  1. agree with Paul. Do use checkdigits. Particularly for identifiers that will be entered manually
  1. cater for the fact that identifiers can be timebound
  1. understand that multiple identifiers can be assigned by different authorities - some may be more ideal than others and often you have zero control over them. So rather than creating a structure which takes into account just the max length, rather take into account identifier context, validity and multiplicity. This is a pattern you find in hl7 and elsewhere (even to some extent in the FRED spec).
  1. long random numbers don’t really help when trying to avoid duplicates in a collection. Rather focus on the name, location, type and other attributes as clues to whether you have potential duplicates.
  1. something in the range of 5-6 digits plus a check digit is a reasonable length for a human-grokkable identifier, but these would necessarily be geographically scoped.
  1. An identifier which is used for cross reference between or within interoperable computer systems need not be the same thing as a human grokkable identifier. But it might include it.

Also there is no technical substitute for local knowledge. The program may say there are 50 clinics in a district, all with unique codes, check digits and what have you, but trust the district manager who knows that there are only 49.

Thats at least 5 cents worth …

Bob

On 12 October 2013 15:41, Paul Biondich pbiondic@regenstrief.org wrote:

Seems like there’s a couple of comments / questions here:

  1. What’s the most flexible datatype specification for a facility code that you’ve seen? I agree with Mike that it should probably allow for a GUID, which most easily would be stored as a string of 36 characters. They are a terror to use however, and I sure as heck hope they wouldn’t be used by individuals in real-world operational settings.
  1. What are best practices around coming up with a facility code “scheme”? In my experience, it’s a grave mistake to advise anyone to build IDs that have implicit meaning in their structure. For all of the reasons that Bob describes below, as soon as you put meaning into a code, you run into a constraint within the scheme. I’ve seen it happen so many times now. :frowning: The additional comment is that check digits (a mathematically imputed value usually appended to the end of a code) are really good practice. That way, you can better protect against human entry errors.

Just my $0.02.

-Paul

On Friday, October 11, 2013, Bob Jolliffe wrote:

Don’t get me started on guids …

In terms of curated codes they are typically quite short because people haven’t given them much thought and just assign integers. I have also seen cases where they are a concatenation of upper levels - statecode+districtcode+facilitycode in which case they can grow quite large. I believe facilities in India have identifiers like this though I haven’t looked for a while (I think about 16 chars).

Whatever you choose you will soon enough find something which breaks your hard limit so be prepared to change. But 25 sounds like a reasonable start.

Agree with Mike that UUIDs can be added in addition to, rather than as the MOH identifier. Not that I believe they serve much real purpose, but as I said, don’t get me started …

Bob

PS. John Lewis just sent me an example from Andra Pradesh just as I was about to hit the send button. 18 digits: 280728900111100030

On 11 October 2013 12:12, Gehron, Michael M GehronMM@state.gov wrote:

I believe the GUID standard is 35 alpha numeric (plus special characters, isn’t it?). Not that we would ever really attain an international master facility registry, or that we couldn’t just add a GUID if we ever found we needed one…just sayin’…for the sake of looking blackberry busy while eating alone at a crowded place :slight_smile:

Michael Gehron
202.203.7485 (O)
202.486.2563 (C)

From: Carl Leitner [mailto:litlfred@gmail.com]
Sent: Thursday, October 10, 2013 02:30 PM
To: Edward Bunker Edward.Bunker@jhpiego.org; facility-registry@googlegroups.com facility-registry@googlegroups.com
Cc: Carl Leitner cleitner@capacityplus.org
Subject: Re: Facility ID’s in iHRIS…

Hi All,

I am forwarding this question I received from Ed Bunker @ Jhpiego. I am sure he would appreciate your thoughts.

Cheers.

-carl

On Oct 10, 2013, at 1:05 PM, Edward Bunker Edward.Bunker@jhpiego.org wrote:

Carl,

Hello.

In your experience with iHRIS, what is the range of formats and lengths that you have found for Facility IDs… that come from MoH, etc.

Would an open text field of length 20 cover just about anything that we might encounter?

BTW… Thanks again for your reference to the various HR-related standards.

– Ed


Jhpiego-Innovating to Save Lives

Edward B. Bunker, MPH, MS

You received this message because you are subscribed to the Google Groups “Facility Registry (OHIE)” group.
To unsubscribe from this group and stop receiving emails from it, send an email to facility-registry+unsubscribe@googlegroups.com.
For more options, visit https://groups.google.com/groups/opt_out.


You received this message because you are subscribed to the Google Groups “Facility Registry (OHIE)” group.
To unsubscribe from this group and stop receiving emails from it, send an email to facility-registry+unsubscribe@googlegroups.com.
For more options, visit https://groups.google.com/groups/opt_out.

You received this message because you are subscribed to the Google Groups “Facility Registry (OHIE)” group.

To unsubscribe from this group and stop receiving emails from it, send an email to facility-registry+unsubscribe@googlegroups.com.

For more options, visit https://groups.google.com/groups/opt_out.