-
-
Notifications
You must be signed in to change notification settings - Fork 119
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Incorrect county FIPS code for Bedford, VA #3531
Comments
fitnr/addfips#8 |
Update just got pushed! Should be a simple matter of updating dependencies, I'll throw this issue into this sprint. |
As far as I can tell we're still waiting on the maintainer to merge their fix commit which apparently didn't make it into the release. I'll bump them again. |
I guess we could also pin to their |
|
By Google's geocoder, you mean https://geocoder.readthedocs.io/index.html? Just poking around it seems like you'd need a TAMU key to pull FIPS codes out of county names. But it also seems like there's some federal APIs we could hit to get the FIPS codes? |
I meant Google Maps Platform's Geocoding API. IMO the primary advantages are that:
The disadvantages I am aware of are:
I use a cache layer and my usage always fits in the (generous) free tier. Occasionally cache invalidation issues cause minor annoyance, but it is easy to fix with a refresh. |
Ah sweet! What do you do for a caching layer? I also just spent a few minutes poking around at the documentation and couldn't see where FIPS code would get returned - unless that gets returned as the |
Ah ya I use this as a cleaning/standardization function to convert dirty inputs to the official county names. Then you can do a simple join against the official Census data to get FIPS codes. But you need both! Also I now realize the work I was referencing is actually public, so I'll just link to it. Sorry in advance for the data scientist quality code 😇
The row-level memory cache saves duplicate API calls per session (eg looking up the same county 1000 times), and the dataframe-level disk cache saves duplicate calls between runs (when a source dataset is unchanged). I didn't automate the cache invalidation, I just do it manually because updates are infrequent. But the free tier resets each month, so a monthly clear could make sense. |
Migrating this discussion over to #3884 to discuss options for fixing this! |
Describe the bug
The
addfips
package is labeling Bedford, VA as '51515', which is the code for Bedford City. It should actually be '51019' (Bedford County). See their list of FIPS codes.Bug Severity
How badly is this bug affecting you?
Medium: I was able to identify and fix the bug in my own workflow but it might affect other people.
To Reproduce
I found the error in the
core_eia861__yearly_service_territory
table. Census population files do not have the FIPS code 51515.The text was updated successfully, but these errors were encountered: