Currently the ETL process relies on already geocoded data (blood lead tests). That data was geocoded by CDPH. There are two issues:
- for future refreshes of the data we should not rely on CDPH to do the geocoding
- for the existing data, the geocoding that CDPH was not great. I reran some of the bad addresses through the geocoder API manually and they were recognized. So we will benefit by doing this for all addresses.
Currently the ETL process relies on already geocoded data (blood lead tests). That data was geocoded by CDPH. There are two issues: