Since around the 26th of January, we have been unable to decode the JSON
present within the daily companies house PSC snapshot. Are you aware of
any issues? Or do we need to sanitize the data before processing?
We have tried to decode the JSON (convert it into an array) using a few
different methods (using PHP etc) - what would be the best way for us to
approach this?
I don’t know why things would have changed - I have a file from before 26th and have just tried the last of the latest files and they look the same.
The files were not correct json as they stood, IIRC - they were a series of lines, each containing the (correct) json for one entry. So (I’ve cropped some of the data “…” ):