I see that CH generates a monthly archive with many data fields. However, how complete is the information compared to data from the API? I feel that many data fields are missing in the archive, which are available through the API.
Furthermore, does CH have a schema which outlines how data from the API can be linked appropriately?
The bulk PSC file contains all the information available from the
API regarding PSC’s since this is a new product built from the API. The other
snapshots produced by Companies House may not contain (and do not in many
cases) the full data available from the API since they are legacy products. We
plan to refresh these to contain the wider data in the future but have no firm
timescales at this point.
All of the documentation available for the API can be found here:
Links to the Companies House bulk data products can be found at
the link below, any associated documentation that’s available is referenced on
the page for each product.