Data-lake based data store, adaptive AI-based cataloguing and data quality assurance

Summary
Report of the data collection and storage pipeline designed in T4.2.