The Data Ingestion & Preprocessing Layer is the foundational module of the ISRO Mission Navigator project. It collects raw mission data from multiple sources such as ISRO satellite datasets, CSV files, PDFs, and web knowledge sources, and converts them into a clean, structured format suitable for knowledge graph construction.
This module ensures that heterogeneous space mission data (satellites, rockets, scientists, missions) is standardized, cleaned, and indexed for downstream AI processing and graph generation.
-
Load ISRO datasets (CSV, PDF, text, APIs)
-
vData cleaning and normalization
-
vSchema alignment for knowledge graph
-
Metadata extraction and indexing
-
Handling missing and noisy data
✓ Reliable entity extraction
✓ Consistent graph schema
✓ High-quality semantic search inputs
ISRO Satellite CSV Dataset
-
Cleaned JSON / Structured Data
-
Indexed Mission Knowledge Base
-
Python
-
Pandas
-
NumPy
-
NLP Preprocessing Tools