Skip to content

Latest commit

 

History

History
74 lines (34 loc) · 1.23 KB

File metadata and controls

74 lines (34 loc) · 1.23 KB

📦 Module 1: Data Ingestion & Preprocessing Layer


🧠 Overview

The Data Ingestion & Preprocessing Layer is the foundational module of the ISRO Mission Navigator project. It collects raw mission data from multiple sources such as ISRO satellite datasets, CSV files, PDFs, and web knowledge sources, and converts them into a clean, structured format suitable for knowledge graph construction.


🎯 Purpose

This module ensures that heterogeneous space mission data (satellites, rockets, scientists, missions) is standardized, cleaned, and indexed for downstream AI processing and graph generation.


⚙️ Key Responsibilities

  1. Load ISRO datasets (CSV, PDF, text, APIs)

  2. vData cleaning and normalization

  3. vSchema alignment for knowledge graph

  4. Metadata extraction and indexing

  5. Handling missing and noisy data


🔍 What It Enables

✓ Reliable entity extraction

✓ Consistent graph schema

✓ High-quality semantic search inputs


📊 Input

ISRO Satellite CSV Dataset


📤 Output

  1. Cleaned JSON / Structured Data

  2. Indexed Mission Knowledge Base


🛠️ Tech Stack

  1. Python

  2. Pandas

  3. NumPy

  4. NLP Preprocessing Tools