“Type your pain. Receive motivation.”
- Demo
- Screenshots
- Overview
- How It Works
- Key Features
- Tech Stack
- Installation
- Usage
- Folder Structure
- Developer Notes
- Roadmap
- Contribute
- Cinematic Design Philosophy
- License
| User Input | AI Generated Image + Cinematic Quote | AI Voice Output |
|---|---|---|
| "I lost my job as a graphic designer due to AI and now I'm nothing." | ![]() |
🎧 Play Voice |
Your message → AI motivation → Cinematic image → Spoken in voice.
AIMAN is a premium offline cinematic motivational AI.
You tell it what you're feeling — stress, failure, heartbreak —
and it transforms your message into:
- A motivational quote (generated by Local LLM — Ollama phi3:mini)
- A cinematic portrait image (Stable Diffusion v1.5)
- A deep, masculine AI voice-over (pyttsx3)
Everything happens locally.
No internet. No APIs. No tracking.
Your emotions stay yours.
flowchart LR
A["User types emotional message"] --> B["Ollama (phi3-mini) generates motivation"]
B --> C["Stable Diffusion generates cinematic portrait"]
C --> D["pyttsx3 turns quote into deep voice"]
D --> E["Outputs: Image + Quote + Voice"]
| Feature | Description |
|---|---|
| Understands your emotions | Converts your message into motivational text using phi3:mini via Ollama |
| Generates art | Creates cinematic portraits with Stable Diffusion |
| Speaks to you | Deep voice using pyttsx3 (offline) |
| 100% Local | No internet. No API keys. Privacy-first. |
| Beautiful UI | Built in Streamlit, just click and use. |
| Area | Tech |
|---|---|
| Web UI | Streamlit |
| LLM Text Generation | Ollama (phi3:mini) |
| Image Generation | Hugging Face Diffusers + Stable Diffusion |
| Voice / Speech | pyttsx3 (Offline TTS) |
| Utility | Pillow, Requests, Accelerate |
git clone https://github.com/Sourav-x-3202/aiman.git
cd aimanpython -m venv venv
venv\Scripts\activate # Windows
# or
source venv/bin/activate # Mac/Linuxpip install -r requirements.txtollama serve
ollama pull phi3:ministreamlit run app.py⚠️ Troubleshooting
Install Ollama from: https://ollama.com/download
Then restart your terminal.
Run this manually once:
ollama pull phi3:miniAIMAN will automatically switch to CPU mode. No action needed.
On Windows:
- Open Control Panel
- Go to:
Speech Recognition → Text to Speech - Select a male voice (
Guy / David / Microsoft)
Upgrade pip first:
python -m pip install --upgrade pipIf something still fails, install each dependency manually:
pip install streamlit diffusers pillow pyttsx3 accelerateCreate an issue here: https://github.com/Sourav-x-3202/aiman/issues
- Open Streamlit UI
- Enter your pain/frustration/goal
- Click Generate Motivation
- AIMAN creates:
- Voice narration
- Motivational message
- Cinematic image
text = "I feel lost and tired of failing."aiman/
│
├── app.py # Streamlit UI
├── generate_text.py # AI motivational message generation
├── motivational_image.py # Stable Diffusion cinematic image generation
├── text_to_speech.py # Voice synthesis
├── requirements.txt
├── README.md
├── assets/
│ └── fonts/ # Dancing Script font for overlay text
└── outputs/ # Generated images + voice (auto-created)
- Local LLM via Ollama (phi3:mini) → Generates motivational text
- Stable Diffusion v1.5 → Creates cinematic portraits
- Pillow + custom font → Text overlay on image
- pyttsx3 (offline TTS) → Deep masculine voice
- Auto GPU/CPU fallback based on hardware
- Outputs timestamp-named files inside
/outputs/ - No API keys, no cloud — 100% private
Click to expand — Full Detailed Developer Notes
- Uses
phi3:miniLLM inside Ollama - Fully offline — no API calls or internet dependency
- Custom prompting to maintain:
- Cinematic tone (Godfather vibes)
- Masculine mentorship voice
- Ensures messages are:
- Short
- Powerful
- Emotionally supportive
- Supports streaming so UI remains responsive
- Model:
runwayml/stable-diffusion-v1-5 - Uses
torch.float16on GPU andtorch.float32on CPU - Image generation pipeline:
- Text prompt → latent diffusion → decoding
- Applies cinematic prompt style:
warm golden light • dramatic shadows • film look
- Automatically saves images in:
outputs/
- Uses Pillow (
ImageDraw+ImageFont) - Auto-resizes text to fit image
- Intelligent line wrapping (prevents broken words)
- Adds soft drop shadow behind text
- Uses Dancing Script Bold font for elegance
(fallback to Arial if font missing)
pyttsx3runs offline — no internet requirement- Looks for male voice preferences:
- David
- Male
- Guy
- Parameters tuned for cinematic delivery:
- Speed:
rate = 145 - Volume:
1.0
- Real-time updates without page reload
- Sections:
- Input text
- Generated quote
- Generated image
- Play audio button
- Timestamp filenames:
outputs/
├── ai_image_2025-01-31_211023.png
├── ai_voice_2025-01-31_211023.wav
- No overwrites — every output is preserved
.gitignoreensures:- No output files pushed to GitHub
- No
.wav,.png,.mp3leak
| Situation | AIMAN Response |
|---|---|
| Ollama not running | " AIMAN is offline" |
| Quote generation failed | Uses backup motivational quote |
| Font not found | Uses system default font |
| GPU not detected | Automatic CPU mode |
- Export video reel (portrait + quote + voice)
- Use user's face as the cinematic output
- Add background music under voice narration
- Export video (image + voice) — like a motivational reel
- Add protagonists (your face → AI portrait)
- Voice emotion control (dominant, calm, intense)
PRs and feature requests are welcome.
- If you like this project, star the repo to support it:
- https://github.com/Sourav-x-3202/aiman
“Emotion deserves presentation.”
Developed by Sourav Sharma If you like this project, please star the repo — it motivates the developer https://github.com/Sourav-x-3202/aiman
MIT License — free to use, modify, and distribute.
AIMAN
“Pain is input. Growth is output. AIMAN is the bridge.”



