TamizhGen - SLM Architecture

1️⃣ Core Model & Processing

TamizhGen is a Small Language Model (SLM) built specifically for Tamil.

2️⃣ AI-Generated Tamil Responses (Not Just Translation!)

TamizhGen does not just translate but generates Tamil responses intelligently.

Example:

3️⃣ AI-Generated Content Types

4️⃣ Retrieval System (AI-Generated with FIASS)

Retrieval system also uses AI generation to enhance responses.

FIASS RAG (Fact-Informed Augmented Small-Scale Retrieval-Augmented Generation).

FIASS (Inspired from Facebook AI Similarity Search) + RAG retrieves high-relevance Tamil text.

Example Queries:

5️⃣ Built-in Translation for Tamil Script

Reason: Many users type in English due to keyboard limitations.

Process: AI first generates a Tamil response → Converts it into Tamil script if needed.

Example:

6️⃣ Sequence Processing

7️⃣ Decoding, Training & Optimization

8️⃣ Tokenization & Data Handling

9️⃣ Device Compatibility

Runs on both CPU and GPU (CUDA-enabled when available).

Auto-selection of the best available hardware for optimized performance.