Загрузка...

Legal Document Simplifier and Risk Analyzer Using RAG and LLM

Legal Document Simplifier and Risk Analyzer

This project was developed as part of the B.Sc. (Hons.) Data Science and Artificial Intelligence Programme at IIT Guwahati for Course DA 377 Internship/Term Project - l

Project Overview

It is an AI-powered legal document analysis platform that simplifies complex legal documents, identifies risky clauses, retrieves relevant legal information using Retrieval-Augmented Generation (RAG), and generates professional PDF reports.

Key Features

• PDF, DOCX, JPG, and PNG document support
• OCR-based text extraction using EasyOCR
• Semantic search using FAISS Vector Database
• Retrieval-Augmented Generation (RAG) pipeline
• Legal risk detection and scoring
• AI-powered legal clause explanations
• Indian law knowledge base integration
• Professional PDF report generation
• Fully offline execution after model setup

Technology Stack

Frontend:

Streamlit

Backend:

Python

AI/NLP:

Sentence Transformers
MiniLM-L6-v2
Qwen2.5-7B-Instruct

Vector Database:

FAISS

Document Processing:

PyMuPDF
python-docx
EasyOCR

Reporting:

ReportLab

Project Workflow

Document Upload

Text Extraction

Chunk Generation

Embedding Creation

FAISS Retrieval

RAG Pipeline

Risk Detection

Legal Analysis

PDF Report Generation
Author

Shivam Salve
B.Sc. (Hons.) Data Science and Artificial Intelligence
IIT Guwahati

#AI #NLP #LLM #RAG #FAISS #LegalAI #MachineLearning #DataScience #IITGuwahati #Python #Streamlit

Видео Legal Document Simplifier and Risk Analyzer Using RAG and LLM канала Shivam Salve [IITG]
Яндекс.Метрика
Все заметки Новая заметка Страницу в заметки
Страницу в закладки Мои закладки
На информационно-развлекательном портале SALDA.WS применяются cookie-файлы. Нажимая кнопку Принять, вы подтверждаете свое согласие на их использование.
О CookiesНапомнить позжеПринять