Documentation¶
Welcome! GreenGovRAG is an AI assistant powered by Retrieval-Augmented Generation (RAG) that helps navigate Australian environmental and planning regulations.
What is GreenGovRAG?¶
GreenGovRAG combines regulatory document retrieval with geospatial filtering to answer questions about:
- Environmental Compliance - EPBC Act, state environmental regulations
- Land Use & Planning - Local council planning schemes, zoning rules
- Vegetation Clearing - Native vegetation regulations by state and LGA
- Emissions Standards - NGER reporting, ESG frameworks, carbon accounting
Key Features:
- Multi-LLM support (OpenAI, Anthropic, AWS Bedrock, Azure)
- Geospatial filtering by Local Government Area (LGA)
- Legal-grade citations with page numbers and sections
- Hybrid search (BM25 + vector similarity)
- Cloud storage support (AWS S3, Azure Blob, Local)
- Production-ready deployment options
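As a rough illustration of how hybrid search and LGA filtering can fit together, the sketch below filters a toy corpus by LGA, ranks the remaining documents with BM25 and with a cosine similarity, and merges the two rankings using reciprocal rank fusion. This is not GreenGovRAG's actual implementation: the corpus, the `lga` and `text` field names, the `hybrid_search` function, and the choice of reciprocal rank fusion are all assumptions made for the example, and bag-of-words counts stand in for real embeddings.

```python
import math
from collections import Counter

# Hypothetical toy corpus; the "lga" and "text" field names are illustrative only.
DOCS = [
    {"id": "a", "lga": "Yarra", "text": "native vegetation clearing permit requirements"},
    {"id": "b", "lga": "Yarra", "text": "zoning rules for residential land use"},
    {"id": "c", "lga": "Byron", "text": "native vegetation clearing exemptions"},
]


def bm25_scores(query, docs, k1=1.5, b=0.75):
    """Score each document against the query with the standard BM25 formula."""
    tokenized = [d["text"].lower().split() for d in docs]
    avg_len = sum(len(t) for t in tokenized) / len(tokenized)
    scores = []
    for tokens in tokenized:
        tf = Counter(tokens)
        score = 0.0
        for term in query.lower().split():
            df = sum(1 for t in tokenized if term in t)
            if df == 0:
                continue  # term appears in no document
            idf = math.log((len(docs) - df + 0.5) / (df + 0.5) + 1)
            norm = tf[term] + k1 * (1 - b + b * len(tokens) / avg_len)
            score += idf * tf[term] * (k1 + 1) / norm
        scores.append(score)
    return scores


def cosine_scores(query, docs):
    """Cosine similarity over bag-of-words counts, a stand-in for real embeddings."""
    q = Counter(query.lower().split())
    q_norm = math.sqrt(sum(v * v for v in q.values()))
    scores = []
    for d in docs:
        v = Counter(d["text"].lower().split())
        v_norm = math.sqrt(sum(x * x for x in v.values()))
        dot = sum(q[t] * v[t] for t in q)
        scores.append(dot / (q_norm * v_norm) if q_norm and v_norm else 0.0)
    return scores


def ranks(scores):
    """Map each document index to its 1-based rank (highest score first)."""
    order = sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)
    return {i: pos + 1 for pos, i in enumerate(order)}


def hybrid_search(query, docs, lga=None, rrf_k=60):
    """Filter by LGA, then fuse BM25 and vector rankings with reciprocal rank fusion."""
    candidates = [d for d in docs if lga is None or d["lga"] == lga]
    if not candidates:
        return []
    bm25_rank = ranks(bm25_scores(query, candidates))
    vec_rank = ranks(cosine_scores(query, candidates))
    fused = {
        i: 1 / (rrf_k + bm25_rank[i]) + 1 / (rrf_k + vec_rank[i])
        for i in range(len(candidates))
    }
    order = sorted(fused, key=fused.get, reverse=True)
    return [candidates[i]["id"] for i in order]


print(hybrid_search("vegetation clearing", DOCS, lga="Yarra"))  # ['a', 'b']
```

Filtering before ranking keeps documents from other LGAs out of the result set entirely, which is one plausible way to combine geospatial filtering with retrieval; a production system would more likely push the filter into the vector store query.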
Getting Started Paths¶
I want to use GreenGovRAG¶
Start here if you want to query Australian regulations:
- Installation Guide - Set up GreenGovRAG
- Quick Start - Get running in 5 minutes
- Configuration Guide - Configure your setup
- First Query Tutorial - Submit your first query
Then explore:
- User Guide - Learn how to use the system effectively
- Troubleshooting - Common issues and solutions
I want to contribute to GreenGovRAG¶
Start here if you want to add document sources or fix bugs:
- Contribution Overview - How to contribute
- Development Setup - Set up your environment
- Adding Document Sources - Add new regulations
- Code Style Guide - Follow our standards
- Testing Guide - Write and run tests
- Pull Request Workflow - Submit your changes
I want to customize or extend GreenGovRAG¶
Start here if you want to customize the RAG pipeline or add features:
- System Architecture - Understand the design
- RAG Pipeline Deep Dive - How RAG works
- ETL Pipeline Guide - Document processing
- Plugin System - Extensibility
- Metadata Standards - ESG & geospatial tags
- Citation System - Legal-grade citations
I want to deploy GreenGovRAG¶
Start here if you want to deploy to production:
- Deployment Overview - Choose your deployment
- Cloud Provider Comparison - AWS vs Azure vs Local
- Local Docker Setup - Development environment
- AWS Deployment - Deploy on AWS
- Azure Deployment - Deploy on Azure
- Production Checklist - Go-live requirements
Documentation Structure¶
User Guide¶
Learn how to use GreenGovRAG effectively:
- Querying the System - API and CLI usage
- Vector Stores - FAISS vs Qdrant
- Caching - LLM response caching for cost savings
- Monitoring - Health checks and metrics
- Troubleshooting - Common issues and solutions
Contributor Guide¶
For open-source contributors:
- Overview - How to contribute
- Development Setup - Local environment
- Adding Document Sources - New regulations
- Code Style - Ruff, MyPy, standards
- Testing - Write and run tests
- Pull Requests - PR workflow
Developer Guide¶
For advanced customization:
- Architecture
  - System Overview - High-level design
  - RAG Pipeline - RAG internals
  - ETL Pipeline - Document processing
  - Plugin System - Extensibility
- Components
  - Metadata Standards - ESG & geospatial
  - Citation System - Legal-grade citations
  - Cloud Storage - Multi-cloud architecture
- Customization
  - LLM Configuration - Switching LLM providers
  - Custom Parsers - Build custom parsers
  - Custom Embeddings - Custom embedding models
Deployment¶
Production deployment guides:
- Overview - Deployment options
- Local Docker - Docker Compose setup
- AWS - AWS ECS, S3, RDS
- Azure - Azure Container Apps, Blob Storage
- Cloud Comparison - Cost and feature matrix
- Production Checklist - Security, scaling, backups
- Monitoring - Logs, metrics, alerts
API Reference¶
Comprehensive API documentation:
- REST API - OpenAPI/Swagger docs
- Python API (Auto-generated)
  - RAG Module - Vector stores, LLMs, embeddings
  - ETL Module - Document processing pipeline
  - Models - Database models
  - Cloud Module - Cloud storage
Reference¶
Quick lookup documentation:
- Data Sources - Regulatory document catalog
- Plugin API - Plugin development reference
- Configuration Options - All `.env` variables
- CLI Commands - All CLI commands
- Database Schema - Tables and relationships
- Glossary - RAG, LGA, NGER, ESG, etc.
Quick Links¶
Most Popular Guides¶
- Quick Start Guide - Get up and running fast
- Adding Document Sources - Contribute new regulations
- Cloud Storage Guide - Multi-cloud setup
- Vector Stores - FAISS vs Qdrant
- Cloud Provider Comparison - Choose your cloud
Common Tasks¶
- Submit a query via API
- Add a new document source
- Switch from FAISS to Qdrant
- Deploy on AWS
- Configure LLM provider
Troubleshooting¶
Technology Stack¶
Backend:
- Python 3.12
- FastAPI for REST API
- SQLModel for ORM
- LangChain for RAG
- PostgreSQL with pgvector
RAG:
- FAISS/Qdrant for vector storage
- HuggingFace embeddings
- Multi-LLM support (OpenAI, Anthropic, AWS Bedrock, Azure)
ETL:
- Airflow (local dev)
- GitHub Actions (production)
- Unstructured.io for PDF parsing
Cloud:
- AWS (ECS Fargate, S3, RDS)
- Azure (Container Apps, Blob Storage, PostgreSQL)
- Docker for containerization
Project Status¶
Current Version: 0.1.0 (Pre-release)
What's Working:
- RAG query pipeline with geospatial filtering
- Multi-LLM support (OpenAI, Anthropic, AWS Bedrock, Azure)
- Vector stores (FAISS, Qdrant)
- Cloud storage (AWS S3, Azure Blob, Local)
- ETL pipeline with plugin system
- Legal-grade citations
- AWS and Azure deployment
Planned:
- 📋 Multi-LGA query support
- 📋 Real-time document update webhooks
- 📋 Parcel-level geospatial queries
- 📋 Export to PDF/DOCX reports
Support & Community¶
Get Help¶
- GitHub Issues: Report bugs or request features
- Documentation: You're here! 📚
Contributing¶
We welcome contributions! See the Contributor Guide to get started.
Quick links:
- Contributing Guidelines
- Development Setup
- Pull Request Process
Resources¶
- GitHub Repository: https://github.com/sdp5/green-gov-rag
- Project Board: https://github.com/sdp5/green-gov-rag/projects
- Changelog: View release history
- License: MIT License
About¶
Author: Sundeep Anand (contact@sundeep.id.au)
Purpose: GreenGovRAG helps individuals, businesses, and government navigate Australia's complex environmental and planning regulations through AI-powered document retrieval.
Data Sovereignty: All regulatory documents are sourced from official Australian government websites, and all data are processed and stored in Australia (Sydney datacenter). See Data Sources for details.
Ready to get started? Choose your path above or jump straight to the Quick Start Guide!