AI-powered video knowledge platform: HackAIthon 2025
SuperVault: HackAIthon 2025
"AI-First isn't just a theme; it's a mindset for how we build, learn, and share knowledge." - FPT HackAIthon 2025 Committee
Overview
- Project: SuperVault, an AI-Powered Video Knowledge Assistant
- Achievement: Second Prize, FPT HackAIthon 2025
- Event Theme: AI in Learning & Knowledge Management
- Date: August 2025
- Team: GHC.AICodeCure
Problem Statement
Every year, FPT Software produces thousands of hours of internal training videos. While this content is rich with knowledge, much of it remains unstructured and underutilized: it is difficult to search, summarize, or reuse effectively.
The challenge for HackAIthon 2025:
Build an AI assistant that transforms video content into structured, searchable knowledge, enabling smarter learning and faster insight discovery.
Our Solution: SuperVault
SuperVault is an AI-powered video knowledge vault that automatically processes, analyzes, and indexes video content into a semantic search platform.
Using Azure AI Video Indexer and our custom-built Onyx AI engine, we transformed raw videos into actionable insights, including transcripts, sentiment analysis, visual recognition, and keyword-based semantic search.
Core Capabilities
- Audio & Visual Intelligence: Speech-to-text, language detection, speaker identification, OCR, and object recognition.
- AI-Powered Insights: Sentiment analysis, summarization, topic extraction, and timeline-based transcript generation.
- Semantic Search & Indexing: Smart chunking, BM25 ranking, and hybrid vector-text search via Vespa.
- Integration Layer: RESTful APIs connecting internal learning systems (Confluence, LMS, S3, etc.).
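To make the "smart chunking" and "BM25 ranking" capabilities concrete, here is a minimal, self-contained sketch in plain Python. It is an illustration under stated assumptions, not SuperVault's actual code: the `Chunk` type, `chunk_transcript`, `bm25_scores`, the 10-word chunk budget, and the sample transcript are all hypothetical, and a production setup would delegate ranking to the search engine rather than compute it by hand. The vector half of hybrid search is not shown.

```python
import math
import re
from collections import Counter
from dataclasses import dataclass


@dataclass
class Chunk:
    start: float  # seconds into the video
    end: float
    text: str


def chunk_transcript(segments, max_words=40):
    """Greedily merge (start, end, text) segments into chunks of roughly max_words words.

    A single segment longer than max_words is kept whole as its own chunk.
    """
    chunks, buf, words, start, prev_end = [], [], 0, None, None
    for seg_start, seg_end, text in segments:
        n = len(text.split())
        if buf and words + n > max_words:
            chunks.append(Chunk(start, prev_end, " ".join(buf)))
            buf, words, start = [], 0, None
        if start is None:
            start = seg_start
        buf.append(text)
        words += n
        prev_end = seg_end
    if buf:
        chunks.append(Chunk(start, prev_end, " ".join(buf)))
    return chunks


def tokenize(text):
    return re.findall(r"[a-z0-9]+", text.lower())


def bm25_scores(query, chunks, k1=1.5, b=0.75):
    """Score every chunk against the query with the standard BM25 formula."""
    docs = [tokenize(c.text) for c in chunks]
    n_docs = len(docs)
    avg_len = sum(len(d) for d in docs) / n_docs if n_docs else 0.0
    df = Counter(term for d in docs for term in set(d))  # document frequency
    scores = []
    for d in docs:
        tf = Counter(d)
        score = 0.0
        for term in tokenize(query):
            if term not in tf:
                continue
            idf = math.log(1 + (n_docs - df[term] + 0.5) / (df[term] + 0.5))
            norm = tf[term] + k1 * (1 - b + b * len(d) / avg_len)
            score += idf * tf[term] * (k1 + 1) / norm
        scores.append(score)
    return scores


# Hypothetical timestamped transcript segments (start, end, text).
segments = [
    (0.0, 4.0, "Welcome to the onboarding session"),
    (4.0, 9.0, "Today we cover deployment pipelines"),
    (9.0, 15.0, "Deployment pipelines build test and release code"),
    (15.0, 20.0, "Next week we discuss security reviews"),
]
chunks = chunk_transcript(segments, max_words=10)
scores = bm25_scores("deployment pipelines", chunks)
best = max(range(len(chunks)), key=scores.__getitem__)
print(f"best match: {chunks[best].start:.0f}s-{chunks[best].end:.0f}s")
```

Keeping the start and end times on each chunk is what lets a search hit link back to the exact moment in the video instead of the whole file.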
System Architecture
- SuperVault Core: Upload management, authentication, progress tracking, and workflow orchestration.
- Azure AI Video Indexer: Audio transcription, visual analysis, and structured insights extraction.
- Onyx AI Platform: AI-driven semantic indexing, hybrid vector search, and custom agent integration.
- Persistent Layer: Multi-cloud storage using AWS S3, Azure Blob, Vespa (Vector DB), PostgreSQL, and Redis.
- Data Flow Pipeline:
  - Upload: Video to Blob Storage
  - Analyze: Extract audio/video insights
  - Process: Summarize, visualize, and transform
  - Index: Ingest content into Onyx Search
  - Search: Query semantically via API or UI
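The five pipeline steps can be sketched as an ordered list of stage functions that pass a job object along and record progress. This is a simplified illustration, not the real orchestration code: `VideoJob`, `run_ingest`, the `blob://` URL scheme, the canned transcript, and the in-memory `INDEX` are all hypothetical stand-ins for Blob Storage, the video analysis service, and Onyx Search.

```python
from dataclasses import dataclass, field


@dataclass
class VideoJob:
    video_id: str
    artifacts: dict = field(default_factory=dict)
    completed: list = field(default_factory=list)  # stage names, for progress tracking


INDEX = {}  # in-memory stand-in for the search index


def upload(job):
    # Stand-in for copying the file to blob storage (made-up URL scheme).
    job.artifacts["blob_url"] = f"blob://videos/{job.video_id}"


def analyze(job):
    # Stand-in for the video analysis call; returns a canned transcript.
    job.artifacts["transcript"] = "intro to code review. code review checklist and tips."


def process(job):
    # Trivial "summary": the first sentence of the transcript.
    job.artifacts["summary"] = job.artifacts["transcript"].split(".")[0].strip()


def index(job):
    INDEX[job.video_id] = job.artifacts["transcript"]


INGEST_STAGES = [
    ("upload", upload),
    ("analyze", analyze),
    ("process", process),
    ("index", index),
]


def run_ingest(job):
    """Run the ingest stages in order, recording each one as it finishes."""
    for name, stage in INGEST_STAGES:
        stage(job)
        job.completed.append(name)
    return job


def search(query):
    """Query-time step: naive substring match over indexed transcripts."""
    q = query.lower()
    return [vid for vid, text in INDEX.items() if q in text.lower()]


job = run_ingest(VideoJob("demo-001"))
print(job.completed, search("checklist"))
```

Modeling ingest as a list of named stages keeps the order explicit and makes progress reporting a matter of reading `job.completed`; search stays separate because it runs at query time rather than once per upload.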
SuperVault Architecture Diagram
Key Highlights
- End-to-End AI Integration: fully automated video analysis pipeline.
- 15x Faster Knowledge Discovery: semantic search replaces manual browsing.
- Cross-Platform Ready: unified knowledge hub for multiple data sources.
- Enterprise Scalability: designed for internal deployment across FSOFT divisions.
SuperVault Demo
Reflection
SuperVault wasn't just a hackathon project; it was a vision for the future of knowledge management at FPT Software. It showcased how AI can transform passive content into active intelligence, bridging the gap between data, peopl