AutoSlides Extractor

Automatically extract slide images from video presentations using advanced computer vision

Overview • Features • Download • Quick Start • How It Works • Build

Overview

AutoSlides Extractor is a powerful cross-platform desktop application designed to intelligently identify and extract distinct slides from presentation videos. Using sophisticated computer vision algorithms—specifically Structural Similarity Index (SSIM) and Perceptual Hashing (pHash)—it automatically detects when slide content changes significantly and saves each unique slide as a high-quality image.

New in v1.1.0: AI-powered slide classification using a MobileNetV4 ONNX model to automatically identify and remove non-slide content (desktop screens, black screens), a trash review system, and a PDF Maker for exporting slides.

✨ Key Features

Core Capabilities	Advanced Tech
🎯 Intelligent Slide Detection Two-stage algorithm using SSIM for precise slide extraction	⚡ Hardware Acceleration GPU support via CUDA, OpenCL, Metal, DirectX, and Vulkan
🧹 Smart Post-Processing Auto-removal of redundant slides using pHash & exclusion lists	🤖 AI-Powered Classification MobileNetV4 model filters out non-slide content
🗑️ Trash Review System Safely review and restore removed slides with metadata tracking	🧩 High Performance Multi-threading & SIMD optimizations (AVX2, NEON)
📄 PDF Maker Organize and export extracted slides to compressed PDF documents	🎼 Multi-Format Support MP4, AVI, MOV, MKV, WMV, FLV, WebM

Detailed Features

Configurable Sensitivity: Presets (Strict, Normal, Loose) and custom SSIM thresholds.
Batch Processing: Queue multiple videos for sequential processing.
Memory Optimization: Chunk-based processing for handling large video files efficiently.
Cross-Platform: Native look and feel on macOS, Windows, and Linux.

📥 Download & Installation

Pre-built Releases

Download the latest installer or portable package for your operating system:

Platform	Download	Note
macOS	AutoSlides.Extractor-macOS-arm64.dmg	Apple Silicon only. Run quarantine fix command below.
Windows	Setup.exe / Portable.zip	64-bit Windows 10 or later.

Important

macOS Users: After installation, run the following command in Terminal to allow the app to run:

sudo xattr -d com.apple.quarantine /Applications/AutoSlides\ Extractor.app

System Requirements

OS: macOS 11+ (Apple Silicon), Windows 10+ (64-bit)
CPU: Quad-core processor recommended
RAM: 8 GB minimum (16 GB for large videos)
GPU: Recommended for hardware acceleration

🚀 Quick Start

Launch AutoSlides Extractor.
Add Videos: Drag & drop video files or click "Add Videos".
Configure (Optional): Select an Output Directory and adjust SSIM sensitivity if needed.
Start: Click the Start button.
Review: Once finished, check the output folder. Use the Trash Review to recover any mistakenly removed slides or the PDF Maker to compile them.

🛠️ Usage Guide

1. Main Workflow

Input: Supports major video formats. Videos are queued and processed one by one.
Monitoring: Real-time status updates show:
- 🟡 Queued: Waiting.
- 🔵 Processing: FFmpeg decoding -> SSIM analysis -> ML Classification.
- 🟢 Completed: Done with slide count.
Controls: Pause, Resume, or Remove items from queue easily.

2. Post-Processing & AI

Redundancy Removal: Uses pHash to find and delete near-duplicate images.
Exclusion Lists: Define patterns (like Intro/Outro slides) to automatically ignore.
AI Classification (v1.1.0):
- "slide": Kept.
- "not_slide": Removed (e.g., desktop, black screen).
- "may_be_slide": Configurable action (keep or delete).
- Tip: Use the visual range sliders in Settings to tune confidence thresholds.

3. PDF Export

Navigate to PDF Maker.
Select the root folder containing your extracted slides.
Choose sort order (Name/Date) and output quality (resize/compress).
Generate a single PDF document for your presentation.

🎯 How It Works

Stage 1: Change Detection (SSIM)
- Samples frames (I-frames) and calculates structural similarity.
- Significant drops in similarity signal a potential new slide.
Stage 2: Stability Verification
- Checks subsequent frames to ensure the "new slide" is stable and not just a transition effect.
Stage 3: Deduplication (pHash)
- Computes perceptual hashes of extracted images.
- Removes duplicates even if minor pixel noise exists.
Stage 4: AI Filtering (MobileNetV4)
- Classifies image content to remove non-slide elements like desktop wallpapers or empty screens.

⚙️ Technical Details

Architecture: Video Input → Hardware Decoder → SSIM Analysis → Image Output → pHash Deduplication → ML Classification → PDF Export

Performance:

SIMD: SSE4.2, AVX2, AVX512, NEON.
GPU Accel: CUDA, DirectML, Metal, OpenCL.
Inference: Core ML (macOS), CUDA/DirectML (Windows).

🔧 Building from Source

Prerequisites:

C++17 Compiler, CMake 3.16+
Qt 6 (Core, Widgets, Gui)
OpenCV 4.x, FFmpeg

Build Steps:

git clone https://github.com/bit-admin/AutoSlides-Extractor.git
cd AutoSlides-Extractor
mkdir build && cd build

# Configure
cmake ..
# Or with specific optimizations
# cmake -DENABLE_CUDA=ON ..

# Build
cmake --build . --config Release

❓ Troubleshooting

Issue	Solution
"App is damaged" on macOS	Run `sudo xattr -d com.apple.quarantine /Applications/AutoSlides\ Extractor.app` in Terminal.
GPU not used	Ensure you have the latest drivers installed. On Windows, check if CUDA is installed for NVIDIA cards.
Slides missed	Try the "Strict" SSIM preset or decrease the custom threshold (e.g., to 0.995).
Too many duplicates	Enable Post-Processing and lower the Hamming Threshold (e.g., to 8).

📄 License

This project is licensed under the MIT License.

Made with ❤️ using Qt6, OpenCV, and FFmpeg

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
.github/workflows		.github/workflows
docs		docs
installer		installer
resources		resources
src		src
.gitignore		.gitignore
CHANGELOG.md		CHANGELOG.md
CMakeLists.txt		CMakeLists.txt
LICENSE		LICENSE
README.md		README.md
README_zh-CN.md		README_zh-CN.md
resources.qrc		resources.qrc
vcpkg.json		vcpkg.json

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

AutoSlides Extractor

Overview

✨ Key Features

Detailed Features

📥 Download & Installation

Pre-built Releases

System Requirements

🚀 Quick Start

🛠️ Usage Guide

1. Main Workflow

2. Post-Processing & AI

3. PDF Export

🎯 How It Works

⚙️ Technical Details

🔧 Building from Source

❓ Troubleshooting

📄 License

About

Uh oh!

Releases 3

Packages

Languages

License

bit-admin/AutoSlides-Extractor

Folders and files

Latest commit

History

Repository files navigation

AutoSlides Extractor

Overview

✨ Key Features

Detailed Features

📥 Download & Installation

Pre-built Releases

System Requirements

🚀 Quick Start

🛠️ Usage Guide

1. Main Workflow

2. Post-Processing & AI

3. PDF Export

🎯 How It Works

⚙️ Technical Details

🔧 Building from Source

❓ Troubleshooting

📄 License

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases 3

Packages 0

Languages

Packages