photos

mmainguy/photos


Commit Graph


Author: Michael Mainguy

SHA1: 85c1479d94

Add comprehensive AI-powered photo analysis with dual-model classification

## Features Added:
- **Dual Model Classification**: ViT (objects) + CLIP (style/artistic concepts)
- **Image Captioning**: BLIP model for detailed photo descriptions
- **Auto-tagging**: Process all photos with configurable confidence thresholds
- **Tag Management**: Clear-all-tags action guarded by confirmation prompts
- **Comprehensive Analysis**: 15-25+ tags per image covering objects, style, mood, lighting
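
One way to picture the dual-model tagging step is as a merge of two prediction lists behind a confidence threshold. The sketch below is not taken from this commit: `merge_tags`, `min_confidence`, `max_tags`, and the sample predictions are all hypothetical, and it assumes each model yields `(label, confidence)` pairs.

```python
from typing import Dict, List, Tuple

Prediction = Tuple[str, float]  # (label, confidence in [0, 1])


def merge_tags(
    object_preds: List[Prediction],
    style_preds: List[Prediction],
    min_confidence: float = 0.3,  # hypothetical default, not from the commit
    max_tags: int = 25,
) -> List[Prediction]:
    """Combine predictions from two classifiers into one ranked tag list."""
    best: Dict[str, float] = {}
    for label, score in list(object_preds) + list(style_preds):
        key = label.strip().lower()  # treat "Golden Retriever" and "golden retriever" as one tag
        # Keep a tag only if it clears the threshold, and keep its highest score.
        if score >= min_confidence and score > best.get(key, 0.0):
            best[key] = score
    # Highest confidence first; ties broken alphabetically for stable output.
    ranked = sorted(best.items(), key=lambda item: (-item[1], item[0]))
    return ranked[:max_tags]


# Made-up predictions standing in for ViT (objects) and CLIP (style) output.
vit = [("golden retriever", 0.91), ("tennis ball", 0.42), ("park bench", 0.08)]
clip = [("outdoor photography", 0.77), ("soft lighting", 0.35), ("Golden Retriever", 0.50)]
tags = merge_tags(vit, clip)
```

Raising `min_confidence` trades tag count for precision, which is presumably the kind of knob the configurable thresholds expose.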

## New API Endpoints:
- `/api/classify/batch` - Batch classification with comprehensive mode
- `/api/classify/comprehensive` - Dual-model analysis for maximum tags
- `/api/classify/config` - Tunable classifier parameters
- `/api/caption/batch` - Batch image captioning
- `/api/tags/clear` - Clear all tags with safety checks

## UI Enhancements:
- Auto-tag All button (processes 5 photos at a time)
- Caption All button (processes 3 photos at a time)
- Clear All Tags button with confirmation dialogs
- Real-time progress bars for batch operations
- Tag pills displayed on thumbnails and image modal
- AI-generated captions shown in image modal
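
The fixed-size batches and progress bars described above follow a common pattern: split the work into chunks, process one chunk, then report how far along the run is. The sketch below illustrates that pattern only. `chunked`, `run_in_batches`, and the callbacks are hypothetical names, and the real UI code is not shown in this commit message.

```python
from typing import Callable, Iterator, List, Sequence, Tuple


def chunked(items: Sequence[str], size: int) -> Iterator[Sequence[str]]:
    """Yield consecutive slices of at most `size` items."""
    if size < 1:
        raise ValueError("size must be at least 1")
    for start in range(0, len(items), size):
        yield items[start:start + size]


def run_in_batches(
    items: Sequence[str],
    size: int,
    process_batch: Callable[[Sequence[str]], None],
    on_progress: Callable[[int, int], None],
) -> int:
    """Process items batch by batch, reporting (done, total) after each batch."""
    done = 0
    for batch in chunked(items, size):
        process_batch(batch)  # e.g. one request to a batch endpoint
        done += len(batch)
        on_progress(done, len(items))  # e.g. update a progress bar
    return done


# Twelve placeholder photos processed five at a time.
photos = [f"photo_{i}.jpg" for i in range(12)]
progress: List[Tuple[int, int]] = []
processed = run_in_batches(photos, 5, lambda batch: None, lambda d, t: progress.append((d, t)))
```

Keeping the batch size small bounds how many images are in flight at once, at the cost of more round trips.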

## Performance Optimizations:
- Uses cached thumbnails for 10-100x faster processing
- Parallel model initialization and processing
- Graceful fallback to original files when thumbnails fail
- Configurable batch sizes to prevent memory issues
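
The thumbnail-first strategy with fallback can be sketched as a small path-selection helper. This is an illustration under assumptions, not the commit's code: `pick_image_path`, the `thumbs` directory, and the `<stem>.jpg` naming scheme are all hypothetical.

```python
import tempfile
from pathlib import Path


def pick_image_path(original: Path, thumb_dir: Path) -> Path:
    """Prefer a cached thumbnail; fall back to the original file if it is unusable."""
    candidate = thumb_dir / f"{original.stem}.jpg"  # assumed naming scheme
    try:
        # A missing or empty thumbnail counts as a failure.
        if candidate.is_file() and candidate.stat().st_size > 0:
            return candidate
    except OSError:
        pass  # unreadable thumbnail: fall through to the original
    return original


# Demo in a temporary directory: one photo has a thumbnail, one does not.
with tempfile.TemporaryDirectory() as tmp:
    root = Path(tmp)
    thumbs = root / "thumbs"
    thumbs.mkdir()
    (thumbs / "beach.jpg").write_bytes(b"placeholder-bytes")
    used_thumbnail = pick_image_path(root / "beach.png", thumbs) == thumbs / "beach.jpg"
    used_original = pick_image_path(root / "forest.png", thumbs) == root / "forest.png"
```

Classifiers typically downscale their input anyway, so reading a small cached file instead of a full-resolution original is where most of the speedup would come from.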

## Technical Implementation:
- Vision Transformer (ViT) for ImageNet object classification (1,000 classes in the standard ImageNet-1k label set)
- CLIP for zero-shot artistic/style classification (photography, lighting, mood)
- BLIP for natural language image descriptions
- Comprehensive safety checks and error handling
- Database integration for persistent tag and caption storage
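
Zero-shot classification in the CLIP style scores an image against free-text labels by comparing embeddings, with no label-specific training. The sketch below shows only the scoring arithmetic, using toy 2-D vectors in place of real embeddings. `zero_shot_scores` is a hypothetical helper, and the scale of 100 is an assumption based on the value commonly cited for CLIP.

```python
import math
from typing import Dict, Sequence


def cosine(a: Sequence[float], b: Sequence[float]) -> float:
    """Cosine similarity between two equal-length vectors."""
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        raise ValueError("cannot compare a zero vector")
    return sum(x * y for x, y in zip(a, b)) / (norm_a * norm_b)


def zero_shot_scores(
    image_vec: Sequence[float],
    label_vecs: Dict[str, Sequence[float]],
    scale: float = 100.0,  # assumed logit scale
) -> Dict[str, float]:
    """Softmax over scaled cosine similarities between one image and each label."""
    logits = {label: scale * cosine(image_vec, vec) for label, vec in label_vecs.items()}
    peak = max(logits.values())  # subtract the max for numerical stability
    exps = {label: math.exp(value - peak) for label, value in logits.items()}
    total = sum(exps.values())
    return {label: value / total for label, value in exps.items()}


# Toy embeddings: the image points almost the same way as "golden hour".
scores = zero_shot_scores(
    [1.0, 0.0],
    {"golden hour": [0.9, 0.1], "studio lighting": [0.0, 1.0]},
)
best_label = max(scores, key=scores.get)
```

Because the labels are ordinary text, extending the style vocabulary (photography type, lighting, mood) would only require embedding new prompts, not retraining a model.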

🤖 Generated with [Claude Code](https://claude.ai/code)

Co-Authored-By: Claude <noreply@anthropic.com>

Date: 2025-08-27 17:05:54 -05:00

1 Commit