Product Updates & Changes

Stay up to date with the latest improvements, new features, and bug fixes in WaterCrawl. We're constantly evolving to serve you better.

v0.3.3

Changed

  • Updated user models, serializers, and services
  • Modified common serializers and services
  • Updated project settings and version
  • Updated Privacy Policy

Added

  • New user migration for newsletter and privacy confirmation
v0.3.2

Added

  • Automated daily page credit reset for active subscriptions
  • Celery beat schedule for running daily tasks
v0.3.1

Changed

  • Removed custom billing cycle anchor from Stripe checkout to fix subscription timing issues
  • Simplified Stripe checkout session configuration for better compatibility
  • Fixed Stripe webhook handling for default plan subscriptions
v0.3.0

Added

  • Team plan management system with Stripe integration
  • Usage tracking and plan-based limits
  • Enterprise mode configuration
  • Resend email verification endpoint
  • Comprehensive admin panel for User, Team, TeamMember, and API key management
  • Unique email constraint with case-insensitive lookups
  • Plan-based crawl request validation
  • Usage history tracking
  • Stripe webhook handling for subscription management

Changed

  • Enhanced user authentication system
  • Improved email templates (removed emojis, simplified text)
  • Added translations for user-facing messages
  • Better spider include/exclude paths handling
  • Updated admin interface with improved filters and search
  • Enhanced team member management
  • Improved error handling and validation messages

Infrastructure

  • Spider options handling for include/exclude paths
  • Email verification and authentication issues
  • Case-sensitive email lookup issues
  • Team member invitation process
  • Added unique email constraint
  • Improved API key management
  • Enhanced authentication validation
v0.2.1

Added

  • New TeamSchemaView for dedicated team API documentation
  • Custom schema generator for filtering team-specific endpoints
  • Separate documentation files for better organization
  • Tags for all API endpoints (Auth, Profile, Team, API Key, Crawl Requests, Crawl Results, Reports, Plugins, Common)
  • Token refresh and verify views with proper documentation
  • Theme customization for documentation UI

Changed

  • Moved API documentation strings to dedicated files
  • Improved endpoint descriptions and documentation structure
  • Updated API key authentication to track last used timestamp
  • Enhanced schema security configuration
  • Reorganized URL patterns for better documentation access
  • Updated settings for better API documentation title and description
  • Changed "Get the current team" to "Get/Update the current team" for clarity

Infrastructure

  • API key authentication display in ReDoc
  • Documentation organization and endpoint grouping
  • Swagger UI and ReDoc configuration
  • Unused imports and code cleanup
v0.2.0

Added

  • Integrated Playwright for dynamic page rendering and JavaScript execution
  • Support for PDF and Screenshot attachments for crawl results
  • Advanced page interaction options (wait time, cookie acceptance, locale settings)
  • Improved Docker build process with multi-platform support
  • Added API version endpoint
  • Extended crawler options with timeout, cookies, locale, and headers support
  • Duration tracking for crawl requests
  • Support for longer URLs (up to 2048 characters)

Changed

  • Enhanced page rendering with Playwright middleware
  • Improved JavaScript handling and dynamic content extraction
  • Enhanced Docker workflow with better caching and versioning
  • Improved domain handling in spider options
  • Updated concurrent request settings
  • Better organization of crawler constants and types

Infrastructure

  • Added multi-platform Docker builds (linux/amd64, linux/arm64)
  • Improved Docker caching and build optimization
  • Added version tracking in Docker builds
v0.1.1

Changed

  • Fixed dependency versions in requirements.txt
v0.1.0

Added

  • Plugin service for managing crawl plugins
  • Plugin schema API endpoint
  • Active plugins loader utility
  • Dynamic middleware and pipeline loading for plugins

Changed

  • Updated watercrawl-plugin to version 0.1.0
  • Updated watercrawl-openai to version 0.1.0
  • Improved spider middleware configuration
  • Enhanced plugin system architecture
v0.0.2

Added

  • Email service for sending templated emails
  • Frontend settings service for managing UI configurations
  • OAuth integration with GitHub and Google
  • Password reset functionality with email notifications
  • Email verification system
  • Team invitation system improvements
  • New user templates for email communications

Changed

  • Enhanced user authentication system
  • Improved team management functionality
  • Updated permission system with login and signup controls
  • Modified team invitation workflow
  • Restructured common services and views

Dependencies

  • Added html2text for email template processing
v0.0.1

Features

  • Initial release of WaterCrawl
  • Core web crawling functionality using Scrapy (v2.12.0)
  • Django-based web application (v5.1.4)
  • REST API using Django REST Framework (v3.15.2)
  • Asynchronous task processing with Celery (v5.4.0)
  • Redis integration for task queue management
  • MinIO integration for file storage
  • User authentication and authorization system
  • OpenAI integration capabilities

Infrastructure

  • Docker support with multi-container setup
  • Swagger/OpenAPI documentation using drf-spectacular
  • Scalable architecture with separate services
  • Database integration with PostgreSQL
  • File storage system using MinIO
  • Celery beat for scheduled tasks

Dependencies

  • Python 3.11+
  • Django 5.1.4
  • Scrapy 2.12.0
  • Celery 5.4.0
  • Redis (latest)
  • PostgreSQL 17.2
  • Nginx
  • GunicornWSGI server
  • MinIO (optional, can use S3 or local storage)