This advanced phishing detection system includes the following components.
Key Features:
1. URL Detection Tab:
- Real-time URL analysis
- Feature extraction from URLs
- Risk level visualization
- Detailed results display
- Sample URLs for testing
2. Model Training Tab:
- Multiple ML algorithm options:
  - Random Forest
  - Decision Tree
  - SVM
  - KNN
  - Gradient Boosting
  - Naive Bayes
- Synthetic data generation
- Model training with progress indication
- Performance metrics display
3. Analysis Tab:
- Model comparison
- Feature importance visualization
- Statistical analysis
- Report generation
4. Technical Features:
- Feature Extraction: Extracts 50+ features from URLs, including:
  - URL length and structure
  - Special character counts
  - Digit ratios
  - TLD analysis
  - Suspicious patterns
  - Domain characteristics
  - Phishing keywords
- Machine Learning:
  - Multiple classifier options
  - Accuracy metrics
  - Probability scores
  - Model persistence
- GUI Features:
  - Modern tabbed interface
  - Real-time feedback
  - Visualizations
  - Export capabilities
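As a rough illustration of the URL feature extraction listed under Technical Features, the sketch below computes a handful of lexical features with the standard library only. The function name `extract_url_features`, the feature names, and the keyword list are assumptions made for this example; they are not taken from `phishing_detector.py`, and the application's full 50+ feature set is not reproduced here.

```python
import re
from urllib.parse import urlparse

# Hypothetical keyword list for illustration; the application's real list is not shown.
PHISHING_KEYWORDS = ("login", "verify", "secure", "account", "update", "bank")


def extract_url_features(url: str) -> dict:
    """Return a small dictionary of lexical features for one URL."""
    # urlparse only fills in the host when a scheme is present.
    parsed = urlparse(url if "://" in url else "http://" + url)
    host = parsed.hostname or ""
    lowered = url.lower()
    digits = sum(ch.isdigit() for ch in url)
    host_is_ip = bool(re.fullmatch(r"\d{1,3}(\.\d{1,3}){3}", host))

    return {
        "url_length": len(url),
        "host_length": len(host),
        "num_dots": url.count("."),
        "num_hyphens": url.count("-"),
        "num_at_signs": url.count("@"),
        "num_special_chars": len(re.findall(r"[^A-Za-z0-9]", url)),
        "digit_ratio": digits / len(url) if url else 0.0,
        # Naive count: every label left of "domain.tld" is treated as a subdomain.
        "num_subdomains": 0 if host_is_ip else max(host.count(".") - 1, 0),
        "host_is_ip": host_is_ip,
        "uses_https": parsed.scheme == "https",
        # Naive TLD: the last dot-separated label of the host.
        "tld": "" if host_is_ip or "." not in host else host.rsplit(".", 1)[-1],
        "keyword_hits": sum(kw in lowered for kw in PHISHING_KEYWORDS),
    }


features = extract_url_features("http://secure-login.example.com/verify?id=123")
for name, value in features.items():
    print(f"{name}: {value}")
```

The TLD and subdomain logic here is deliberately simple and will misread multi-part suffixes such as `co.uk`; the `tldextract` package from the install list is designed for that case.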
To Run the Application:
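- Install required packages (tkinter is part of the Python standard library and cannot be installed with pip; on some Linux distributions it is provided by a system package such as python3-tk):
    pip install numpy pandas scikit-learn matplotlib seaborn joblib requests beautifulsoup4 tldextract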
- Run the application:
    python phishing_detector.py
Usage Instructions:
For URL Analysis:
- Enter a URL in the detection tab
- Click "Analyze URL"
- View detailed results and risk assessment
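One way a risk assessment like this could be derived from a classifier's probability score is a simple threshold mapping. The sketch below is an assumption for illustration: the function name `risk_level`, the labels, and the 0.4 and 0.7 cut-offs are hypothetical and may differ from what the application actually uses.

```python
def risk_level(phishing_probability: float) -> str:
    """Map a phishing probability in [0, 1] to a coarse risk label."""
    if not 0.0 <= phishing_probability <= 1.0:
        raise ValueError("probability must be between 0 and 1")
    # Assumed thresholds, chosen only for this example.
    if phishing_probability >= 0.7:
        return "HIGH"
    if phishing_probability >= 0.4:
        return "MEDIUM"
    return "LOW"


for p in (0.05, 0.55, 0.92):
    print(f"{p:.2f} -> {risk_level(p)}")
```

In practice the cut-offs would be tuned against validation data, since the cost of a missed phishing URL usually differs from the cost of a false alarm.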
For Model Training:
- Go to the training tab
- Generate synthetic data
- Select ML algorithms
- Train models
- Save the best model
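The training steps above can be sketched with scikit-learn and joblib roughly as follows. This is a minimal sketch under stated assumptions: `make_classification` stands in for the application's own synthetic data generator, accuracy on a held-out split is assumed as the selection criterion, and the file name `best_model.joblib` is hypothetical.

```python
import joblib
from sklearn.datasets import make_classification
from sklearn.ensemble import GradientBoostingClassifier, RandomForestClassifier
from sklearn.metrics import accuracy_score
from sklearn.model_selection import train_test_split
from sklearn.naive_bayes import GaussianNB
from sklearn.neighbors import KNeighborsClassifier
from sklearn.svm import SVC
from sklearn.tree import DecisionTreeClassifier

# Synthetic stand-in for URL feature vectors (label 1 = phishing, 0 = legitimate).
X, y = make_classification(
    n_samples=600, n_features=20, n_informative=8, random_state=42
)
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.25, random_state=42, stratify=y
)

# The six algorithm options listed for the training tab.
models = {
    "Random Forest": RandomForestClassifier(n_estimators=100, random_state=42),
    "Decision Tree": DecisionTreeClassifier(random_state=42),
    "SVM": SVC(probability=True, random_state=42),
    "KNN": KNeighborsClassifier(n_neighbors=5),
    "Gradient Boosting": GradientBoostingClassifier(random_state=42),
    "Naive Bayes": GaussianNB(),
}

# Train each model and record its accuracy on the held-out split.
scores = {}
for name, model in models.items():
    model.fit(X_train, y_train)
    scores[name] = accuracy_score(y_test, model.predict(X_test))
    print(f"{name}: accuracy = {scores[name]:.3f}")

# Persist the best-scoring model and reload it to confirm the round trip.
best_name = max(scores, key=scores.get)
joblib.dump(models[best_name], "best_model.joblib")  # hypothetical file name
restored = joblib.load("best_model.joblib")
print(f"Saved and reloaded: {best_name}")
```

With real URL features, SVM and KNN generally benefit from feature scaling (for example a `StandardScaler` inside a `Pipeline`), which is omitted here to keep the sketch short.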
For Analysis:
- View model performance comparison
- Generate visualizations
- Create detailed reports
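As an illustration of the kind of output the analysis steps might produce, the sketch below ranks feature importances from a random forest and formats them as a plain-text report. The data is synthetic, the feature names are placeholders, and the report layout is an assumption; the application's actual report format is not shown in this document.

```python
import numpy as np
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier

# Synthetic data; shuffle=False keeps the informative features in the first columns.
X, y = make_classification(
    n_samples=500,
    n_features=10,
    n_informative=4,
    n_redundant=0,
    shuffle=False,
    random_state=0,
)
feature_names = [f"feature_{i}" for i in range(X.shape[1])]  # placeholder names

forest = RandomForestClassifier(n_estimators=200, random_state=0).fit(X, y)
importances = forest.feature_importances_

# Sort feature indices from most to least important.
order = np.argsort(importances)[::-1]

report_lines = ["Feature importance report", "-" * 25]
for rank, idx in enumerate(order, start=1):
    report_lines.append(f"{rank:2d}. {feature_names[idx]:<12} {importances[idx]:.3f}")
report = "\n".join(report_lines)
print(report)
```

The same ranked values could be passed to a matplotlib bar chart for the visualization step.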
Future Enhancements:
- Real WHOIS integration
- SSL certificate validation
- Real-time blacklist checking
- Website screenshot capture
- JavaScript analysis
- Database integration for storing results
- API endpoints for integration
- Multi-language support