# Image-Object-Localization
**Repository Path**: sdcfsdsd/Image-Object-Localization
## Basic Information
- **Project Name**: Image-Object-Localization
- **Description**: No description available
- **Primary Language**: Unknown
- **License**: Not specified
- **Default Branch**: master
- **Homepage**: None
- **GVP Project**: No
## Statistics
- **Stars**: 0
- **Forks**: 0
- **Created**: 2018-10-30
- **Last Updated**: 2024-11-24
## Categories & Tags
**Categories**: Uncategorized
**Tags**: None
## README
### Description
Given a picture with a bird, we are supposed to box the bird.
In src/data directory, ```images.txt``` is the index of all images, ```bouding_boxex.txt``` is the label box of all images and ```images``` contains all images. Box data make up of 4 data: the top left corner coordinate of box, width of box and height of box.
### Neural Network
For traditional CNN and FC, it will meet degeneration problems when layers go deep.
In paper ```Deep Residual Learning for Image Recognition```, they try to solve this problem by using a Residual Block:
These blocks compose ResNet:
I use ResNet-18 in this project by adding a 4-dimension layer after ResNet-18 to predict box's x, y ,w and h.
Loss: smooth l1 loss
Metric: IoU of groound truth and prediction, threshold=0.75
### Train
Resize all images to ```224*224*3```
Then normalize and standardize all pixel channel.
Split all data into 9000 training data and 2788 tesing data. Train network on training data using ```batch size=128```, ```epoch=100``` and ```validation split ratio=0.1```
Training result:
Testing result:
### Examples
Red box represents ground truth and green box is the prediction of network.
Failed example:
### Attention
You should keep the directory structure.
### Dependency
python 3.6
tensorflow 1.3.0
keras 2.1.0
numpy
PIL
pickle
matplotlib
### Run
In ```src``` directory:
```python getdata.py``` to preprocess data.
If you want to train model, ```python train.py```
If you want to test on trained model(if you had trained model), ```python test.py```
### Reference
Deep Residual Learning for Image Recognition: https://arxiv.org/pdf/1512.03385.pdf
### Author
CKCZZJ
### Licence
MIT