I made an image classification model and tried running it on mobile


This article shows how to create a classification model from your own image dataset and run it in real time on an iOS or Android camera.


- Google Colaboratory (runtime: GPU, TensorFlow 1.15), Google Chrome

# 1. Create your own dataset and model

In this article, we will use retrain.py to create an image classification model [^1].

[^1]: retrain.py is being superseded by make_image_classifier. With make_image_classifier you can train and convert to tflite in one step, and it seems you do not need to change 224 to 299 in the Swift code.

retrain.py is all you need to create a model, so the environment can be set up with a single curl command. You don't have to git clone.

To download only retrain.py:

curl -LO https://github.com/tensorflow/hub/raw/master/examples/image_retraining/retrain.py

## 1.1 Collect image data to create your own dataset

Image data can be collected relatively easily with scraping and image collection tools. I used google-images-download to collect image data [^2].

[^2]: As of March 7, 2020, google-images-download does not work in some environments. The cause is believed to be a change in the Google search algorithm.

Many articles already explain [how to use google-images-download](https://www.google.com/search?sxsrf=ALeKk02U-SqEjAhMNjmpl4-sUbwkSaevTQ:1583514716818&q=google_images_download&spell=1&sa=X&ved=2ahUKEwig7MSBrIboAhW), so I will omit it here.

To create a model using retrain.py, arrange the directory structure as follows: one subdirectory per label, each containing the images for that label.

dataset
 |-- label_A
 |     └─ aaa.jpg
 |     └─ bbb.png
 |     └─ ccc.jpg
 |       ⋮
 |-- label_B
 |     └─ ddd.png
 |     └─ eee.jpg
 |     └─ fff.png
   ⋮       ⋮
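
Before training, it can help to confirm that the directory really has this shape. The following is a minimal sketch, not part of retrain.py: the function name `count_images_per_label` and the set of accepted extensions are assumptions made for illustration.

```python
import tempfile
from pathlib import Path

# Assumed set of image extensions; adjust it to your own data.
IMAGE_EXTENSIONS = {".jpg", ".jpeg", ".png"}


def count_images_per_label(image_dir):
    """Return {label_directory_name: number_of_image_files}."""
    counts = {}
    for label_dir in sorted(Path(image_dir).iterdir()):
        if not label_dir.is_dir():
            continue  # files placed directly under image_dir belong to no label
        counts[label_dir.name] = sum(
            1
            for f in label_dir.iterdir()
            if f.is_file() and f.suffix.lower() in IMAGE_EXTENSIONS
        )
    return counts


# Build a tiny throwaway dataset with empty files just to show the layout.
with tempfile.TemporaryDirectory() as root:
    layout = {"label_A": ["aaa.jpg", "bbb.png"], "label_B": ["ddd.png"]}
    for label, names in layout.items():
        (Path(root) / label).mkdir()
        for name in names:
            (Path(root) / label / name).touch()
    print(count_images_per_label(root))
```

A label with zero or very few images usually points to a failed download or a misplaced folder, which is easier to catch here than after training.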

## 1.2 Modeling
 After preparing retrain.py and image data, we will actually train and create a model.
 When creating a model using retrain.py, it is necessary to specify the data set, so specify it after `--image_dir`.

python retrain.py --image_dir dataset
 Other arguments let you specify, for example, the output destination of the model and the number of training steps [^3].
 [^3]: If you pass `--tfhub_module https://tfhub.dev/google/imagenet/mobilenet_v2_100_224/feature_vector/1`, the model is built on MobileNet. MobileNet is a relatively lightweight model designed for running machine learning results on mobile devices.
 If you run it without specifying the output destination, **output_graph.pb** and **output_labels.txt** are written to **/tmp**.
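
As an illustration of these arguments, here is a small sketch that assembles a retrain.py command line. The flags `--output_graph`, `--output_labels`, `--how_many_training_steps` and `--tfhub_module` are retrain.py options; the helper name `build_retrain_command` and the example paths and step count are assumptions.

```python
def build_retrain_command(image_dir, output_graph=None, output_labels=None,
                          training_steps=None, tfhub_module=None):
    """Assemble the argument list for retrain.py; unset options are omitted."""
    cmd = ["python", "retrain.py", "--image_dir", image_dir]
    if output_graph is not None:
        cmd += ["--output_graph", output_graph]
    if output_labels is not None:
        cmd += ["--output_labels", output_labels]
    if training_steps is not None:
        cmd += ["--how_many_training_steps", str(training_steps)]
    if tfhub_module is not None:
        cmd += ["--tfhub_module", tfhub_module]
    return cmd


# Hypothetical output paths and step count, chosen only for the example.
command = build_retrain_command(
    "dataset",
    output_graph="./model/output_graph.pb",
    output_labels="./model/output_labels.txt",
    training_steps=1000,
)
print(" ".join(command))
# To actually run it: subprocess.run(command, check=True)
```

Note that the rest of this article assumes the default location under /tmp, so adjust the later paths if you change the output destination.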

## 1.3 (Bonus) Run inference with the model
 You can check the inference result of the model using [label_image.py](https://github.com/tensorflow/tensorflow/blob/master/tensorflow/examples/label_image/label_image.py).
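
Roughly speaking, the result you see is the list of labels ranked by score. The sketch below reproduces only that ranking step in plain Python, so it needs no model; the function name `top_k_labels` and the sample scores and labels are made up for illustration and are not taken from label_image.py.

```python
def top_k_labels(scores, labels, k=5):
    """Pair each score with its label and return the k highest, best first."""
    ranked = sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)
    return [(labels[i], scores[i]) for i in ranked[:k]]


# Made-up scores for three classes, standing in for the model output.
sample_labels = ["label a", "label b", "label c"]
sample_scores = [0.10, 0.85, 0.05]

for name, score in top_k_labels(sample_scores, sample_labels, k=2):
    print(name, score)
```

The order of `labels` has to match the order of lines in output_labels.txt, because the model output is indexed by position only.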

# 2. Convert the created model to tflite format
 Convert the generated **output_graph.pb** file to tflite (TensorFlow Lite) format.
## 2.1 Conversion for iOS
 Since the iOS sample uses a quantized model, specify `QUANTIZED_UINT8` for `--inference_type` and `--inference_input_type`.

tflite_convert \
  --graph_def_file=/tmp/output_graph.pb \
  --output_file=./quant_graph.tflite \
  --input_format=TENSORFLOW_GRAPHDEF \
  --output_format=TFLITE \
  --input_shape=1,299,299,3 \
  --input_array=Placeholder \
  --output_array=final_result \
  --input_data_type=FLOAT \
  --default_ranges_min=0  \
  --default_ranges_max=6  \
  --inference_type=QUANTIZED_UINT8  \
  --inference_input_type=QUANTIZED_UINT8  \
  --mean_values=128 \
  --std_dev_values=128
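
To make `--mean_values=128` and `--std_dev_values=128` less abstract: as far as I understand the TensorFlow Lite converter documentation, a quantized input value q corresponds to the real value (q - mean) / std_dev. The sketch below works through that arithmetic; the function names are mine, and the rounding and clamping in `quantize` are an assumption about how one would invert the mapping.

```python
def dequantize(q, mean=128.0, std_dev=128.0):
    """Map a uint8 input value to the real value the float graph would see."""
    return (q - mean) / std_dev


def quantize(real, mean=128.0, std_dev=128.0):
    """Inverse mapping, rounded and clamped to the uint8 range 0..255."""
    q = round(real * std_dev + mean)
    return max(0, min(255, q))


for q in (0, 128, 255):
    print(q, "->", dequantize(q))
```

With these settings the camera pixel range 0..255 is interpreted as roughly -1 to 1.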

## 2.2 Conversion for Android
 On my Android device, the GPU did not support the quantized model, so I use a FLOAT model instead. Specify `FLOAT` for `--inference_type` and `--inference_input_type`, and change `--output_file` to `float_graph.tflite`.

tflite_convert \
  --graph_def_file=/tmp/output_graph.pb \
  --output_file=./float_graph.tflite \
  --input_shape=1,299,299,3 \
  --input_array=Placeholder \
  --output_array=final_result \
  --inference_type=FLOAT \
  --inference_input_type=FLOAT

## 2.3 Problems when converting to tflite
 - The command to convert to tflite differs depending on the version of TensorFlow.
 - A model created with TensorFlow 1.x could not be converted with the TensorFlow 2.x script.
 - I didn't know what to specify for `--input_array` or `--output_array`.

 To find out what to specify for `--input_array` and `--output_array`, create the following script.

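import tensorflow as tf

# Load the frozen graph written by retrain.py (TensorFlow 1.x API).
gf = tf.GraphDef()
with open('/tmp/output_graph.pb', 'rb') as m_file:
    gf.ParseFromString(m_file.read())

# Write every node name to a text file, one per line.
with open('somefile.txt', 'w') as the_file:
    for n in gf.node:
        the_file.write(n.name + '\n')

# Treat the first node as the input and the last node as the output.
with open('somefile.txt', 'r') as file:
    data = file.readlines()

print("Output name = ")
print(data[-1])

print("Input name = ")
print(data[0])
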
 The execution result looks like this.

Output name = 
final_result

Input name = 
Placeholder

# 3. Try it on mobile
 Use the source code found in [tensorflow/examples](https://github.com/tensorflow/examples).

git clone https://github.com/tensorflow/examples.git

## 3.1 iOS
 1. Open the project according to [README.md](https://github.com/tensorflow/examples/tree/master/lite/examples/image_classification/ios)
 2. Select ImageClassification/ImageClassification/Model, ⌘ + click, and choose `Add Files to "ImageClassification"...` to add **quant_graph.tflite** and **output_labels.txt**
 3. Edit ImageClassification/ImageClassification/ModelDataHandler/ModelDataHandler.swift as described in steps 4 to 7
 4. Change "mobilenet_quant_v1_224" on line 37 to **"quant_graph"**
 5. Change "labels" on line 38 to **"output_labels"**
 6. Change the value of inputWidth on line 58 to **299**
 7. Change the value of inputHeight on line 59 to **299**

#### **`ModelDataHandler.swift`**

  enum MobileNet {
    static let modelInfo: FileInfo = (name: "quant_graph", extension: "tflite")
    static let labelsInfo: FileInfo = (name: "output_labels", extension: "txt")
  }


  // MARK: - Model Parameters
  let batchSize = 1
  let inputChannels = 3
  let inputWidth = 299
  let inputHeight = 299
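
The reason for changing 224 to 299 is that these values have to agree with the `--input_shape=1,299,299,3` used at conversion time; otherwise the app would pass the model an input buffer of the wrong size. A minimal sketch of that arithmetic follows. The helper name `input_buffer_bytes` is hypothetical, and the per-element sizes are simply 1 byte for uint8 and 4 bytes for float32.

```python
BYTES_PER_ELEMENT = {"uint8": 1, "float32": 4}


def input_buffer_bytes(batch_size, width, height, channels, dtype):
    """Number of bytes the model expects for one input tensor."""
    return batch_size * width * height * channels * BYTES_PER_ELEMENT[dtype]


# Quantized model from section 2.1 versus the sample's original 224 x 224 input.
print(input_buffer_bytes(1, 299, 299, 3, "uint8"))
print(input_buffer_bytes(1, 224, 224, 3, "uint8"))
```

The same calculation with `"float32"` gives the size for the FLOAT model used on Android, which is four times larger.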

## 3.2 Android
 1. Open `\examples\lite\examples\image_classification\android` in Android Studio
 2. Place **float_graph.tflite** and **output_labels.txt** in `\app\src\main\assets`
 3. Edit `app\src\main\java\org\tensorflow\lite\examples\classification\tflite\ClassifierFloatMobileNet.java` as described in steps 4 and 5
 4. Change "mobilenet_v1_1.0_224.tflite" on line 55 to **"float_graph.tflite"**
 5. Change "labels.txt" on line 60 to **"output_labels.txt"**

  protected String getModelPath() {
    // you can download this file from
    // see build.gradle for where to obtain this file. It should be auto
    // downloaded into assets.
    return "float_graph.tflite";
  }

  protected String getLabelPath() {
    return "output_labels.txt";
  }
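
Because the Java code refers to the asset files by name, a typo or a file left in the wrong folder only shows up when the app starts. As a quick check before building, something like the following sketch could be run from the project directory; the function name `missing_assets` is an assumption, and the demo uses a temporary folder in place of the real assets directory.

```python
import tempfile
from pathlib import Path

# File names returned by getModelPath() and getLabelPath() above.
REQUIRED_ASSETS = ("float_graph.tflite", "output_labels.txt")


def missing_assets(assets_dir, required=REQUIRED_ASSETS):
    """Return the required file names that are not present in assets_dir."""
    assets = Path(assets_dir)
    return [name for name in required if not (assets / name).is_file()]


# Demo on a temporary folder standing in for app/src/main/assets.
with tempfile.TemporaryDirectory() as assets_dir:
    (Path(assets_dir) / "float_graph.tflite").touch()
    print(missing_assets(assets_dir))
```

An empty list means both files are where the app will look for them.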

