tensorflow的图片数据集创建和读取

J_sun

浏览: 20572 次
性别:
来自: 浙江

最近访客更多访客>>

my9797363

fulinweiyang

xxf5131421

george645

博主相关

博客

微博

相册

留言

关于我

文章分类

社区版块

存档分类

博客分类：

tensorflow

tensorflow

TensorFlow是Google开源的深度学习框架。可以通过大量打过标签的数据的feed，来生成对同类事物的识别做用。当然数据量少的话就只能当一个分类器了。

TensorFlow 也是一个熊弟介绍给我的，当时说好了一起搞这东西，但是最后只剩我一个人在摸索了(有老婆的人，嗯～还是坑啊)

# -*- coding: utf-8 -*-
import os
import tensorflow as tf
from PIL import Image
import numpy as np

class_path="/home/images/"
writer = tf.python_io.TFRecordWriter("anm_pic_train.tfrecords")

count = 0
dic={}

def sort(key):
    global count
    if key in dic:
        return dic[key]
    else:
        dic[key]=count
        count = count +1
        return count-1

def text2vec(text):
    result = np.zeros(3, dtype=np.int)
    if text < 10 :
        result[0]=1
    if (text >= 10) and (text < 30):
        result[1]=1
    if text >= 30:
        result[2]=1
    return result

for file in os.listdir(class_path):
    if file.endswith(".jpg"):
        file_path = class_path + file
#       print(file_path)
        img = Image.open(file_path)
        # keng
        if img.mode == 'RGB':
            #处理图片的大小
            img = img.resize((240, 320))
            img_raw = img.tobytes()
            example = tf.train.Example(features=tf.train.Features(feature={
                #图片对应单个结果
                "label": tf.train.Feature(int64_list=tf.train.Int64List(value=[sort(file[:file.rindex("_")])])),
                # 图片对应多个结果
                #"label": tf.train.Feature(int64_list=tf.train.Int64List(value=text2vec(xx))),
                'img_raw': tf.train.Feature(bytes_list=tf.train.BytesList(value=[img_raw]))
            }))
            writer.write(example.SerializeToString())
writer.close()
print('dic:',dic,'  length:', len(dic))


# 读取tf
def read_and_decode(filename):
    filename_queue = tf.train.string_input_producer([filename])
    reader = tf.TFRecordReader()
    _, serialized_example = reader.read(filename_queue)
    features = tf.parse_single_example(serialized_example,
       features={
           # 单结果的 label 返回是int
           'label': tf.FixedLenFeature([], tf.int64),
           # 数组返回， [3] 输入的数组的长度一样
           # 'label': tf.FixedLenFeature([3], tf.int64),
           'img_raw' : tf.FixedLenFeature([], tf.string),
       })
    img = tf.decode_raw(features['img_raw'], tf.uint8)
    img = tf.reshape(img, [240, 320, 3])
    # normalize
    img = tf.cast(img, tf.float32) * (1. / 255) - 0.5
    label = features['label']
    return img, label

img, label = read_and_decode("anm_pic_train.tfrecords")
img_batch, label_batch = tf.train.shuffle_batch([img, label],batch_size=10, capacity=2000, min_after_dequeue=1000)
init = tf.global_variables_initializer()

with tf.Session() as sess:

    sess.run(init)
    coord = tf.train.Coordinator()
    threads = tf.train.start_queue_runners(coord=coord, sess=sess)
    for i in range(10):
        val, l = sess.run([img_batch, label_batch])
        print(l)
    print ("complete ...")
    coord.request_stop()
    coord.join(threads)
    sess.close()

以上是 tensorflow 本地图片做数据集和队列读取的代码。其中RGB图片是深坑之一(网上很多文章代码都一样，但是偏偏我用了就报错)，当下载下来大量图片之后，有些图片的位深是不一样的，有24，32等等，其中24的rgb。

在 tf.reshape的时候是指定了3通道的图片即24位深的图片，如果是32位深的图片就会在这里报错，但是坑爹的是在做数据集的时候并不会报错。