使用Python PIL实现隐写术算法

3

我正在尝试使用Python PIL编写基本的黑白图像隐写算法。

使用示例图像,我可以成功地提取其中隐藏的图像,并隐藏其他图像以随后提取它们。问题在于隐藏文本,然后提取它。

代码如下:

from PIL import Image
import matplotlib.pyplot as plt
import scipy.misc as sci
import numpy as np
import array


#CONVERTS IMAGE TO ARRAY OF BINARY 8-BIT NUMBER#
def getImgArray(image):
    w,h = image.size
    out = []
    for x in range(w):
        for y in range(h):
            pixel = image.getpixel((y,x))
            pixel = format(pixel, '08b')
            out.append(pixel)
    return out

def stringByteConverter(data, mode):
    if (mode == "stringToByte"):
        aux = map(ord,data.encode('utf8'))
        aux = [format(char,'08b') for char in aux]
        return aux
    elif (mode == "byteToString"):
        aux = [int(item,2) for item in data]
        aux = "".join(map(chr, aux))
        return aux
    else:
        print("Invalid mode. Use 'stringToByte' or 'byteToString'")

#GETS HIDDEN IMAGE AND RETURNS IT AS BYTE ARRAY REPRESENTING PIXELS#
def getHiddenImage(image):
    buf = ""
    width,height = image.size
    img_aux = []
    for x in range(width):
        for y in range(height):
            if(len(buf)<8):
                pixel = image.getpixel((y,x))
                pixel = format(pixel,'08b')
                buf += pixel[-2:]
            else:
                img_aux.append(buf)
                buf = ""
                pixel = image.getpixel((y,x))
                pixel = format(pixel,'08b')
                buf += pixel[-2:]
    return img_aux

#CONVERT ARRAY OF BYTES TO PNG IMG AND RETURNS PIL IMG OBJECT#
def saveImgArr(ImgArr, size, outputName):
    pixels = np.empty(size)
    iterator = 0
    for i in range(size[0]):
        for j in range(size[1]):
            try:
                pixels[i][j] = int(ImgArr[iterator],2)
                iterator += 1
            except IndexError:
                break

    aux = Image.fromarray(pixels)
    aux = aux.convert("L")
    aux.save(outputName+'.png', 'PNG')
    return pixels

#HIDE IMAGE <src> IN OTHER IMAGE <img>#
def hideImg(src, img, output):
    iterator = 0
    src = src.convert("L")
    srcArr = getImgArray(src)
    imgArr = getImgArray(img)


    for i in range(len(srcArr)):
        buf = []
        buf.append(srcArr[i][:2])
        buf.append(srcArr[i][2:4])
        buf.append(srcArr[i][4:6])
        buf.append(srcArr[i][6:])
        for j in range(4):
            imgArr[iterator] = imgArr[iterator][:-2] + buf[j]
            iterator += 1
    saveImgArr(imgArr,img.size,output)


#HIDE STRING INSIDE IMG#
def hideText(img, string, outputName):
    imgArr = getImgArray(img)
    stringBytes = stringByteConverter(string, "stringToByte")
    iterator = 0
    for i in range(len(string)):
        buf = []
        buf.append(stringBytes[i][:2])
        buf.append(stringBytes[i][2:4])
        buf.append(stringBytes[i][4:6])
        buf.append(stringBytes[i][6:])
        for j in range(4):
            imgArr[iterator] = imgArr[iterator][:-2] + '00'
            imgArr[iterator] = imgArr[iterator][:-2] + buf[j]
            iterator += 1

    print(imgArr[:len(string)*4]) #test print

    saveImgArr(imgArr,img.size,outputName) 

    temp = Image.open(outputName+'.png')
    tempArr = getImgArray(temp)

    print(tempArr[:len(string)*4]) #test print

def getHiddenText(img, msgSize):
    buf = ''
    width,height = img.size
    output = []
    counter = 0
    for x in range(width):
        for y in range(height):
                if(counter < msgSize*4):
                    pixel = img.getpixel((y,x))
                    pixel = format(pixel,'08b')
                    buf += pixel[-2:]
                    counter += 1

    output = stringByteConverter(buf, "byteToString")
    return output

通过在hideText()函数中打印数据数组,我得到了以下结果:
hideText(lena,'test',"lena_hidden_text")

['10100001','10100011','10100001','10100000','10100001', '10011110','10100001','10100001','10100101','10100011', '10100000','10011111','10011001','10100011','10011101', '10011000']

['10011110','10100000','10011110','10011101','10011110', '10011011','10011110','10011110','10100011','10100000', '10011101','10011100','10010101','10100000','10011001', '10010100']

hideText()调用得到的第一个向量完好无损,但是在使用saveImgArr()保存图像并使用getImgArr()重新加载后,返回了第二个向量,它是完全不同的。

我实在找不出问题所在。奇怪的是,使用图像提取隐藏数据或隐藏数据,这两个函数都能正常工作。

我只能猜测我以某种方式错误地处理了文本字节。任何见解都将不胜感激。

1个回答

2

有一个看起来可疑的地方在saveImgArr函数中:

aux = Image.fromarray(pixels)
aux = aux.convert("L")

默认情况下,Image.fromarray 使用的模式是从输入数据类型推导出来的。
在您的情况中,输入的数据类型为numpy的默认数据类型(浮点数),因此将基于浮点数构建图像。因此,我预计保存的png图像看起来不正确(每个像素都会饱和到1.0,只显示一个空白图像)。
要更正这个问题,您可以通过提供正确的数据类型给numpy来更改,即改变:
 pixels = np.empty(size)

to

 pixels = np.empty(size,dtype='uint8')

或者通过改变Image.fromarray中的模式来明确提供模式:

aux = Image.fromarray(pixels)
aux = aux.convert("L")

to

aux = Image.fromarray(pixels,mode='L')

那就是它了。现在的输出完美无缺:['10100001','10100011','10100001','10100000','10100001','10011110','10100001','10100001','10100101','10100011','10100000','10011111','10011001','10100011','10011101','10011000'] - Gabriel Lefundes

网页内容由stack overflow 提供, 点击上面的
可以查看英文原文,
原文链接