Anaconda 環境で Google Cloud Vision APIを使ってみる-スケ郎のお話

Anacondaで Pythonの仮想環境を作って、Google Cloud Vision APIを試してみます。

前提

・Cloud Vision API の有効化がされていること
・サービスアカウントのキー（JSON）は、環境変数へ設定済みであること

まだって人は、次のサイトで詳しく説明されています。
https://dev.classmethod.jp/articles/google-cloud_vision-api/

Anaconda仮想環境の作成

今回は「myenv」という名前で新しく仮想環境を作ります。Pythonのバージョンは「3.9」としましょう。

conda create -n myenv python=3.9

上で作成した仮想環境に切り替えます。

conda activate myenv

ライブラリをインストール

Cloud Visionの、Pythonクライアントをインストールします。

conda install -c conda-forge google-cloud-vision

Google Cloud Vision APIを実行してみる

次のコードを書いて、Google Cloud Vision APIを呼び出してみます。

import io
import os

from google.cloud import vision

# Vision APIで文字認識する画像を読みこみ
file_name = os.path.abspath('path/to/image')
with io.open(file_name, 'rb') as image_file:
    content = image_file.read()
image = vision.Image(content=content)

# APIクライアントの設定
client = vision.ImageAnnotatorClient()

# 文字認識（OCR）の実行
response = client.text_detection(image=image)
texts = response.text_annotations

# 認識結果を表示
for text in texts:
    # 認識した文字列
    print('\n"{}"'.format(text.description))
    
    # 認識した文字のXY座標
    vertices = (['({},{})'.format(vertex.x, vertex.y)
                for vertex in text.bounding_poly.vertices])
    print('bounds: {}'.format(','.join(vertices)))

# エラー処理
if response.error.message:
    raise Exception(
        '{}\nFor more info on error messages, check: '
        'https://cloud.google.com/apis/design/errors'.format(
            response.error.message))

今回は、こんな画像を文字認識させてみました。

enter image description here

そして結果は次のとおり。

"火気
嚴禁"
bounds: (120,160),(405,160),(405,454),(120,454)

"火"
bounds: (121,192),(248,176),(262,290),(135,306)

"気"
bounds: (237,177),(376,160),(390,274),(251,291)

"嚴禁"
bounds: (137,327),(388,312),(395,430),(144,446)

完全に認識してますね！！

Anaconda 環境で Google Cloud Vision APIを使ってみる

前提

Anaconda仮想環境の作成

ライブラリをインストール

Google Cloud Vision APIを実行してみる

0 件のコメント:

コメントを投稿

人気の投稿

このブログを検索

Profile

ラベル