Title: | Access to the 'Google Cloud Vision' API for Image Recognition, OCR and Labeling |
---|---|
Description: | Interact with the 'Google Cloud Vision' <https://cloud.google.com/vision/> API in R. Part of the 'cloudyr' <https://cloudyr.github.io/> project. |
Authors: | Jeno Pal [cre], Tamas Koncz [aut], Balazs Varkoly [aut], Peter Lukacs [aut], Eszter Kocsis [aut], Florian Teschner [ctb] |
Maintainer: | Jeno Pal <[email protected]> |
License: | MIT + file LICENSE |
Version: | 0.2.0.9000 |
Built: | 2025-03-08 02:29:54 UTC |
Source: | https://github.com/cloudyr/googlecloudvisionr |
sends the request defined in 'body' to the API
call_vision_api( body, apiEndpoint = "images:annotate", httpRequestType = "POST" )
call_vision_api( body, apiEndpoint = "images:annotate", httpRequestType = "POST" )
body |
output of create_request_body() |
apiEndpoint |
character, api endpoint |
httpRequestType |
character, type of the http request |
API response in raw format
creates a json output from the inputs
create_request_body(imagePaths, feature, maxNumResults)
create_request_body(imagePaths, feature, maxNumResults)
imagePaths |
character, file paths, URLs or Cloud Storage URIs of the images, can be a combination of all three |
feature |
character, one out of: "LABEL_DETECTION", "FACE_DETECTION", "TEXT_DETECTION", "DOCUMENT_TEXT_DETECTION", "LOGO_DETECTION", "LANDMARK_DETECTION", "IMAGE_PROPERTIES", "OBJECT_LOCALIZATION" |
maxNumResults |
integer, the maximum number of results (per image) to be returned. |
request body (payload), encoded as json
creates a list output from the inputs
create_single_image_request(imagePath, feature, maxNumResults)
create_single_image_request(imagePath, feature, maxNumResults)
imagePath |
character, file path, URL or Cloud Storage URI of the image |
feature |
character, one out of: "LABEL_DETECTION", "FACE_DETECTION", "TEXT_DETECTION", "DOCUMENT_TEXT_DETECTION", "LOGO_DETECTION", "LANDMARK_DETECTION", "IMAGE_PROPERTIES", "OBJECT_LOCALIZATION" |
maxNumResults |
integer, the maximum number of results (per image) to be returned. |
list of request details for one image
base64 encodes an image file
encode_image(imagePath)
encode_image(imagePath)
imagePath |
character, path to the image |
get the image back as encoded file
a utility to extract features from the API response
extract_annotations(responses, imagePaths, featureType)
extract_annotations(responses, imagePaths, featureType)
responses |
an API response object |
imagePaths |
character, file paths, URLs or Cloud Storage URIs of the images, can be a combination of all three |
featureType |
the type of annotation as called in the response object |
a data.table
helper function code to extract error from API response into a data.table
extract_error(responses, imagePaths)
extract_error(responses, imagePaths)
responses |
an API response object |
imagePaths |
character, file paths, URLs or Cloud Storage URIs of the images, can be a combination of all three |
a data.table
a utility to extract features from the API response
extract_response(responses, imagePaths, feature)
extract_response(responses, imagePaths, feature)
responses |
an API response object |
imagePaths |
character, file paths, URLs or Cloud Storage URIs of the images, can be a combination of all three |
feature |
character, one out of: "LABEL_DETECTION", "FACE_DETECTION", "TEXT_DETECTION", "DOCUMENT_TEXT_DETECTION", "LOGO_DETECTION", "LANDMARK_DETECTION", "IMAGE_PROPERTIES", "OBJECT_LOCALIZATION" |
a data.table
a utility to provide functions to extract features from the API response
extractor(featureType)
extractor(featureType)
featureType |
the type of annotation as called in the response object |
a function
helper function code to extract API response into a data.table for given feature type
face_detection_extractor(response)
face_detection_extractor(response)
response |
an element of the API response object |
a data.table
helper function code to record available feature types
gcv_get_available_feature_types()
gcv_get_available_feature_types()
a list of available features names and their types (as returned by the API)
gcv_get_available_feature_types()
gcv_get_available_feature_types()
Given a list of images, a feature type and the maximum number of responses, this functions calls the Google Cloud Vision API, and returns the image annotations in a data.table format.
gcv_get_image_annotations( imagePaths, feature = "LABEL_DETECTION", maxNumResults = NULL, batchSize = 64L, savePath = NULL )
gcv_get_image_annotations( imagePaths, feature = "LABEL_DETECTION", maxNumResults = NULL, batchSize = 64L, savePath = NULL )
imagePaths |
character, file paths, URLs or Cloud Storage URIs of the images, can be a combination of all three |
feature |
character, one out of: "LABEL_DETECTION", "FACE_DETECTION", "TEXT_DETECTION", "DOCUMENT_TEXT_DETECTION", "LOGO_DETECTION", "LANDMARK_DETECTION", "IMAGE_PROPERTIES", "OBJECT_LOCALIZATION" |
maxNumResults |
integer, the maximum number of results (per image) to be returned. |
batchSize |
integer, the chunk size for batch processing |
savePath |
character, if specified, results will be saved to this path (as .csv) |
a data frame with image annotation results
## Not run: # Label Detection (default), with maximum 7 results returned per image imagePath <- system.file( "extdata", "golden_retriever_puppies.jpg", package = "googleCloudVisionR" ) gcv_get_image_annotations(imagePaths = imagePath, maxNumResults = 7) # Face detection imagePath <- system.file( "extdata", "arnold_wife.jpg", package = "googleCloudVisionR" ) gcv_get_image_annotations(imagePaths = imagePath, feature = "FACE_DETECTION") # Google Cloud Storage URI as input gcv_get_image_annotations("gs://vision-api-handwriting-ocr-bucket/handwriting_image.png") ## End(Not run)
## Not run: # Label Detection (default), with maximum 7 results returned per image imagePath <- system.file( "extdata", "golden_retriever_puppies.jpg", package = "googleCloudVisionR" ) gcv_get_image_annotations(imagePaths = imagePath, maxNumResults = 7) # Face detection imagePath <- system.file( "extdata", "arnold_wife.jpg", package = "googleCloudVisionR" ) gcv_get_image_annotations(imagePaths = imagePath, feature = "FACE_DETECTION") # Google Cloud Storage URI as input gcv_get_image_annotations("gs://vision-api-handwriting-ocr-bucket/handwriting_image.png") ## End(Not run)
Given an image, and search parameters, this functions calls the Google Cloud Vision API, and returns an unformatted response (JSON) from the API. If you are looking for a friendlier result, refer to the 'gcv_get_product_search_response()' function
gcv_get_product_search_raw_response( imagePath, projectId, locationId, productSetId, productCategory, maxNumResults )
gcv_get_product_search_raw_response( imagePath, projectId, locationId, productSetId, productCategory, maxNumResults )
imagePath |
character, file path, URLs or Cloud Storage URI of the image |
projectId |
character, GCP project id |
locationId |
character, GCP location id |
productSetId |
character, product set id for Product Search |
productCategory |
character, roduct category for Product Search |
maxNumResults |
integer, the maximum number of results (per image) to be returned. |
a list with product search results
Given an image, and search parameters, this functions calls the Google Cloud Vision API, and returns formatted response from the API. Results are not complete! If you are looking for full results, refer to the 'gcv_get_product_search_raw_response()' function
gcv_get_product_search_response( imagePath, projectId, locationId, productSetId, productCategory, maxNumResults )
gcv_get_product_search_response( imagePath, projectId, locationId, productSetId, productCategory, maxNumResults )
imagePath |
character, file path, URLs or Cloud Storage URI of the image |
projectId |
character, GCP project id |
locationId |
character, GCP location id |
productSetId |
character, product set id for Product Search |
productCategory |
character, roduct category for Product Search |
maxNumResults |
integer, the maximum number of results (per image) to be returned. |
a data frame with product search results
Given a list of images, a feature type and the maximum number of responses, this functions calls the Google Cloud Vision API, and returns the raw response from the API. For a friendlier response, refer to the 'gcv_get_image_annotations' function, which returns results in a data.table format (however, the information returned is limited compared to the raw response).
gcv_get_raw_response( imagePaths, feature = "LABEL_DETECTION", maxNumResults = NULL )
gcv_get_raw_response( imagePaths, feature = "LABEL_DETECTION", maxNumResults = NULL )
imagePaths |
character, file paths, URLs or Cloud Storage URIs of the images, can be a combination of all three |
feature |
character, one out of: "LABEL_DETECTION", "FACE_DETECTION", "TEXT_DETECTION", "DOCUMENT_TEXT_DETECTION", "LOGO_DETECTION", "LANDMARK_DETECTION", "IMAGE_PROPERTIES", "OBJECT_LOCALIZATION" |
maxNumResults |
integer, the maximum number of results (per image) to be returned. |
a response object returned by the API. To get the image annotations, take the "content" element from the object
## Not run: imagePath <- system.file( "extdata", "golden_retriever_puppies.jpg", package = "googleCloudVisionR" ) raw_response <- gcv_get_raw_response(imagePaths = imagePath, maxNumResults = 7) str(raw_response) raw_response ## End(Not run)
## Not run: imagePath <- system.file( "extdata", "golden_retriever_puppies.jpg", package = "googleCloudVisionR" ) raw_response <- gcv_get_raw_response(imagePaths = imagePath, maxNumResults = 7) str(raw_response) raw_response ## End(Not run)
Given a list of images, a feature type and the maximum number of responses, this functions calls the Google Cloud Vision API, and returns the image annotations in a data.table format.
gcv_get_response(imagePaths, feature, maxNumResults)
gcv_get_response(imagePaths, feature, maxNumResults)
imagePaths |
character, file paths, URLs or Cloud Storage URIs of the images, can be a combination of all three |
feature |
character, one out of: "LABEL_DETECTION", "FACE_DETECTION", "TEXT_DETECTION", "DOCUMENT_TEXT_DETECTION", "LOGO_DETECTION", "LANDMARK_DETECTION", "IMAGE_PROPERTIES", "OBJECT_LOCALIZATION" |
maxNumResults |
integer, the maximum number of results (per image) to be returned. |
a data frame with image annotation results
## Not run: imagePath <- system.file( "extdata", "golden_retriever_puppies.jpg", package = "googleCloudVisionR" ) gcv_get_response(imagePaths = imagePath, maxNumResults = 7) ## End(Not run)
## Not run: imagePath <- system.file( "extdata", "golden_retriever_puppies.jpg", package = "googleCloudVisionR" ) gcv_get_response(imagePaths = imagePath, maxNumResults = 7) ## End(Not run)
helper function code to extract Bounding Box x,y coordinates for an API response element
get_bounding_boxes(response)
get_bounding_boxes(response)
response |
an element of the API response object |
a data.table
helper function to validate input image paths
get_invalid_image_paths(vec)
get_invalid_image_paths(vec)
vec |
a vector of paths |
vector of invalid paths from @vec
helper function code to extract API response into a data.table for given feature type
image_properties_extractor(response)
image_properties_extractor(response)
response |
an element of the API response object |
a data.table
helper function code to extract API response into a data.table for given feature type
label_detection_extractor(response)
label_detection_extractor(response)
response |
an element of the API response object |
a data.table
helper function code to extract API response into a data.table for given feature type
landmark_detection_extractor(response)
landmark_detection_extractor(response)
response |
an element of the API response object |
a data.table
helper function code to extract API response into a data.table for given feature type
logo_detection_extractor(response)
logo_detection_extractor(response)
response |
an element of the API response object |
a data.table
helper function code to extract API response into a data.table for given feature type
object_localization_extractor(response)
object_localization_extractor(response)
response |
an element of the API response object |
a data.table
helper function code to extract API response into a data.table for given feature type
ocr_extractor(response)
ocr_extractor(response)
response |
an element of the API response object |
a data.table
helper function to split a vector to approximately equally sized chunks
split_to_chunks(vec, chunkSize)
split_to_chunks(vec, chunkSize)
vec |
a vector |
chunkSize |
integer, how long should the chunks be? |
a list of chunks