`langchain_community.document_loaders.google_speech_to_text`.GoogleSpeechToTextLoader¶

class langchain_community.document_loaders.google_speech_to_text.GoogleSpeechToTextLoader(project_id: str, file_path: str, location: str = 'us-central1', recognizer_id: str = '_', config: Optional[RecognitionConfig] = None, config_mask: Optional[FieldMask] = None)[源代码]¶

已过时，自版本 0.0.32 起使用 langchain_google_community.SpeechToTextLoader 代替。

Google Cloud Speech-to-Text音频转录的加载器。

它使用Google Cloud Speech-to-Text API将音频文件转录，并根据指定的格式将转录文本加载到一个或多个Document中。

要使用此功能，您应已安装google-cloud-speech Python包。

可以通过Google Cloud Storage URI或本地文件路径指定音频文件。

有关Google Cloud Speech-to-Text的详细信息，请参阅产品文档。 https://cloud.google.com/speech-to-text

初始化GoogleSpeechToTextLoader。

参数

project_id (str) – Google Cloud Project ID。
file_path (str) – 一个Google Cloud Storage URI或本地文件路径。
location (str) – Speech-to-Text识别器的位置。
recognizer_id (str) – Speech-to-Text识别器ID。
config (Optional[RecognitionConfig]) – 识别选项和功能。更多信息请参阅：https://cloud.google.com/python/docs/reference/speech/latest/google.cloud.speech_v2.types.RecognitionConfig
config_mask (Optional[FieldMask]) – 在此次识别请求中，配置中字段的列表，这些字段会覆盖识别器default_recognition_config中的值。更多信息请参阅：https://cloud.google.com/python/docs/reference/speech/latest/google.cloud.speech_v2.types.RecognizeRequest

方法

`__init__`(project_id, file_path[, location, ...])	初始化GoogleSpeechToTextLoader。
`alazy_load`()	Documents的懒加载器。
`aload`()	将数据加载到Document对象中。
`lazy_load`()	Documents的懒加载器。
`load`()	转录音频文件并将转录文本加载到文档中。
加载文档并将其分割成块。	`load_and_split`([text_splitter])

加载文档并分割成块。

初始化GoogleSpeechToTextLoader。

参数

project_id (str) – Google Cloud Project ID。
file_path (str) – 一个Google Cloud Storage URI或本地文件路径。
location (str) – Speech-to-Text识别器的位置。
recognizer_id (str) – Speech-to-Text识别器ID。
config (Optional[RecognitionConfig]) – 识别选项和功能。更多信息请参阅：https://cloud.google.com/python/docs/reference/speech/latest/google.cloud.speech_v2.types.RecognitionConfig
config_mask (Optional[FieldMask]) – 在此次识别请求中，配置中字段的列表，这些字段会覆盖识别器default_recognition_config中的值。更多信息请参阅：https://cloud.google.com/python/docs/reference/speech/latest/google.cloud.speech_v2.types.RecognizeRequest

__init__(

Documents的懒加载器。

异步加载: async alazy_load() → AsyncIterator[Document]

返回类型

将数据加载到Document对象中。

异步加载: AsyncIterator[Document]

async aload() → List[Document]

Documents的懒加载器。

异步加载: List[Document]

load() → List[Document][source]¶

转录音频文件并将转录文本加载到文档中。

它使用Google Cloud语音识别API来转换音频文件，并阻塞直到转换完成后。

异步加载: AsyncIterator[Document]

load_and_split(text_splitter: Optional[TextSplitter] = None) → List[Document]¶

加载文档并将其分割成块。块作为文档返回。

请勿重写此方法。应考虑将其弃用！

参数: text_splitter (可选[TextSplitter]) – 用于分割文档的TextSplitter实例。默认为RecursiveCharacterTextSplitter。
返回: 文档列表。
异步加载: 列表[Document]

langchain_community.document_loaders.google_speech_to_text.GoogleSpeechToTextLoader¶

`langchain_community.document_loaders.google_speech_to_text`.GoogleSpeechToTextLoader¶