Tutorial: Trying Out a Speech-to-Text API with Node.js

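Use Deepgram's API to convert audio files or audio streams to text.

I started this blog to document the detailed steps and the notes I take while learning Node.js.
If you are interested and want to try it hands-on, follow the steps below and enjoy!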

Prerequisites

Node.js installed
A command-line interface (CLI/terminal)
A code editor of your choice (for example, VS Code)
A Deepgram account

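Getting started

First, navigate to the directory where you want the project to live, then create a folder (named sttApp here) with the following command: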

mkdir sttApp

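Then open the folder in your preferred IDE. I use VS Code. At this point the directory is empty, with no files in it.

Next, in the terminal, change into the new sttApp directory: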

cd sttApp

Run the following command to initialize a new application:

npm init

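Press Enter at each prompt to accept the default values. npm then prints a preview of the package.json it is about to write and asks you to confirm.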

Next, install the Deepgram Node.js SDK with the following command:

npm install @deepgram/sdk
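
Major releases of an SDK can change what the package exports, so it can help to confirm which version npm actually installed before writing code against it. The sketch below reads the version from the package's own package.json under node_modules. The helper name installedVersion is made up for this example, and it assumes you run it from the project folder (sttApp).

```javascript
const fs = require('fs');
const path = require('path');

// Hypothetical helper: returns the installed version of a package,
// or null if it is not present under <baseDir>/node_modules.
function installedVersion(packageName, baseDir = process.cwd()) {
  const manifestPath = path.join(baseDir, 'node_modules', packageName, 'package.json');
  try {
    const manifest = JSON.parse(fs.readFileSync(manifestPath, 'utf8'));
    return manifest.version || null;
  } catch (err) {
    // Missing or unreadable manifest: treat the package as not installed.
    return null;
  }
}

const version = installedVersion('@deepgram/sdk');
console.log(version ? `@deepgram/sdk ${version} is installed` : '@deepgram/sdk was not found');
```

If the version differs from the one a tutorial was written against, check the SDK's documentation for that version before copying code.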

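If the previous steps went well, the project folder in your IDE should now contain a node_modules folder, a package.json file, and a package-lock.json file.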

Now create a file named index.js in the current directory (sttApp) in your IDE, paste the following code into it, and save the file:

const { Deepgram } = require('@deepgram/sdk');
const fs = require('fs');

// Your Deepgram API key (created in your Deepgram account)
const deepgramApiKey = 'YOUR_API_KEY';

// Replace with your file path and audio mimetype
const pathToFile = 'SOME_FILE.wav';
const mimetype = 'audio/wav';

// Initializes the Deepgram SDK
const deepgram = new Deepgram(deepgramApiKey);

console.log('Requesting transcript...');
console.log('Your file may take up to a couple minutes to process.');
console.log('While you wait, did you know that Deepgram accepts over 40 audio file formats? Even MP4s.');
console.log('To learn more about customizing your transcripts check out developers.deepgram.com.');

deepgram.transcription.preRecorded(
  { buffer: fs.readFileSync(pathToFile), mimetype },
  { punctuate: true, language: 'en-US' },
)
.then((transcription) => {
  console.dir(transcription, {depth: null});
})
.catch((err) => {
  // Report failures on stderr so they are easy to tell apart from normal output
  console.error(err);
});
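
The script above prints the whole response object, which is deeply nested. If you only want the text, a small helper can pull it out. This is a minimal sketch that assumes the response follows the shape results.channels[].alternatives[].transcript; compare it with what console.dir prints for you before relying on it. The helper name extractTranscripts and the sample object are made up for illustration.

```javascript
// Hypothetical helper: returns the first (top) transcript of each channel.
// Assumes the shape results.channels[].alternatives[].transcript.
function extractTranscripts(response) {
  const channels = (response && response.results && response.results.channels) || [];
  return channels.map((channel) => {
    const best = channel.alternatives && channel.alternatives[0];
    // Fall back to an empty string when a channel has no alternatives.
    return best && typeof best.transcript === 'string' ? best.transcript : '';
  });
}

// Made-up sample, trimmed to the fields the helper reads.
const sampleResponse = {
  results: {
    channels: [
      { alternatives: [{ transcript: 'Hello world.', confidence: 0.98 }] },
    ],
  },
};

console.log(extractTranscripts(sampleResponse)); // [ 'Hello world.' ]
```

In the script, you could call extractTranscripts(transcription) inside the .then callback instead of console.dir to print only the text.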