OCR model for recognizing characters in images

Hierarchy

  • default

Constructors

  • Creates a new OCR model

    Parameters

    • modelPath: null | string = null

      Optional custom path to the ONNX model file

    Returns default

Properties

chars: string[]

Character set for recognition

modelPath: string

Path to the ONNX model file

session: InferenceSession

ONNX inference session

Methods

  • Loads the ONNX model for inference

    Returns Promise<void>

    A promise that resolves when the model is loaded

  • Predicts text from an image

    Parameters

    • imageBuffer: Buffer

      The image buffer to process

    Returns Promise<string>

    A promise that resolves to the recognized text

  • Private

    Downloads the ONNX model from GitHub

    Returns Promise<void>

    A promise that resolves when the model is downloaded

    Throws

    If the download fails

  • Private

    Reshapes a flat tensor into a 3D array

    Parameters

    • data: Float32Array

      The flat tensor data

    • shape: number[]

      The target shape [batchSize, seqLength, numClasses]

    Returns number[][][]

    Reshaped 3D array

  • Private

    Computes argmax along a specified axis

    Parameters

    • tensor: number[][][]

      The input tensor

    • axis: number

      The axis along which to compute argmax (only 2 is supported)

    Returns number[][]

    The indices of maximum values

Generated using TypeDoc