コンテンツにスキップ

Module Output

各モジュールの出力について説明します。

Document Analyzer

Document Analyzer モジュールは以下の変数を tuple で出力します。

変数名 説明
results DocumentAnalyzerSchema モジュールの出力結果
ocr_vis np.ndarray | None AI-OCR の出力可視化画像(visualize=True の時のみ)
layout_vis np.ndarray | None Layout Analyzer の出力可視化画像(visualize=True の時のみ)

results 変数の準拠するスキーマ DocumentAnalyzerSchema の仕様は以下の通りです。

DocumentAnalyzerSchema
No Additional Propsobject
#
Type
object
Constraints
  • Additional properties are not allowed
Nested fields
preprocess Preprocessing information of the document
RequiredanyOf
#
Preprocessing information of the document
Type
anyOf
Nested fields
Any of 1
No Additional Propsobject
#
Any of 2
null
#
paragraphs List of detected paragraphs
Requiredarray
#
List of detected paragraphs
Type
array
Nested fields
ParagraphSchema
No Additional Propsobject
#
Type
object
Constraints
  • Additional properties are not allowed
Nested fields
box Bounding box of the paragraph in the format [x1, y1, x2, y2]
Requiredarray
#
Bounding box of the paragraph in the format [x1, y1, x2, y2]
Type
array
Constraints
  • Minimum items: 4
  • Maximum items: 4
Nested fields
Item
integer
#
contents Text content of the paragraph
RequiredanyOf
#
Text content of the paragraph
Type
anyOf
Nested fields
Any of 1
string
#
Any of 2
null
#
direction Text direction, e.g., ['horizontal' or 'vertical']
RequiredanyOf
#
Text direction, e.g., ['horizontal' or 'vertical']
Type
anyOf
Nested fields
Any of 1
string
#
Any of 2
null
#
order Order of the paragraph in the document
RequiredanyOf
#
Order of the paragraph in the document
Type
anyOf
Nested fields
Any of 1
integer
#
Any of 2
null
#
role Role of the paragraph, e.g., ['section_headings', 'page_header', 'page_footer', 'list_item', 'caption', 'inline_formula', 'display_formula', 'index',])
RequiredanyOf
#
Role of the paragraph, e.g., ['section_headings', 'page_header', 'page_footer', 'list_item', 'caption', 'inline_formula', 'display_formula', 'index',])
Type
anyOf
Nested fields
Any of 1
string
#
Any of 2
null
#
indent_level Indentation level of the list_item
anyOf
#
Indentation level of the list_item
Type
anyOf
Constraints
  • Default: None
Nested fields
Any of 1
integer
#
Any of 2
null
#
tables List of detected tables
Requiredarray
#
List of detected tables
Type
array
Nested fields
TableStructureRecognizerSchema
No Additional Propsobject
#
Type
object
Constraints
  • Additional properties are not allowed
Nested fields
box Bounding box of the table in the format [x1, y1, x2, y2]
Requiredarray
#
Bounding box of the table in the format [x1, y1, x2, y2]
Type
array
Constraints
  • Minimum items: 4
  • Maximum items: 4
Nested fields
Item
integer
#
n_row Number of rows in the table
Requiredinteger
#
Number of rows in the table
Type
integer
n_col Number of columns in the table
Requiredinteger
#
Number of columns in the table
Type
integer
rows List of table lines representing rows
Requiredarray
#
List of table lines representing rows
Type
array
Nested fields
TableLineSchema
No Additional Propsobject
#
Type
object
Constraints
  • Additional properties are not allowed
Nested fields
box Bounding box of the table line in the format [x1, y1, x2, y2]
Requiredarray
#
Bounding box of the table line in the format [x1, y1, x2, y2]
Type
array
Constraints
  • Minimum items: 4
  • Maximum items: 4
Nested fields
Item
integer
#
score Confidence score of the table line detection
Requirednumber
#
Confidence score of the table line detection
Type
number
cols List of table lines representing columns
Requiredarray
#
List of table lines representing columns
Type
array
Nested fields
TableLineSchema
No Additional Propsobject
#
Type
object
Constraints
  • Additional properties are not allowed
Nested fields
box Bounding box of the table line in the format [x1, y1, x2, y2]
Requiredarray
#
Bounding box of the table line in the format [x1, y1, x2, y2]
Type
array
Constraints
  • Minimum items: 4
  • Maximum items: 4
Nested fields
Item
integer
#
score Confidence score of the table line detection
Requirednumber
#
Confidence score of the table line detection
Type
number
spans List of table lines representing spans
Requiredarray
#
List of table lines representing spans
Type
array
Nested fields
TableLineSchema
No Additional Propsobject
#
Type
object
Constraints
  • Additional properties are not allowed
Nested fields
box Bounding box of the table line in the format [x1, y1, x2, y2]
Requiredarray
#
Bounding box of the table line in the format [x1, y1, x2, y2]
Type
array
Constraints
  • Minimum items: 4
  • Maximum items: 4
Nested fields
Item
integer
#
score Confidence score of the table line detection
Requirednumber
#
Confidence score of the table line detection
Type
number
cells List of table cells
Requiredarray
#
List of table cells
Type
array
Nested fields
TableCellSchema
No Additional Propsobject
#
Type
object
Constraints
  • Additional properties are not allowed
Nested fields
col Column index of the cell
Requiredinteger
#
Column index of the cell
Type
integer
row Row index of the cell
Requiredinteger
#
Row index of the cell
Type
integer
col_span Number of columns spanned by the cell
Requiredinteger
#
Number of columns spanned by the cell
Type
integer
row_span Number of rows spanned by the cell
Requiredinteger
#
Number of rows spanned by the cell
Type
integer
box Bounding box of the cell in the format [x1, y1, x2, y2]
Requiredarray
#
Bounding box of the cell in the format [x1, y1, x2, y2]
Type
array
Constraints
  • Minimum items: 4
  • Maximum items: 4
Nested fields
Item
integer
#
contents Text content of the cell
RequiredanyOf
#
Text content of the cell
Type
anyOf
Nested fields
Any of 1
string
#
Any of 2
null
#
order Order of the table in the document
Requiredinteger
#
Order of the table in the document
Type
integer
caption Caption of the table
anyOf
#
Caption of the table
Type
anyOf
Constraints
  • Default: None
Nested fields
Any of 1
No Additional Propsobject
#
Any of 2
null
#
words List of recognized words
Requiredarray
#
List of recognized words
Type
array
Nested fields
WordPrediction
No Additional Propsobject
#
Type
object
Constraints
  • Additional properties are not allowed
Nested fields
points Bounding box of the word in the format [[x1, y1], [x2, y2], [x3, y3], [x4, y4]]
Requiredarray
#
Bounding box of the word in the format [[x1, y1], [x2, y2], [x3, y3], [x4, y4]]
Type
array
Constraints
  • Minimum items: 4
  • Maximum items: 4
Nested fields
ArrayItem
array
#
Type
array
Constraints
  • Minimum items: 2
  • Maximum items: 2
Nested fields
Item
integer
#
content Text content of the word
Requiredstring
#
Text content of the word
Type
string
direction Text direction, e.g., 'horizontal' or 'vertical'
Requiredstring
#
Text direction, e.g., 'horizontal' or 'vertical'
Type
string
rec_score Confidence score of the word recognition
Requirednumber
#
Confidence score of the word recognition
Type
number
det_score Confidence score of the word detection
Requirednumber
#
Confidence score of the word detection
Type
number
figures List of detected figures
Requiredarray
#
List of detected figures
Type
array
Nested fields
FigureSchema
No Additional Propsobject
#
Type
object
Constraints
  • Additional properties are not allowed
Nested fields
box Bounding box of the figure in the format [x1, y1, x2, y2]
Requiredarray
#
Bounding box of the figure in the format [x1, y1, x2, y2]
Type
array
Constraints
  • Minimum items: 4
  • Maximum items: 4
Nested fields
Item
integer
#
order Order of the figure in the document
RequiredanyOf
#
Order of the figure in the document
Type
anyOf
Nested fields
Any of 1
integer
#
Any of 2
null
#
paragraphs List of paragraphs associated with the figure
Requiredarray
#
List of paragraphs associated with the figure
Type
array
Nested fields
ParagraphSchema
No Additional Propsobject
#
Type
object
Constraints
  • Additional properties are not allowed
Nested fields
box Bounding box of the paragraph in the format [x1, y1, x2, y2]
Requiredarray
#
Bounding box of the paragraph in the format [x1, y1, x2, y2]
Type
array
Constraints
  • Minimum items: 4
  • Maximum items: 4
Nested fields
Item
integer
#
contents Text content of the paragraph
RequiredanyOf
#
Text content of the paragraph
Type
anyOf
Nested fields
Any of 1
string
#
Any of 2
null
#
direction Text direction, e.g., ['horizontal' or 'vertical']
RequiredanyOf
#
Text direction, e.g., ['horizontal' or 'vertical']
Type
anyOf
Nested fields
Any of 1
string
#
Any of 2
null
#
order Order of the paragraph in the document
RequiredanyOf
#
Order of the paragraph in the document
Type
anyOf
Nested fields
Any of 1
integer
#
Any of 2
null
#
role Role of the paragraph, e.g., ['section_headings', 'page_header', 'page_footer', 'list_item', 'caption', 'inline_formula', 'display_formula', 'index',])
RequiredanyOf
#
Role of the paragraph, e.g., ['section_headings', 'page_header', 'page_footer', 'list_item', 'caption', 'inline_formula', 'display_formula', 'index',])
Type
anyOf
Nested fields
Any of 1
string
#
Any of 2
null
#
indent_level Indentation level of the list_item
anyOf
#
Indentation level of the list_item
Type
anyOf
Constraints
  • Default: None
Nested fields
Any of 1
integer
#
Any of 2
null
#
role Role of the figure, e.g., ['picture', 'logo', 'code', 'seal']
RequiredanyOf
#
Role of the figure, e.g., ['picture', 'logo', 'code', 'seal']
Type
anyOf
Nested fields
Any of 1
string
#
Any of 2
null
#
direction Text direction, e.g., ['horizontal' or 'vertical']
RequiredanyOf
#
Text direction, e.g., ['horizontal' or 'vertical']
Type
anyOf
Nested fields
Any of 1
string
#
Any of 2
null
#
caption Caption of the figure
anyOf
#
Caption of the figure
Type
anyOf
Constraints
  • Default: None
Nested fields
Any of 1
No Additional Propsobject
#
Any of 2
null
#
decode Decoded contents of the code, if applicable
anyOf
#
Decoded contents of the code, if applicable
Type
anyOf
Constraints
  • Default: None
Nested fields
Any of 1
string
#
Any of 2
null
#

AI-OCR

AI-OCR モジュールは以下の変数を tuple で出力します。

変数名 説明
results OCRSchema モジュールの出力結果
ocr_vis np.ndarray | None AI-OCR の出力可視化画像(visualize=Trueの時のみ)

results 変数の準拠するスキーマ OCRSchema の仕様は以下の通りです。

OCRSchema
No Additional Propsobject
#
Type
object
Constraints
  • Additional properties are not allowed
Nested fields
words List of recognized words with their bounding boxes, content, direction, and scores
Requiredarray
#
List of recognized words with their bounding boxes, content, direction, and scores
Type
array
Nested fields
WordPrediction
No Additional Propsobject
#
Type
object
Constraints
  • Additional properties are not allowed
Nested fields
points Bounding box of the word in the format [[x1, y1], [x2, y2], [x3, y3], [x4, y4]]
Requiredarray
#
Bounding box of the word in the format [[x1, y1], [x2, y2], [x3, y3], [x4, y4]]
Type
array
Constraints
  • Minimum items: 4
  • Maximum items: 4
Nested fields
ArrayItem
array
#
Type
array
Constraints
  • Minimum items: 2
  • Maximum items: 2
Nested fields
Item
integer
#
content Text content of the word
Requiredstring
#
Text content of the word
Type
string
direction Text direction, e.g., 'horizontal' or 'vertical'
Requiredstring
#
Text direction, e.g., 'horizontal' or 'vertical'
Type
string
rec_score Confidence score of the word recognition
Requirednumber
#
Confidence score of the word recognition
Type
number
det_score Confidence score of the word detection
Requirednumber
#
Confidence score of the word detection
Type
number

Layout Analyzer

Layout Analyzer モジュールは以下の変数を tuple で出力します。

変数名 説明
results LayoutAnalyzerSchema モジュールの出力結果
layout_vis np.ndarray | None Layout Analyzer の出力可視化画像(visualize=Trueの時のみ)

results 変数の準拠するスキーマ LayoutAnalyzerSchema の仕様は以下の通りです。

LayoutAnalyzerSchema
No Additional Propsobject
#
Type
object
Constraints
  • Additional properties are not allowed
Nested fields
paragraphs List of detected paragraphs
Requiredarray
#
List of detected paragraphs
Type
array
Nested fields
Element
No Additional Propsobject
#
Type
object
Constraints
  • Additional properties are not allowed
Nested fields
id Unique identifier of the layout element
RequiredanyOf
#
Unique identifier of the layout element
Type
anyOf
Nested fields
Any of 1
string
#
Any of 2
null
#
box Bounding box of the layout element in the format [x1, y1, x2, y2]
Requiredarray
#
Bounding box of the layout element in the format [x1, y1, x2, y2]
Type
array
Constraints
  • Minimum items: 4
  • Maximum items: 4
Nested fields
Item
integer
#
score Confidence score of the layout element detection
Requirednumber
#
Confidence score of the layout element detection
Type
number
role Role of the element, e.g., ['section_headings', 'page_header', 'page_footer', 'list_item', 'caption', 'inline_formula', 'display_formula', 'index']
RequiredanyOf
#
Role of the element, e.g., ['section_headings', 'page_header', 'page_footer', 'list_item', 'caption', 'inline_formula', 'display_formula', 'index']
Type
anyOf
Nested fields
Any of 1
string
#
Any of 2
null
#
contents Text content of the layout element
RequiredanyOf
#
Text content of the layout element
Type
anyOf
Nested fields
Any of 1
string
#
Any of 2
null
#
tables List of detected tables
Requiredarray
#
List of detected tables
Type
array
Nested fields
TableStructureRecognizerSchema
No Additional Propsobject
#
Type
object
Constraints
  • Additional properties are not allowed
Nested fields
box Bounding box of the table in the format [x1, y1, x2, y2]
Requiredarray
#
Bounding box of the table in the format [x1, y1, x2, y2]
Type
array
Constraints
  • Minimum items: 4
  • Maximum items: 4
Nested fields
Item
integer
#
n_row Number of rows in the table
Requiredinteger
#
Number of rows in the table
Type
integer
n_col Number of columns in the table
Requiredinteger
#
Number of columns in the table
Type
integer
rows List of table lines representing rows
Requiredarray
#
List of table lines representing rows
Type
array
Nested fields
TableLineSchema
No Additional Propsobject
#
Type
object
Constraints
  • Additional properties are not allowed
Nested fields
box Bounding box of the table line in the format [x1, y1, x2, y2]
Requiredarray
#
Bounding box of the table line in the format [x1, y1, x2, y2]
Type
array
Constraints
  • Minimum items: 4
  • Maximum items: 4
Nested fields
Item
integer
#
score Confidence score of the table line detection
Requirednumber
#
Confidence score of the table line detection
Type
number
cols List of table lines representing columns
Requiredarray
#
List of table lines representing columns
Type
array
Nested fields
TableLineSchema
No Additional Propsobject
#
Type
object
Constraints
  • Additional properties are not allowed
Nested fields
box Bounding box of the table line in the format [x1, y1, x2, y2]
Requiredarray
#
Bounding box of the table line in the format [x1, y1, x2, y2]
Type
array
Constraints
  • Minimum items: 4
  • Maximum items: 4
Nested fields
Item
integer
#
score Confidence score of the table line detection
Requirednumber
#
Confidence score of the table line detection
Type
number
spans List of table lines representing spans
Requiredarray
#
List of table lines representing spans
Type
array
Nested fields
TableLineSchema
No Additional Propsobject
#
Type
object
Constraints
  • Additional properties are not allowed
Nested fields
box Bounding box of the table line in the format [x1, y1, x2, y2]
Requiredarray
#
Bounding box of the table line in the format [x1, y1, x2, y2]
Type
array
Constraints
  • Minimum items: 4
  • Maximum items: 4
Nested fields
Item
integer
#
score Confidence score of the table line detection
Requirednumber
#
Confidence score of the table line detection
Type
number
cells List of table cells
Requiredarray
#
List of table cells
Type
array
Nested fields
TableCellSchema
No Additional Propsobject
#
Type
object
Constraints
  • Additional properties are not allowed
Nested fields
col Column index of the cell
Requiredinteger
#
Column index of the cell
Type
integer
row Row index of the cell
Requiredinteger
#
Row index of the cell
Type
integer
col_span Number of columns spanned by the cell
Requiredinteger
#
Number of columns spanned by the cell
Type
integer
row_span Number of rows spanned by the cell
Requiredinteger
#
Number of rows spanned by the cell
Type
integer
box Bounding box of the cell in the format [x1, y1, x2, y2]
Requiredarray
#
Bounding box of the cell in the format [x1, y1, x2, y2]
Type
array
Constraints
  • Minimum items: 4
  • Maximum items: 4
Nested fields
Item
integer
#
contents Text content of the cell
RequiredanyOf
#
Text content of the cell
Type
anyOf
Nested fields
Any of 1
string
#
Any of 2
null
#
order Order of the table in the document
Requiredinteger
#
Order of the table in the document
Type
integer
caption Caption of the table
anyOf
#
Caption of the table
Type
anyOf
Constraints
  • Default: None
Nested fields
Any of 1
No Additional Propsobject
#
Any of 2
null
#
figures List of detected figures
Requiredarray
#
List of detected figures
Type
array
Nested fields
Element
No Additional Propsobject
#
Type
object
Constraints
  • Additional properties are not allowed
Nested fields
id Unique identifier of the layout element
RequiredanyOf
#
Unique identifier of the layout element
Type
anyOf
Nested fields
Any of 1
string
#
Any of 2
null
#
box Bounding box of the layout element in the format [x1, y1, x2, y2]
Requiredarray
#
Bounding box of the layout element in the format [x1, y1, x2, y2]
Type
array
Constraints
  • Minimum items: 4
  • Maximum items: 4
Nested fields
Item
integer
#
score Confidence score of the layout element detection
Requirednumber
#
Confidence score of the layout element detection
Type
number
role Role of the element, e.g., ['section_headings', 'page_header', 'page_footer', 'list_item', 'caption', 'inline_formula', 'display_formula', 'index']
RequiredanyOf
#
Role of the element, e.g., ['section_headings', 'page_header', 'page_footer', 'list_item', 'caption', 'inline_formula', 'display_formula', 'index']
Type
anyOf
Nested fields
Any of 1
string
#
Any of 2
null
#
contents Text content of the layout element
RequiredanyOf
#
Text content of the layout element
Type
anyOf
Nested fields
Any of 1
string
#
Any of 2
null
#

Table Semantic Parser

Table Semantic Parser モジュールは以下の変数を tuple で出力します。

変数名 説明
results TableSemanticParserSchema モジュールの出力結果
vis_layout np.ndarray | None TableSemanticParser の出力可視化画像(visualize=True の時のみ)
vis_ocr np.ndarray | None AI-OCR の出力可視化画像(visualize=True の時のみ)

results 変数の準拠するスキーマ TableSemanticParserSchema の仕様は以下の通りです。

TableSemanticParserSchema
No Additional Propsobject
#
Type
object
Constraints
  • Additional properties are not allowed
Nested fields
tables List of tables with semantic information
Requiredarray
#
List of tables with semantic information
Type
array
Nested fields
TableSemanticContentsSchema
No Additional Propsobject
#
Type
object
Constraints
  • Additional properties are not allowed
Nested fields
id Unique identifier of the table
anyOf
#
Unique identifier of the table
Type
anyOf
Constraints
  • Default: None
Nested fields
Any of 1
string
#
Any of 2
null
#
style Border style of the table, e.g., ['border', 'borderless']
Requiredstring
#
Border style of the table, e.g., ['border', 'borderless']
Type
string
box Bounding box [x1, y1, x2, y2]
Requiredarray
#
Bounding box [x1, y1, x2, y2]
Type
array
Constraints
  • Minimum items: 4
  • Maximum items: 4
Nested fields
Item
integer
#
cells Cells keyed by cell_id
Requiredobject
#
Cells keyed by cell_id
Type
object
Constraints
  • Additional properties must match the nested schema
Nested fields
Additional property
No Additional Propsobject
#
Type
object
Constraints
  • Additional properties are not allowed
Nested fields
meta Additional metadata for template/semantics
object
#
Additional metadata for template/semantics
Type
object
contents Text content of the cell
RequiredanyOf
#
Text content of the cell
Type
anyOf
Nested fields
Any of 1
string
#
Any of 2
null
#
role Role of the cell, e.g., ['cell', 'header', 'empty', 'group']
RequiredanyOf
#
Role of the cell, e.g., ['cell', 'header', 'empty', 'group']
Type
anyOf
Nested fields
Any of 1
string
#
Any of 2
null
#
id Unique identifier of the cell
RequiredanyOf
#
Unique identifier of the cell
Type
anyOf
Nested fields
Any of 1
string
#
Any of 2
null
#
box Bounding box of the cell in the format [x1, y1, x2, y2]
Requiredarray
#
Bounding box of the cell in the format [x1, y1, x2, y2]
Type
array
Constraints
  • Minimum items: 4
  • Maximum items: 4
Nested fields
Item
integer
#
row Row index of the cell in the table
RequiredanyOf
#
Row index of the cell in the table
Type
anyOf
Nested fields
Any of 1
integer
#
Any of 2
null
#
col Column index of the cell in the table
RequiredanyOf
#
Column index of the cell in the table
Type
anyOf
Nested fields
Any of 1
integer
#
Any of 2
null
#
row_span Number of rows spanned by the cell
RequiredanyOf
#
Number of rows spanned by the cell
Type
anyOf
Nested fields
Any of 1
integer
#
Any of 2
null
#
col_span Number of columns spanned by the cell
RequiredanyOf
#
Number of columns spanned by the cell
Type
anyOf
Nested fields
Any of 1
integer
#
Any of 2
null
#
kv_items Key-value items extracted from the table
Requiredarray
#
Key-value items extracted from the table
Type
array
Nested fields
KvItemSchema
No Additional Propsobject
#
Type
object
Constraints
  • Additional properties are not allowed
Nested fields
id Unique identifier of the key-value item
RequiredanyOf
#
Unique identifier of the key-value item
Type
anyOf
Nested fields
Any of 1
string
#
Any of 2
null
#
key Key cell id(s)
RequiredanyOf
#
Key cell id(s)
Type
anyOf
Nested fields
Any of 1
string
#
Any of 2
array
#
value Value cell id
Requiredstring
#
box Bounding box of the key-value item in the format [x1, y1, x2, y2]
anyOf
#
Bounding box of the key-value item in the format [x1, y1, x2, y2]
Type
anyOf
Constraints
  • Default: None
Nested fields
Any of 1
array
#
Any of 2
null
#
grids Grid representation of the table
Requiredarray
#
Grid representation of the table
Type
array
Nested fields
TableGridSchema
No Additional Propsobject
#
Type
object
Constraints
  • Additional properties are not allowed
Nested fields
id Unique identifier of the table grid
RequiredanyOf
#
Unique identifier of the table grid
Type
anyOf
Nested fields
Any of 1
string
#
Any of 2
null
#
box Bounding box of the table grid in the format [x1, y1, x2, y2]
Requiredarray
#
Bounding box of the table grid in the format [x1, y1, x2, y2]
Type
array
Constraints
  • Minimum items: 4
  • Maximum items: 4
Nested fields
Item
integer
#
n_row Number of rows in the table grid
Requiredinteger
#
Number of rows in the table grid
Type
integer
n_col Number of columns in the table grid
Requiredinteger
#
Number of columns in the table grid
Type
integer
col_headers 2D array representing the column header cell ids
Requiredarray
#
2D array representing the column header cell ids
Type
array
Nested fields
ArrayItem
array
#
Type
array
Nested fields
Item
string
#
data 2D array representing the table grid data with cell ids
Requiredarray
#
2D array representing the table grid data with cell ids
Type
array
Nested fields
ArrayItem
array
#
Type
array
Nested fields
Item
anyOf
#
paragraphs List of recognized paragraphs in the document
Requiredarray
#
List of recognized paragraphs in the document
Type
array
Nested fields
Element
No Additional Propsobject
#
Type
object
Constraints
  • Additional properties are not allowed
Nested fields
id Unique identifier of the layout element
RequiredanyOf
#
Unique identifier of the layout element
Type
anyOf
Nested fields
Any of 1
string
#
Any of 2
null
#
box Bounding box of the layout element in the format [x1, y1, x2, y2]
Requiredarray
#
Bounding box of the layout element in the format [x1, y1, x2, y2]
Type
array
Constraints
  • Minimum items: 4
  • Maximum items: 4
Nested fields
Item
integer
#
score Confidence score of the layout element detection
Requirednumber
#
Confidence score of the layout element detection
Type
number
role Role of the element, e.g., ['section_headings', 'page_header', 'page_footer', 'list_item', 'caption', 'inline_formula', 'display_formula', 'index']
RequiredanyOf
#
Role of the element, e.g., ['section_headings', 'page_header', 'page_footer', 'list_item', 'caption', 'inline_formula', 'display_formula', 'index']
Type
anyOf
Nested fields
Any of 1
string
#
Any of 2
null
#
contents Text content of the layout element
RequiredanyOf
#
Text content of the layout element
Type
anyOf
Nested fields
Any of 1
string
#
Any of 2
null
#
words List of recognized words in the document
Requiredarray
#
List of recognized words in the document
Type
array
Nested fields
WordPrediction
No Additional Propsobject
#
Type
object
Constraints
  • Additional properties are not allowed
Nested fields
points Bounding box of the word in the format [[x1, y1], [x2, y2], [x3, y3], [x4, y4]]
Requiredarray
#
Bounding box of the word in the format [[x1, y1], [x2, y2], [x3, y3], [x4, y4]]
Type
array
Constraints
  • Minimum items: 4
  • Maximum items: 4
Nested fields
ArrayItem
array
#
Type
array
Constraints
  • Minimum items: 2
  • Maximum items: 2
Nested fields
Item
integer
#
content Text content of the word
Requiredstring
#
Text content of the word
Type
string
direction Text direction, e.g., 'horizontal' or 'vertical'
Requiredstring
#
Text direction, e.g., 'horizontal' or 'vertical'
Type
string
rec_score Confidence score of the word recognition
Requirednumber
#
Confidence score of the word recognition
Type
number
det_score Confidence score of the word detection
Requirednumber
#
Confidence score of the word detection
Type
number

Auto-generated from JSON Schema files.