本文属于机器翻译版本。若本译文内容与英语原文存在差异,则一律以英文原文为准。
了解清单摘要
清单摘要包含以下信息。
有关在验证期间发生的终止性清单内容错误列表的错误信息。
训练和测试数据集中的非终止性 JSON 行验证错误列表的错误位置信息。
错误统计信息,例如在训练和测试数据集中发现的无效 JSON 行总数。
如果没有终止性清单文件错误列表,则会在训练期间创建清单摘要。要获取清单摘要文件 (manifest_summary.json) 的位置,请参阅获取验证结果。
有关特定清单内容错误的信息,请参阅终止性清单内容错误。
清单摘要文件格式
清单文件包含 2 个部分:statistics
和 errors
。
statistics
statistics
包含有关训练和测试数据集中错误的信息。
training
:训练数据集中的统计信息和发现的错误。testing
:测试数据集中的统计信息和发现的错误。
errors
数组中的对象包含清单内容错误的错误代码和错误消息。
error_line_indices
数组包含训练或测试清单中存在错误的每个 JSON 行的行号。有关更多信息,请参阅 修复训练错误。
错误
跨越训练和测试数据集的错误。例如,当没有足够的可用标签与训练和测试数据集重叠时,就会发生 ERROR_INSUFFICIENT_USABLE_LABEL_OVERLAP。
{ "statistics": { "training": { "use_case": String, # Possible values are IMAGE_LEVEL_LABELS, OBJECT_LOCALIZATION and NOT_DETERMINED "total_json_lines": Number, # Total number json lines (images) in the training manifest. "valid_json_lines": Number, # Total number of JSON Lines (images) that can be used for training. "invalid_json_lines": Number, # Total number of invalid JSON Lines. They are not used for training. "ignored_json_lines": Number, # JSON Lines that have a valid schema but have no annotations. The aren't used for training and aren't counted as invalid. "error_json_line_indices": List[int], # Contains a list of line numbers for JSON line errors in the training dataset. "errors": [ { "code": String, # Error code for a training manifest content error. "message": String # Description for a training manifest content error. } ] }, "testing": { "use_case": String, # Possible values are IMAGE_LEVEL_LABELS, OBJECT_LOCALIZATION and NOT_DETERMINED "total_json_lines": Number, # Total number json lines (images) in the manifest. "valid_json_lines": Number, # Total number of JSON Lines (images) that can be used for testing. "invalid_json_lines": Number, # Total number of invalid JSON Lines. They are not used for testing. "ignored_json_lines": Number, # JSON Lines that have a valid schema but have no annotations. They aren't used for testing and aren't counted as invalid. "error_json_line_indices": List[int], # contains a list of error record line numbers in testing dataset. "errors": [ { "code": String, # # Error code for a testing manifest content error. "message": String # Description for a testing manifest content error. } ] } }, "errors": [ { "code": String, # # Error code for errors that span the training and testing datasets. "message": String # Description of the error. } ] }
示例清单摘要
以下示例是显示终止性清单内容错误 (ERROR_TOO_MANY_INVALID_ROWS_IN_MANIFEST) 的一部分清单摘要。error_json_line_indices
数组包含相应训练或测试验证清单中非终止性 JSON 行错误的行号。
{ "errors": [], "statistics": { "training": { "use_case": "NOT_DETERMINED", "total_json_lines": 301, "valid_json_lines": 146, "invalid_json_lines": 155, "ignored_json_lines": 0,
"errors": [ { "code": "ERROR_TOO_MANY_INVALID_ROWS_IN_MANIFEST", "message": "The manifest file contains too many invalid rows." } ],
"error_json_line_indices": [
15, 16, 17, 22, 23, 24, . . . . 300 ] }, "testing": { "use_case": "NOT_DETERMINED", "total_json_lines": 15, "valid_json_lines": 13, "invalid_json_lines": 2, "ignored_json_lines": 0, "errors": [],"error_json_line_indices": [
13, 15 ] } } }