SDG failed with error "failed to generate data with exception: list index out of range" in RHEL AI
Issue
- SDG fails with the error "failed to generate data with exception: list index out of range" in RHEL AI. Refer to the following error output:
WARNING 2025-03-11 12:03:26,640 root:177: Provided markdown file /var/home/instruct/.local/share/instructlab/datasets/2025-03-11_120320/preprocessed_2025-03-11T12_03_25/documents/knowledge_rhelai_76xhx3nq/rhelai.md contains HTML contents, which is currently unsupported as a part of markdown NOTE: Continuing this might affect your data generation quality.
To get best results please format your markdown documents without the use of HTML or use a different document filetype.
INFO 2025-03-11 12:04:01,198 instructlab.sdg.utils.chunkers:184: Processing parsed docling json file: /var/home/instruct/.local/share/instructlab/datasets/2025-03-11_120320/preprocessed_2025-03-11T12_03_25/documents/docling-artifacts/rhelai.json
failed to generate data with exception: list index out of range
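The warning in the output above advises formatting markdown documents without HTML. As a rough aid for following that advice, the Python sketch below lists lines in a markdown file that appear to contain HTML tags or comments. It is only a heuristic check and is not part of RHEL AI or InstructLab: the function name `find_html_lines` and the regular expression are assumptions made for illustration, and this article does not confirm that HTML content is what triggers the "list index out of range" exception.

```python
import re

# Heuristic pattern (an assumption, not the check InstructLab performs):
# matches tags such as <div>, </table>, <br/>, and the start of HTML comments.
# Markdown autolinks like <https://example.com> are not matched.
HTML_TAG = re.compile(r"</?[A-Za-z][A-Za-z0-9-]*(\s[^<>]*)?/?>|<!--")


def find_html_lines(text):
    """Return (line_number, line) pairs that look like HTML.

    Lines inside fenced code blocks are skipped, since HTML shown
    there is usually sample code rather than document markup.
    """
    hits = []
    in_fence = False
    for number, line in enumerate(text.splitlines(), start=1):
        if line.lstrip().startswith("```"):
            in_fence = not in_fence  # toggle on opening/closing fence
            continue
        if not in_fence and HTML_TAG.search(line):
            hits.append((number, line.strip()))
    return hits
```

To use it, read the knowledge document into a string (for example with `pathlib.Path(path).read_text(encoding="utf-8")`), pass the string to `find_html_lines`, and review each reported line before running SDG again.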
Environment
- Red Hat Enterprise Linux AI (RHEL AI)
- 1.2
- 1.3
- 1.4.1