| File | Task | Rows | Size | Key Columns | S3 Bucket | Caption Format |
|---|---|---|---|---|---|---|
| t2i_3_18_2026.parquet | T2I | 62,227 | 46.2 MB | frame_hash_id, caption, s3_frame_path, width, height, is_home_depot | foundry-thd-enterprise-adobe-assets | SCAP v2 JSON |
| stock_3_18_2026.parquet | T2I | 162,480 | 120.7 MB | strImagehash, caption, s3_frame_path, width, height, query, is_home_depot | mldp-image | SCAP v2 JSON |
| ie_3_18_2026.parquet | IE | 40,964 | 34.8 MB | frame_hash_id, caption, target_image, reference_images, edit_instruction | foundry-thd-enterprise-adobe-assets | SCAP + Edit Instruction |
| multiref_3_18_2026.parquet | MultiRef | 4,017 | 19.0 MB | frame_hash_id, caption, edit_instruction, reference_images, target_image, num_reference, source | foundry-thd-enterprise-adobe-assets | Rich Caption + Edit Instruction |
| multiref_subset_02262026.parquet | MultiRef-v0 | 2,543 | 3.7 MB | unique_id, reference_images, target_image, source, edit_instruction | foundry-thd-enterprise-adobe-assets | After Caption + Edit Instruction |

`20260330_lite_thd.yaml`, lines 1758-1770.

Effective THD data for T2I: 40,530 (from the HD parquet), used at 80% weight. The 162K stock set provides the non-THD portion plus keyword-matched THD-adjacent content.

Filter: `height >= 1080`

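
One possible reading of the 80% weight is a per-sample source probability: each training sample comes from the THD pool with probability 0.8 and from the stock pool otherwise. The sketch below illustrates only that interpretation. The actual semantics are whatever the YAML config defines, the 20% stock share is an assumption, and the function names are hypothetical.

```python
import random

THD_ROWS = 40_530     # effective THD rows for T2I (from the notes)
STOCK_ROWS = 162_480  # stock parquet rows (from the file table)


def sample_source(rng, thd_weight=0.8):
    """Pick the pool for the next sample (assumed meaning of '80% weight')."""
    return "thd" if rng.random() < thd_weight else "stock"


def mix_counts(num_draws, thd_weight=0.8, seed=0):
    """Count how many of num_draws samples land in each pool."""
    rng = random.Random(seed)
    counts = {"thd": 0, "stock": 0}
    for _ in range(num_draws):
        counts[sample_source(rng, thd_weight)] += 1
    return counts


def revisit_ratio(thd_weight=0.8, thd_rows=THD_ROWS, stock_rows=STOCK_ROWS):
    """How much more often a single THD row is drawn than a single stock row."""
    thd_rate = thd_weight / thd_rows
    stock_rate = (1.0 - thd_weight) / stock_rows
    return thd_rate / stock_rate


if __name__ == "__main__":
    print(mix_counts(10_000))
    print(f"THD rows are revisited about {revisit_ratio():.1f}x as often as stock rows")
```

Under these assumptions each THD row is drawn roughly 16 times as often as each stock row, which is worth keeping in mind when judging overfitting on the THD portion.
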
| Name | Type | Resolution | Eval N | Config Path |
|---|---|---|---|---|
| i2i_1024p_multiref_thd | MultiRef | 1024 | 12 | foundry_home_depot_eval_set.json |
| i2i_1024p_multiref_thd_sampled | MultiRef | 1024 | 125 | foundry_home_depot_inference_set.json |
| i2i_1024p_product_swap_thd | ProductSwap | 1024 | 10 | thd_multiref_x2x_ready_gen6.json |
| i2i_1024p_singleref_ie_thd | IE | 1024 | 10 | thd_ie_eval_conversational_gen6.json |
| t2i_thd_mixed_benchmark | T2I | 1024/2048 | 45 | thd_t2i_mixed_benchmark_conversational_gen6.json |
| *_rewrite variants (6 sets) | Rewrite | 1024/2048 | 10-45 | prompt-rewrite-03272026/gen6/*.json |

| Task | Data Size (rows) | GPUs | Est. Epochs per 1K Steps | Overfitting Risk |
|---|---|---|---|---|
| T2I | 224,707 | 64 | ~0.3 | Low |
| IE | 40,964 | 32 | ~0.8 | Medium |
| MultiRef | 4,017 | 16 | ~4.0 | HIGH |
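
The epoch estimates above can be reproduced with simple arithmetic. The sketch below assumes a per-GPU batch size of 1 and no gradient accumulation; neither is stated in these notes, and the value 1 is chosen only because it matches the table's figures. The function name is hypothetical.

```python
def epochs_per_steps(dataset_rows, num_gpus, steps=1000, per_gpu_batch=1):
    """Epochs covered after `steps` steps.

    Assumes every GPU consumes `per_gpu_batch` samples per step
    (per_gpu_batch=1 is an assumption, not a documented setting).
    """
    samples_seen = steps * num_gpus * per_gpu_batch
    return samples_seen / dataset_rows


# (rows, GPUs) per task; the T2I row count is the sum of the two T2I parquets.
TASKS = {
    "T2I": (62_227 + 162_480, 64),
    "IE": (40_964, 32),
    "MultiRef": (4_017, 16),
}

if __name__ == "__main__":
    for task, (rows, gpus) in TASKS.items():
        print(f"{task}: {epochs_per_steps(rows, gpus):.2f} epochs per 1K steps")
```

With these assumptions the values come out to roughly 0.28, 0.78, and 3.98, in line with the table's ~0.3, ~0.8, and ~4.0. MultiRef passes over its full dataset about four times per 1K steps, which is why it carries the highest overfitting risk.
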

| # | Category | Group | Products | Spin Frames | Lifestyle | Triplets | Internal Data | Readiness | Status |
|---|---|---|---|---|---|---|---|---|---|

The `foundry-thd-enterprise-adobe-assets` bucket is now accessible via the `foundry_aws_gateway` library with `PLUTO_AUTH_TOKEN`. No IAM ticket is needed.