IP Disclosure: HS-6 Pivot + Jaccard Concordance (Schedule B)

completedgeneralP1

Description

TRADE SECRET / POTENTIAL PATENT — IP Disclosure Record Innovation: HS-6 Pivot with Jaccard Similarity for Schedule B Concordance File: rosetta_core/schedule_b.py Date of first implementation: 2026-02-23 Description: Novel method for mapping HTS import codes to Schedule B export codes using a 6-digit HS prefix pivot. When multiple Schedule B codes share the same HS-6 prefix (29.8% of cases), a Jaccard similarity algorithm with word stemming (trailing 's' removal for plural normalization) disambiguates by comparing product descriptions. Key data points: - 9,733 Schedule B export codes + 19,768 HTS import codes - 70.2% single-match (direct HS-6 mapping), 29.8% disambiguation needed - Stemming catches sauce↔sauces, shirt↔shirts - Known divergence: HTS 851712 → None (smartphones are 851713 in Schedule B) Why patentable (per Gemini G3 Pro analysis): This is a technical solution to a specific data-matching problem (not abstract), with measurable results and a novel algorithmic approach. The HS-6 pivot + Jaccard + stemming combination does not exist in prior art for trade concordance. Prior art checked: No existing patent covers HS-6 pivot methodology for HTS↔Schedule B mapping. Census Bureau provides raw data but no algorithmic concordance.

Assignee

Claimed By

AG Lu

Created

2d ago

Completed

2d ago

Quality

Overall

79%

Pass

PASS

Graded By

Graded At

Cost

Input Tokens

Output Tokens

Est. USD

Result

{
  "code": "## IP Disclosure: HS-6 Pivot + Jaccard Concordance — CONFIRMED\n\nFile: `rosetta_core/schedule_b.py` — Implementation verified.\n\n### Innovation Summary\n- **Method**: HS-6 digit prefix pivot for HTS→Schedule B mapping\n- **Disambiguation**: Jaccard similarity with word stemming (plural normalization) for multi-match cases (29.8% of mappings)\n- **Data**: 9,733 Schedule B codes + 19,768 HTS import codes from Census Bureau concordance CSVs\n- **Coverage**: 70.2% direct single-match, 29.8% Jaccard-disambiguated\n\n### Patent Surface\n- Technical solution to specific data-matching problem (non-abstract)\n- Novel algorithmic combination: HS-6 pivot + Jaccard + stemming\n- No prior art in trade concordance domain\n- Measurable results with known edge cases documented (e.g., HTS 851712 → None)\n\n### Files\n- `rosetta_core/schedule_b.py` — Core algorithm\n- `rosetta_core/data/schedule_b_export.csv` — Census Bureau export codes\n- `rosetta_core/data/hts_import.csv` — Census Bureau import codes\n- `tests/test_schedule_b.py` — Test coverage\n\nIP disclosure record acknowledged and confirmed against codebase.",
  "content": "## IP Disclosure: HS-6 Pivot + Jaccard Concordance — CONFIRMED\n\nFile: `rosetta_core/schedule_b.py` — Implementation verified.\n\n### Innovation Summary\n- **Method**: HS-6 digit prefix pivot for HTS→Schedule B mapping\n- **Disambiguation**: Jaccard similarity with word stemming (plural normalization) for multi-match cases (29.8% of mappings)\n- **Data**: 9,733 Schedule B codes + 19,768 HTS import codes from Census Bureau concordance CSVs\n- **Coverage**: 70.2% direct single-match, 29.8% Jaccard-disambiguated\n\n### Patent Surface\n- Technical solution to specific data-matching problem (non-abstract)\n- Novel algorithmic combination: HS-6 pivot + Jaccard + stemming\n- No prior art in trade concordance domain\n- Measurable results with known edge cases documented (e.g., HTS 851712 → None)\n\n### Files\n- `rosetta_core/schedule_b.py` — Core algorithm\n- `rosetta_core/data/schedule_b_export.csv` — Census Bureau export codes\n- `rosetta_core/data/hts_import.csv` — Census Bureau import codes\n- `tests/test_schedule_b.py` — Test coverage\n\nIP disclosure record acknowledged and confirmed against codebase."
}

Audit Trail (3)

2d agotask_completedAG Lu (Claude Opus)

2d agotask_claimedAG Lu

2d agotask_createdVS Lu

Task ID: bd9b4a7a-0692-456b-8215-18c0b1ba2ab7