← Tasks
IP Disclosure: HS-6 Pivot + Jaccard Concordance (Schedule B)
completedgeneralP1
Description
TRADE SECRET / POTENTIAL PATENT — IP Disclosure Record
Innovation: HS-6 Pivot with Jaccard Similarity for Schedule B Concordance
File: rosetta_core/schedule_b.py
Date of first implementation: 2026-02-23
Description: Novel method for mapping HTS import codes to Schedule B export codes using a 6-digit HS prefix pivot. When multiple Schedule B codes share the same HS-6 prefix (29.8% of cases), a Jaccard similarity algorithm with word stemming (trailing 's' removal for plural normalization) disambiguates by comparing product descriptions.
Key data points:
- 9,733 Schedule B export codes + 19,768 HTS import codes
- 70.2% single-match (direct HS-6 mapping), 29.8% disambiguation needed
- Stemming catches sauce↔sauces, shirt↔shirts
- Known divergence: HTS 851712 → None (smartphones are 851713 in Schedule B)
Why patentable (per Gemini G3 Pro analysis): This is a technical solution to a specific data-matching problem (not abstract), with measurable results and a novel algorithmic approach. The HS-6 pivot + Jaccard + stemming combination does not exist in prior art for trade concordance.
Prior art checked: No existing patent covers HS-6 pivot methodology for HTS↔Schedule B mapping. Census Bureau provides raw data but no algorithmic concordance.
Assignee
-
Claimed By
AG Lu
Created
2d ago
Completed
2d ago
Quality
Overall
79%
Pass
PASS
Graded By
-
Graded At
-
Cost
Input Tokens
-
Output Tokens
-
Est. USD
-
Result
{
"code": "## IP Disclosure: HS-6 Pivot + Jaccard Concordance — CONFIRMED\n\nFile: `rosetta_core/schedule_b.py` — Implementation verified.\n\n### Innovation Summary\n- **Method**: HS-6 digit prefix pivot for HTS→Schedule B mapping\n- **Disambiguation**: Jaccard similarity with word stemming (plural normalization) for multi-match cases (29.8% of mappings)\n- **Data**: 9,733 Schedule B codes + 19,768 HTS import codes from Census Bureau concordance CSVs\n- **Coverage**: 70.2% direct single-match, 29.8% Jaccard-disambiguated\n\n### Patent Surface\n- Technical solution to specific data-matching problem (non-abstract)\n- Novel algorithmic combination: HS-6 pivot + Jaccard + stemming\n- No prior art in trade concordance domain\n- Measurable results with known edge cases documented (e.g., HTS 851712 → None)\n\n### Files\n- `rosetta_core/schedule_b.py` — Core algorithm\n- `rosetta_core/data/schedule_b_export.csv` — Census Bureau export codes\n- `rosetta_core/data/hts_import.csv` — Census Bureau import codes\n- `tests/test_schedule_b.py` — Test coverage\n\nIP disclosure record acknowledged and confirmed against codebase.",
"content": "## IP Disclosure: HS-6 Pivot + Jaccard Concordance — CONFIRMED\n\nFile: `rosetta_core/schedule_b.py` — Implementation verified.\n\n### Innovation Summary\n- **Method**: HS-6 digit prefix pivot for HTS→Schedule B mapping\n- **Disambiguation**: Jaccard similarity with word stemming (plural normalization) for multi-match cases (29.8% of mappings)\n- **Data**: 9,733 Schedule B codes + 19,768 HTS import codes from Census Bureau concordance CSVs\n- **Coverage**: 70.2% direct single-match, 29.8% Jaccard-disambiguated\n\n### Patent Surface\n- Technical solution to specific data-matching problem (non-abstract)\n- Novel algorithmic combination: HS-6 pivot + Jaccard + stemming\n- No prior art in trade concordance domain\n- Measurable results with known edge cases documented (e.g., HTS 851712 → None)\n\n### Files\n- `rosetta_core/schedule_b.py` — Core algorithm\n- `rosetta_core/data/schedule_b_export.csv` — Census Bureau export codes\n- `rosetta_core/data/hts_import.csv` — Census Bureau import codes\n- `tests/test_schedule_b.py` — Test coverage\n\nIP disclosure record acknowledged and confirmed against codebase."
}Audit Trail (3)
2d agotask_completedAG Lu (Claude Opus)
2d agotask_claimedAG Lu
2d agotask_createdVS Lu
Task ID: bd9b4a7a-0692-456b-8215-18c0b1ba2ab7