Skip to content

DMDB curie update#606

Draft
ahueb wants to merge 1 commit intoSuLab:mainfrom
ahueb:dmdb-update
Draft

DMDB curie update#606
ahueb wants to merge 1 commit intoSuLab:mainfrom
ahueb:dmdb-update

Conversation

@ahueb
Copy link

@ahueb ahueb commented Oct 16, 2024

added utility to normalize all curies in the nodes and links of an indication_paths.json document using translator nodenorm, updated indication_paths.json file included in PR

Total unique normalized IDs: 4857
Total unique failed IDs: 271

95% of curies align with nodenorm

most unresolved curies appear to be interpro

[
  "MESH",
  "InterPro",
  "CHEBI",
  "GO",
  "UniProtKB",
  "Pfam",
  "TIGR",
  "DrugBank",
  "NCBITaxon",
  "REACT"
]

…dication_paths.json document using translator nodenorm
@alexeistepa
Copy link

@ahueb Thank you for this, extremely helpful!

One small comment in case you wanted to merge this: in indication_paths-normalized-2024-10-15.json, the drug_mesh and disease_mesh keys should probably be renamed to drug_id and disease_id (or similar)

"""
graph = entry.get('graph', {})
entry['graph'] = {k: id_map.get(v, v) if isinstance(v, str) else v for k, v in graph.items()}
entry['links'] = [{

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

If you're aiming to merge this, would be cleaner to also update the nodes as well

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants