add ca fertilization harmonization and ncc compost support by divine7022 · Pull Request #4002 · PecanProject/pecan

divine7022 · 2026-05-18T18:43:43Z

Description

folds standalone N fertilization harmonization (was living at /projectnb/dietzelab/ccmmf/management/fertilization/harmonization.R) into data.land data-raw pipeline, then layers ncc (compost) sampling support on top and extends the events schema with an ncc event type

events_schema_v0.1.2.json picks up ncc as an allowed event_type with material, ncc_subtype, fert_subtype, and pft properties
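
As a rough illustration of what an ncc event might look like once the schema is extended, the R sketch below builds one event as a named list and reports which of the listed properties are missing. Only the property names (event_type, material, ncc_subtype, fert_subtype, pft) come from the description above. The example values, the check_ncc_event() helper, and the idea that all four properties must be present are assumptions made for illustration and are not taken from events_schema_v0.1.2.json.

```r
# Property names taken from the PR description; whether each one is
# required or optional in the real schema is an assumption here.
ncc_properties <- c("material", "ncc_subtype", "fert_subtype", "pft")

# Hypothetical helper: returns the names of any listed properties
# that are missing from an ncc event.
check_ncc_event <- function(event) {
  stopifnot(identical(event$event_type, "ncc"))
  setdiff(ncc_properties, names(event))
}

# Example values are placeholders, not values defined by the schema.
example_event <- list(
  event_type   = "ncc",
  material     = "example_material",
  ncc_subtype  = "example_subtype",
  fert_subtype = "example_fert_subtype",
  pft          = "example_pft"
)

missing_props <- check_ncc_event(example_event)
print(missing_props)
```

Validation against the real schema would be better done with a JSON Schema validator; this sketch only shows the shape of the record.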

separate workflow PR will consume these samplers from workflows/fertilization-statewide and workflows/ncc-statewide to emit ensemble events

cc @sarahkanee @mdietze @infotroph @dlebauer



divine7022 · 2026-05-24T00:57:14Z

high level, priors are out now, two ref tables (ca_compost_amendment, ca_n_application_rate) stay as bundled data for now. and new ag/management package vs bety priors vs whatever is separate conv

dlebauer · 2026-05-27T18:33:24Z

> new ag/management package vs bety priors vs whatever is separate conv

@divine7022 Could you please clarify?

dlebauer

Thank you for these changes.

General changes requested:

Better documentation

(Make sure to review the data packaging guidance from the Data chapter of the "R-Packages" book)

Documentation should cover both the source (TSV) and package (RDA) data:

Package datasets should be documented in roxygen, and should include:
- definitions of each field
- information about where the data came from:
  - references
  - how the dataset was compiled, including assumptions
  - link to more details in source data README.

Source datasets can be documented in a data-raw/README.md and should include:
- where data came from
- what decisions were made during curation
- note that for package datasets that do not have a manual curation step, the provenance of package data is documented by R scripts.
- how to update both the TSV and .rda files
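
A minimal sketch of what the requested roxygen documentation for a package dataset could look like is shown below. It assumes the common convention of documenting datasets in a file such as R/data.R. The dataset name ca_compost_amendment and the app_rate_min / app_rate_max columns appear in this PR; the `material` column name and all descriptive text are placeholders to be replaced with the real field definitions, units, and references.

```r
# R/data.R (hypothetical file location, following the usual convention)

#' California compost amendment reference table
#'
#' One-sentence summary of what the table contains (placeholder).
#'
#' @format A data frame with one row per material and these columns:
#' \describe{
#'   \item{material}{Name of the amendment material (placeholder column).}
#'   \item{app_rate_min}{Lower application rate (state the units here).}
#'   \item{app_rate_max}{Upper application rate (state the units here).}
#' }
#' @details Describe how the dataset was compiled, including any
#'   assumptions made during curation.
#' @source List references here and point to the data-raw README for
#'   more detail.
"ca_compost_amendment"
```

Documenting the dataset by quoting its name, rather than documenting the object itself, is the pattern roxygen2 expects for objects stored in data/.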

dataset names

Make sure naming is consistent, e.g. create_n_rate_data.R reads a file n_fertilizaton.tsv and generates a dataset ca_n_application_rate. This can get confusing.

Consider renaming compost.tsv to organic_amendments.tsv to better reflect the diversity of materials represented in the dataset.

dlebauer · 2026-05-27T20:37:12Z

@@ -0,0 +1,33 @@
+Material	C_MIN (C:N)	C_MAX (C:N)	C_Avg (C:N)	C_Assumed (%)	Total N (%)	4 week PAN (%)	LowerN/HigherN	RowsMIN_AppRate (tons/acre)	RowsMIN_AppRate (lbs/acre)	RowsMIN_Total_N (lbs N/acre)	RowsMIN_Avail_N (lbs N/acre)	RowsMIN_Total_C (lbs C/acre)	RowsMAX_AppRate (tons/acre)	RowsMAX_AppRate (lbs/acre)	RowsMAX_Total_N (lbs N/acre)	RowsMAX_Avail_N (lbs N/acre)	RowsMAX_Total_C (lbs C/acre)	TreesMIN_AppRate (tons/acre)	TreesMIN_AppRate (lbs/acre)	TreesMIN_Total_N (lbs N/acre)	TreesMIN_Avail_N (lbs N/acre)	TreesMIN_Total_C (lbs C/acre)	TreesMAX_AppRate (tons/acre)	TreesMAX_AppRate (lbs/acre)	TreesMAX_Total_N (lbs N/acre)	TreesMAX_Avail_N (lbs N/acre)	TreesMAX_Total_C (lbs C/acre)	Source


This table combines stoichiometric properties with assumed application-rate
scenarios.

would it make more sense to separate material properties from application-rates?

See more general requests about dataset documentation in my general review comments. Some specific questions are below

what columns come from the source, and what information was added during manual curation?

e.g. are the Rows* and Trees* application-rate values directly supported by
the cited sources, or did sources provide more granular, species level rates?

Why does the build script expose only the Rows* values in
ca_compost_amendment?

Is the intended distinction that synthetic N rates are crop-specific, while
compost/organic amendment rates are only available at a broader crop-structure
level?

Given the contents include manure, blood meal, paper, wood chips, straw, and
other organic materials, would organic_amendments.tsv or
compost_amendments.tsv be a clearer filename than compost.tsv?
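
To make the suggestion about separating material properties from application rates concrete, here is a small base R sketch. It uses a toy data frame with made-up values and simplified column names (the real TSV headers are longer), so it shows one possible layout under those assumptions rather than the structure the package should adopt.

```r
# Toy stand-in for the raw table; values are made up for illustration
# and column names are simplified versions of the raw TSV headers.
raw <- data.frame(
  material          = c("material_a", "material_b"),
  cn_min            = c(10, 20),
  cn_max            = c(15, 30),
  total_n_pct       = c(2.0, 1.0),
  rows_min_lb_acre  = c(2000, 4000),
  rows_max_lb_acre  = c(6000, 8000),
  trees_min_lb_acre = c(1000, 3000),
  trees_max_lb_acre = c(5000, 7000),
  stringsAsFactors  = FALSE
)

# Table 1: properties of the material itself.
material_properties <- raw[, c("material", "cn_min", "cn_max", "total_n_pct")]

# Table 2: assumed application-rate scenarios, one row per
# material and cropping system.
application_rates <- rbind(
  data.frame(material = raw$material, system = "rows",
             app_rate_min = raw$rows_min_lb_acre,
             app_rate_max = raw$rows_max_lb_acre),
  data.frame(material = raw$material, system = "trees",
             app_rate_min = raw$trees_min_lb_acre,
             app_rate_max = raw$trees_max_lb_acre)
)

# The two tables can be rejoined on `material` when needed.
combined <- merge(material_properties, application_rates, by = "material")
print(combined)
```

Keeping rates in a separate long table means derived quantities such as total N or total C per acre can be recomputed from the properties table instead of being stored alongside it.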

dlebauer · 2026-05-27T20:52:22Z

+# map each raw material name to one of the CalRecycle classes
+# (14 CCR section 17852). biosolids is empty in the current table.
+material_to_class <- function(m) {
+  s <- tolower(m)


where does m come from? I don't see it in compost.tsv
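
For context on the question above: in the snippet, `m` is the function's argument, so its value is supplied by whatever calls material_to_class(). The sketch below shows one way such a classifier might be applied to a column of material names. The function name classify_material(), the matching rules, and the class labels are hypothetical and do not reproduce the CalRecycle classes or the logic in the PR; the input names are drawn from materials mentioned elsewhere in this review.

```r
# Hypothetical stand-in for the classifier; the patterns and class
# labels below are illustrative, not the ones used in the PR.
classify_material <- function(m) {
  s <- tolower(m)
  if (grepl("manure", s, fixed = TRUE)) {
    "manure"
  } else if (grepl("wood|straw|paper", s)) {
    "plant_residue"
  } else {
    "other"
  }
}

# `m` is the function argument: each material name is passed in by
# the caller, here via vapply() over a character vector.
materials <- c("Manure", "Wood chips", "Blood meal")
classes <- vapply(materials, classify_material, character(1),
                  USE.NAMES = FALSE)
print(data.frame(material = materials, material_class = classes))
```

If the real build script works this way, naming the argument after the column it receives (for example `material`) would make the link to compost.tsv easier to see.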

dlebauer · 2026-05-27T20:56:23Z

+    app_rate_min = .data$`RowsMIN_AppRate (lbs/acre)`,
+    app_rate_max = .data$`RowsMAX_AppRate (lbs/acre)`,


why are only Rows[MIN|MAX] and not Trees[MIN|MAX] used here? (see also general question about documentation)
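
If both cropping systems were to be kept, the application-rate columns could be identified from their header names instead of being selected one at a time. The base R sketch below parses the four lbs/acre headers from the raw TSV into a cropping system and a bound. The header strings are copied from the diff above; the `col_key` lookup table and its column names are illustrative assumptions.

```r
# Column names copied from the raw TSV header shown in this PR.
rate_cols <- c(
  "RowsMIN_AppRate (lbs/acre)",
  "RowsMAX_AppRate (lbs/acre)",
  "TreesMIN_AppRate (lbs/acre)",
  "TreesMAX_AppRate (lbs/acre)"
)

# Split each header into cropping system (Rows/Trees) and bound (MIN/MAX).
pattern <- "^(Rows|Trees)(MIN|MAX)_AppRate \\(lbs/acre\\)$"
col_key <- data.frame(
  column = rate_cols,
  system = tolower(sub(pattern, "\\1", rate_cols)),
  bound  = tolower(sub(pattern, "\\2", rate_cols)),
  stringsAsFactors = FALSE
)
print(col_key)
```

A lookup like this could drive a reshape to one row per material and system, so that Trees values are carried through rather than dropped.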

dlebauer · 2026-05-27T21:07:32Z

@@ -0,0 +1,90 @@
+PFT Group	Crop	PlantStage	Season	MINN	MAXN	Unit	Source	Notes


Do crop-names come from a controlled vocabulary? Any reason not to use the LandIQ names here?

Where do PFTs come from? On one hand, row/woody is a simple designation, and there may be reasons to include a grouping in this dataset. But caution should be used and rationale/motivation documented because PFT groupings are opinionated, and in this case alfalfa is not typically considered a 'row' crop (it is forage).

dlebauer

@divine7022

To unblock #4003, which currently depends on this PR, you could:

- copy the .rda and .R files from this PR to the workflows/[fertilization|ncc]-statewide directories
- update #4003 to use those resources

This is in line with one of the motivations for the workflows directory - as a place to maintain and develop workflows until individual components can be generalized and extracted into PEcAn package functions and data

divine7022 added 30 commits May 18, 2026 14:20

rewrite create_n_rate_data data raw script

5b609f9

refresh n_application_rates csv

af7171a

rebuild ca_n_application_rate rda

9079a65

add oz_per_tree_to_lb_per_acre helper

4ba0b09

add doc for oz_per_tree_to_lb_per_acre

4da526e

add tpa_lookup helper

00a4ce8

add doc for tpa_lookup

b738e65

rewrite create_compost_data with material class

70f56c8

refresh compost_amendments csv

ef7d28c

rebuild ca_compost_amendment rda

1b96343

update doc for ca_compost_amendment

36ed147

add create_ca_compost_distributions data raw script

3598750

add ca_compost_pct_c_distribution rda

adc5190

add ca_compost_cn_distribution rda

9a15db8

add ca_compost_app_rate_envelope rda

e850cee

add ca_compost_calendar_window rda

c5cb2ad

add ca_compost_material_whitelist rda

5e2fff3

add doc for ca_compost_pct_c_distribution

e41539f

add doc for ca_compost_cn_distribution

de0bb71

add doc for ca_compost_app_rate_envelope

9b22635

add doc for ca_compost_calendar_window

4dfed63

add doc for ca_compost_material_whitelist

56a95f6

add sample_ca_compost functions

ee1ef5e

add doc for sample_ca_compost_pct_c

d858456

add doc for sample_ca_compost_cn

c5d72f0

add doc for sample_ca_compost_app_rate

c353ef5

add doc for sample_ca_compost_date_offset

1d24977

add doc for sample_ca_compost_material

a71ef65

add tests for sample_ca_compost

5f5172c

update bundled dataset docs

f52c1c3

divine7022 added 26 commits May 23, 2026 19:13

drop doc for ca_compost_material_whitelist

9a9dcf0

drop doc for sample_ca_compost_pct_c

7ee162b

drop doc for sample_ca_compost_cn

6d943fb

drop doc for sample_ca_compost_app_rate

3da67e0

drop doc for sample_ca_compost_date_offset

6e025b4

drop doc for sample_ca_compost_material

9433f59

drop oz_per_tree_to_lb_per_acre helper

0a715a6

drop doc for oz_per_tree_to_lb_per_acre

2467216

drop tpa_lookup helper

1e5dbe5

drop doc for tpa_lookup

aa60cf2

drop docs for removed compost distribution datasets

6d3850c

drop namespace exports for removed functions

b5c658b

update look_up_ca_compost_amendment for new column set

1bc7c15

regen doc for look_up_ca_compost_amendment

740cc8c

update lookup tests for recovered crops and new compost columns

4150ede

add raw compost tsv to data-raw

7a59c49

add raw n fertilization tsv to data-raw

229b2ac

read raw compost tsv from data-raw and use ud_convert

ca51a68

read raw n tsv from data-raw and use ud_convert

39ce033

rebuild ca_compost_amendment rda

50c2ffa

rebuild ca_n_application_rate rda

0cc6cdf

Merge remote-tracking branch 'origin/develop' into ncc-compost

7b3dd3b

update news entry to match new compost and n rate columns

76455c1

update ca_compost_amendment doc to mention material_class

335031c

regen doc for ca_compost_amendment

fc67a4e

Merge branch 'ncc-compost' of github.com:divine7022/pecan into ncc-compost

b7d07e3

dlebauer reviewed May 27, 2026

divine7022 wants to merge 75 commits into PecanProject:develop from divine7022:ncc-compost


3 participants
