Large-scale Field Data Research Articles

Farmers grow crops in specific sequences to lower disease pressure and boost crop productivity, particularly in organic farming where artificial pesticides and chemical fertilisers are prohibited. Knowledge about crop sequences used in organic and conventional farming will aid the development of future farming systems through optimising crop diversity and pre-crop effects for improved resource efficiency. This study aims to investigate crop diversity and patterns in organic and conventional crop sequences in Sweden. Large-scale LPIS field data managed by the European Union (EU) Integrated Administration and Control System (IACS) were used to monitor crop sequences on arable land in Sweden over 10 consecutive years (2005–2014). Individual fields (land parcels) could be followed on 40% of Sweden’s total arable area (349,891 fields extracted) over the 10 years. The LPIS data was combined with information from a database on which fields were farmed organically. Crop distribution, diversity of crop sequences and pre-crops to the main cereal crops (winter wheat, spring barley) were analysed in organic and conventional farming systems in the five agricultural productivity zones of Sweden. The results showed that in the most productive zone in southernmost Sweden, small-grain cereals (particularly winter wheat) were the most common crops (62%), followed by oilseeds (11%), ley and forage crops (9%) and sugar beet (8%), when excluding permanent grassland. In the least productive zone (at higher altitudes and/or latitudes), ley and forage crops dominated (67%), followed by spring cereals (barley, oats) (23%). Crop diversity was higher in the two more productive zones (mean 4.6 crop types) than in two less productive zones (3.4) and organic farms showed 9% higher crop diversity than conventional farms in the most productive zones. Overall, in all zones, the pre-crop to winter wheat was generally a different crop type (3 out of 5 times) e.g., young ley (1–2 years) or grain legume, while the pre-crop to spring barley was most often (4 out of 5 times) another cereal. For both these crops, pre-crop type was more diverse in organic than conventional systems. These findings demonstrate that LPIS data can offer valuable insights into agronomic trends and on-farm practices regarding crop choice and that analysis of field-level LPIS data on crop sequences at large scale can reveal information about organic and conventional cropping in different productivity zones across countries. This information can be used to understand the practical limitations in the use of crop diversity to maximise pre-crop effects. This could in turn support advisory service and policy makers to facilitate more sustainable, productive and resource efficient crop production.

The increasing availability of complex, geo-referenced on-farm data demands analytical frameworks that can guide crop management recommendations. Recent developments in interpretable machine learning techniques offer opportunities to use these methods in agronomic studies. Our objectives were two-fold: (1) to assess the performance of different machine learning methods to explain on-farm wheat yield variability in the Northwestern Indo-Gangetic Plains of India, and (2) to identify the most important drivers and interactions explaining wheat yield variability. A suite of fine-tuned machine learning models (ridge and lasso regression, classification and regression trees, k-nearest neighbor, support vector machines, gradient boosting, extreme gradient boosting, and random forest) were statistically compared using the R 2 , root mean square error (RMSE), and mean absolute error (MAE). The best performing model was again fine-tuned using a grid search approach for the bias-variance trade-off. Three post-hoc model agnostic techniques were used to interpret the best performing model: variable importance (a variable was considered “important” if shuffling its values increased or decreased the model error considerably), interaction strength (based on Friedman’s H-statistic), and two-way interaction (i.e., how much of the total variability in wheat yield was explained by a particular two-way interaction). Model outputs were compared against empirical data to contextualize results and provide a blueprint for future analysis in other production systems. Tree-based and decision boundary-based methods outperformed regression-based methods in explaining wheat yield variability. Random forest was the best performing method in terms of goodness-of-fit and model precision and accuracy with RMSE, MAE, and R 2 ranging between 367 and 470 kg ha −1 , 276–345 kg ha −1 , and 0.44–0.63, respectively. Random forest was then used for selection of important variables and interactions. The most important management variables explaining wheat yield variability were nitrogen application rate and crop residue management, whereas the average of monthly cumulative solar radiation during February and March (coinciding with reproductive phase of wheat) was the most important biophysical variable. The effect size of these variables on wheat yield ranged between 227 kg ha −1 for nitrogen application rate to 372 kg ha −1 for cumulative solar radiation during February and March. The effect of important interactions on wheat yield was detected in the data namely the interaction between crop residue management and disease management and, nitrogen application rate and seeding rate. For instance, farmers’ fields with moderate disease incidence yielded 750 kg ha −1 less when crop residues were removed than when crop residues were retained. Similarly, wheat yield response to residue retention was higher under low seed and N application rates. As an inductive research approach, the appropriate application of interpretable machine learning methods can be used to extract agronomically actionable information from large-scale farmer field data. • Data-driven agronomic research requires new analytical and methodological approaches. • Machine learning methods were used to disentangle complex relationships in farmer field data. • Model-agnostic tools were used to derive agronomic interpretations. • Residue management and N application rate were important management variables for wheat yield. • Residue management interacted with other practices to explain wheat yield variability.

Large-scale Field Data Research Articles

Articles published on Large-scale Field Data

The Genesis Effect: Digital Goods in the Metaverse

재택근무가 업무 생산성에 미치는 영향과 업무 특성의 조절 효과

Land Parcel Identification System (LPIS) data allows identification of crop sequence patterns and diversity in organic and conventional farming systems.

The Art of Slowness: Slow Motion Enhances Consumer Evaluations by Increasing Processing Fluency

Economic costs of the invasive Yellow-legged hornet on honey bees

Spherical Planting Inversion of GRAIL Data

Energy dissipation model for irregular breaking waves owing to air bubbles

Interpretable machine learning methods to explain on-farm yield variability of high productivity wheat in Northwest India

An efficient 3D wave-equation pre-stack time migration for high-density and wide-azimuth data

An enhanced hybrid deep neural network reduced-order model for transonic buffet flow prediction

TriCTI: an actionable cyber threat intelligence discovery system via trigger-enhanced neural network

How do ethical consumers utilize sharing economy platforms as part of their sustainable resale behavior? The role of consumers’ green consumption values

Data-Driven Time-Frequency Method and Its Application in Detection of Free Gas Beneath a Gas Hydrate Deposit

Earth current and charge accumulation in the electrical interface during geomagnetic storm using largescale geo-electric field data

Earth current and charge accumulation in the electrical interface during geomagnetic storm using largescale geo-electric field data

Do Consumers Order More Calories in a Meal with a Diet or Regular Soft Drink? An Empirical Investigation Using Large-Scale Field Data

Nothing to Worry About: Why Liberals Underestimate Dominant Leaders and Act Complacently

What is beautiful is not always good: influence of machine learning-derived photo attractiveness on intention to initiate social interactions in mobile dating applications

Species occurrence relates to pesticide gradient in streams

Customer attrition analysis in the securities industry: a large-scale field study in Korea

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Large-scale Field Data Research Articles

Articles published on Large-scale Field Data

The Genesis Effect: Digital Goods in the Metaverse

재택근무가 업무 생산성에 미치는 영향과 업무 특성의 조절 효과

Land Parcel Identification System (LPIS) data allows identification of crop sequence patterns and diversity in organic and conventional farming systems.

The Art of Slowness: Slow Motion Enhances Consumer Evaluations by Increasing Processing Fluency

Economic costs of the invasive Yellow-legged hornet on honey bees

Spherical Planting Inversion of GRAIL Data

Energy dissipation model for irregular breaking waves owing to air bubbles

Interpretable machine learning methods to explain on-farm yield variability of high productivity wheat in Northwest India

An efficient 3D wave-equation pre-stack time migration for high-density and wide-azimuth data

An enhanced hybrid deep neural network reduced-order model for transonic buffet flow prediction

TriCTI: an actionable cyber threat intelligence discovery system via trigger-enhanced neural network

How do ethical consumers utilize sharing economy platforms as part of their sustainable resale behavior? The role of consumers’ green consumption values

Data-Driven Time-Frequency Method and Its Application in Detection of Free Gas Beneath a Gas Hydrate Deposit

Earth current and charge accumulation in the electrical interface during geomagnetic storm using largescale geo-electric field data

Earth current and charge accumulation in the electrical interface during geomagnetic storm using largescale geo-electric field data

Do Consumers Order More Calories in a Meal with a Diet or Regular Soft Drink? An Empirical Investigation Using Large-Scale Field Data

Nothing to Worry About: Why Liberals Underestimate Dominant Leaders and Act Complacently

What is beautiful is not always good: influence of machine learning-derived photo attractiveness on intention to initiate social interactions in mobile dating applications

Species occurrence relates to pesticide gradient in streams

Customer attrition analysis in the securities industry: a large-scale field study in Korea