Agriculture AI faces dataset shortages and weak model transferability

Crop monitoring requires vast amounts of labeled images covering different plant species, growth stages, and environmental conditions. Yet most available datasets remain small, fragmented, or region-specific, limiting the ability of models to generalize effectively.


CO-EDP, VisionRICO-EDP, VisionRI | Updated: 19-09-2025 22:45 IST | Created: 19-09-2025 22:45 IST
Agriculture AI faces dataset shortages and weak model transferability
Representative Image. Credit: ChatGPT

Artificial intelligence is moving to the center of agricultural science, but challenges around data and collaboration threaten to slow progress. Researchers recently analysed how deep learning is being used to monitor crops, offering both a roadmap for innovation and a warning of persistent weaknesses.

Their paper, A Bibliometric Review of Deep Learning in Crop Monitoring: Trends, Challenges, and Future Perspectives, published in Frontiers in Artificial Intelligence, analyzes more than 650 academic publications from 2000 to 2024. Using bibliometric and knowledge graph methods with tools such as VOSviewer and CiteSpace, the authors trace how deep learning, often linked with remote sensing, is reshaping research in food security, pest detection, and yield forecasting.

How deep learning is reshaping agricultural monitoring

The review finds that deep learning has become the backbone of modern crop monitoring, with convolutional neural networks (CNNs) dominating the field. These models are now widely applied to tasks such as disease detection, pest management, and growth assessment. The technology is frequently paired with data from drones, satellites, and other remote sensing systems, allowing researchers to capture detailed, real-time insights across vast agricultural landscapes.

Over the past decade, the pace of adoption has accelerated. Publications in the field increased sharply after 2015, reflecting advances in both computing power and the availability of higher-resolution imagery. CNNs remain the most frequently deployed architectures, but newer models are beginning to emerge. The authors note that the integration of deep learning with unmanned aerial vehicles and satellite remote sensing has been particularly impactful in enabling real-time monitoring at scale.

Beyond identifying pests or predicting yields, deep learning models are being adapted for tasks such as soil health evaluation and climate resilience studies. This versatility, the researchers argue, demonstrates the transformative potential of AI for global food systems, particularly in the face of climate change and rising population pressures.

What barriers are holding back progress

The review also points out several barriers that must be addressed for deep learning in agriculture to reach full maturity. Foremost is the scarcity of high-quality annotated datasets. Crop monitoring requires vast amounts of labeled images covering different plant species, growth stages, and environmental conditions. Yet most available datasets remain small, fragmented, or region-specific, limiting the ability of models to generalize effectively.

The second challenge is weak model generalization across diverse agricultural environments. Models trained in one region often fail when applied elsewhere due to differences in climate, soil composition, and farming practices. This lack of transferability makes it difficult to deploy AI systems at scale, especially in developing countries where agricultural conditions vary widely.

A third barrier is the difficulty of fusing data from multiple sources. While researchers now have access to imagery from satellites, drones, and ground sensors, combining these streams into coherent, high-performing models remains technically complex. Without effective data fusion, the benefits of multimodal monitoring are left unrealized.

The authors also highlight the need for stronger interdisciplinary collaboration. Much of the research is siloed within computer science or agricultural science, with limited integration between the two fields. Without more cross-domain cooperation, they warn, progress will stall on problems that require both technical innovation and agronomic expertise.

Where the field must go next

The researchers call for a strategic shift in priorities. Building large-scale, open-access agricultural datasets is described as the most urgent task. These should include diverse crop species, geographic regions, and environmental conditions to support robust, transferable models. International collaboration and shared funding mechanisms will be critical to achieving this goal.

The review also points to the development of models that can adapt to dynamic conditions. Advances in transfer learning and domain adaptation may allow systems trained in one setting to be recalibrated quickly for another. Such adaptability will be vital for global deployment, especially in regions with limited local data.

Data fusion is flagged as another frontier. By integrating multiple data sources, from satellites to soil sensors, researchers can capture a more complete picture of crop health. Achieving this will require advances in multimodal deep learning and closer collaboration between disciplines.

Finally, the authors stress the importance of sustained interdisciplinary partnerships. Agronomists, computer scientists, and policymakers must work together to design systems that are not only technically sound but also practical for farmers and scalable in real-world conditions. Without this collaborative approach, the potential of deep learning to support food security may remain unfulfilled.

  • FIRST PUBLISHED IN:
  • Devdiscourse
Give Feedback