Automation in agriculture is revolutionizing the management of field tasks. This paper focuses on identifying the essential needs and preliminary technical requirements for designing an autonomous robot specifically for fruit harvesting. The study examines the integration of advanced machine vision technologies and neural networks for the accurate detection of ripe fruits and obstacles, optimizing harvesting efficiency. Key features such as proximity and touch sensors are also evaluated to ensure the safe handling of fruits, minimizing damage during harvesting. The technical specifications, including traction, speed, and the robot’s ability to operate on uneven terrain, are analyzed to ensure adaptability across various crops and ground conditions. Perception systems, incorporating cameras and sensors, are crucial for crop inspection and route planning. The main findings of this review establish a set of technical requirements that serve as a foundation for developing efficient and economically viable fruit-harvesting robots. These systems offer significant potential to transform agriculture by addressing labor shortages and improving harvesting precision. However, challenges remain, particularly in adapting to diverse crops and environments, emphasizing the need for further research and development.