Are visual features of real-world objects stored as bound units? Previous research has shown that simple visual features (e.g., colored squares or geometric shapes) can be effectively bound together when forming predictable pairs in memory tasks. Through a "memory compression" process, observers can take advantage of these features to compress them into a chunk. However, a recent study found that visual features in real-world objects are stored independently. In the present study, we explored this issue by using drawings of fruits as memory stimuli, presenting four pictures of fruit in separate test trials in which we required observers to remember eight total features (i.e., four colors and four shapes). In the congruent trials, the color of the fruit matched its natural appearance (e.g., a red apple), while in incongruent trials, the color of the fruit mismatched its natural appearance (e.g., a red banana). We paired the shape of the fruits randomly with a color (without replacement). According to chunking theory, if visual features of real-world objects are stored in a chunk, the highest memory capacity should be accompanied by the longest response time in congruent trials due to an extra decoding process required from the chunk. We did find that participants had the highest memory capacity in the congruent condition, but their response times in the congruent condition were significantly faster than in the incongruent condition. Thus, observers did not undergo a decoding process in the congruent condition, and we concluded that visual features in real-world objects are not stored in a chunk.