For detailed mesh models reconstruction, the only way is to separate large target area into small regions and reconstruction them separately. Large scale models always have less detailed information than small scale models. “Region A” dataset with 64 photos can generate a model with 1441254 faces in the study area. When combining photos from “Region A” and aerial photos, this number drop down to 67478 faces. Figure 87 illustrates four models according to Table 13: Model (a) is derived from the UAV dataset. This model is accurate but lack of details. Model (b) is derived from the Nikon dataset, this model is more detailed but the mean displacement is about 0.093 meters. Model (c) is generated from a combined dataset which contains the UAV photos and the "Region A" photos, this model hasn’t improve a lot details but has improved the accuracy to 0.0492. The last model (d) is generated from "Region A" dataset and was georeferenced by using local control points. Its accuracy has been improved to 0.053 and the number of generated vertices is the highest.