Identifying Animals in Camera Trap Images via Neural Architecture Search.

Liang Jia, Ye Tian, Junguo Zhang
Author Information
  1. Liang Jia: School of Technology, Beijing Forestry University, Beijing 100083, China. ORCID
  2. Ye Tian: School of Technology, Beijing Forestry University, Beijing 100083, China. ORCID
  3. Junguo Zhang: School of Technology, Beijing Forestry University, Beijing 100083, China. ORCID

Abstract

Wild animals are essential for ecosystem structuring and stability, and thus they are important for ecological research. Since most wild animals have high athletic or concealable abilities or both, it is used to be relatively difficult to acquire evidence of animal appearances before applications of camera traps in ecological researches. However, a single camera trap may produce thousands of animal images in a short period of time and inevitably ends up with millions of images requiring classification. Although there have been many methods developed for classifying camera trap images, almost all of them follow the pattern of a very deep convolutional neural network processing all camera trap images. Consequently, the corresponding surveillance area may need to be delicately controlled to match the network capability, and it may be difficult to expand the area in the future. In this study, we consider a scenario in which camera traps are grouped into independent clusters, and images produced by a cluster are processed by an edge device installed with a customized network. Accordingly, edge devices in this scenario may be highly heterogeneous due to cluster scales. Resultantly, networks popular in the classification of camera trap images may not be deployable for edge devices without modifications requiring the expertise which may be hard to obtain. This motivates us to automatize network design via neural architecture search for edge devices. However, the search may be costly due to the evaluations of candidate networks, and its results may be infeasible without considering the resource limits of edge devices. Accordingly, we propose a search method using regression trees to evaluate candidate networks to lower search costs, and candidate networks are built based on a meta-architecture automatically adjusted regarding to the resource limits. In experiments, the search consumes 6.5 hours to find a network applicable to the edge device Jetson X2. The found network is then trained on camera trap images through a workstation and tested on Jetson X2. The network achieves competitive accuracies compared with the automatically and the manually designed networks.

References

  1. Neural Netw. 2020 Mar;123:305-316 [PMID: 31896462]
  2. Proc Natl Acad Sci U S A. 2018 Jun 19;115(25):E5716-E5725 [PMID: 29871948]
  3. Trends Ecol Evol. 2015 Nov;30(11):685-696 [PMID: 26437636]
  4. Science. 2014 Jan 10;343(6167):1241484 [PMID: 24408439]
  5. Neural Comput. 1997 Nov 15;9(8):1735-80 [PMID: 9377276]
  6. R Soc Open Sci. 2019 Mar 6;6(3):181748 [PMID: 31032031]
  7. Sci Rep. 2019 May 31;9(1):8137 [PMID: 31148564]
  8. Science. 2019 Oct 4;366(6461):120-124 [PMID: 31604313]
  9. Sci Adv. 2015 May 01;1(4):e1400103 [PMID: 26601172]
  10. Sci Data. 2015 Jun 09;2:150026 [PMID: 26097743]

MeSH Term

Animals
Ecosystem
Neural Networks, Computer

Word Cloud

Created with Highcharts 10.0.0maycameraimagesnetworkedgetrapnetworkssearchdevicescandidateanimalsecologicaldifficultanimaltrapsHoweverrequiringclassificationneuralareascenarioclusterdeviceAccordinglyduewithoutviaresourcelimitsautomaticallyJetsonX2WildessentialecosystemstructuringstabilitythusimportantresearchSincewildhighathleticconcealableabilitiesusedrelativelyacquireevidenceappearancesapplicationsresearchessingleproducethousandsshortperiodtimeinevitablyendsmillionsAlthoughmanymethodsdevelopedclassifyingalmostfollowpatterndeepconvolutionalprocessingConsequentlycorrespondingsurveillanceneeddelicatelycontrolledmatchcapabilityexpandfuturestudyconsidergroupedindependentclustersproducedprocessedinstalledcustomizedhighlyheterogeneousscalesResultantlypopulardeployablemodificationsexpertisehardobtainmotivatesusautomatizedesignarchitecturecostlyevaluationsresultsinfeasibleconsideringproposemethodusingregressiontreesevaluatelowercostsbuiltbasedmeta-architectureadjustedregardingexperimentsconsumes65 hoursfindapplicablefoundtrainedworkstationtestedachievescompetitiveaccuraciescomparedmanuallydesignedIdentifyingAnimalsCameraTrapImagesNeuralArchitectureSearch

Similar Articles

Cited By (2)