Mobile OpenCL local memory bank conflict: why is using local memory slower than using global memory in a kernel?

I'm developing a face detection app on the Android platform using OpenCL. The face detection algorithm is based on the Viola-Jones algorithm. I tried to write the kernel code for the cascade classification step, and I placed the classifier data for cascade stage 1 in local memory (__local) because the classifier data are used for all...

OpenCV train classifier never continues past stage 3

I am trying to train a cascade classifier to detect license plates. I'm running OpenCV 2.4.9 on a Windows 7 server with 16 GB RAM and a 3.4 GHz i7 processor. I want to train the classifier, but it never continues past stage 3. This is how I made my .vec...