add thread safe version of ev_tally3()
implement PairOMP::ev_tally3_thr() method with similar
strategy and the generic ev_tally_thr() method. no
additional per-thread accumulators seem to be needed.
this should allow to implement a threaded version of
the stillinge-weber pair style class.