Two kinds of improved parallel algorithms for 3D PIC simulation have been designed, which can reduce one step of process synchronization in a time step. However, due to the motion path and initial position of particles related to random function, only one of them can guarantee the correctness of parallel computing. The improved parallel algorithms were implemented on CHIPIC3D, which was then used to simulate a relativistic backward wave oscillator tube. The simulation results show that just one improved parallel algorithm is correct with speedup and efficiency enhanced.