Java线程池以及线程池监控(II)

本文部分来源自Java线程池实现原理及其在美团业务中的实践

上一篇说到了execute的执行过程，接下来讲一讲线程创建的过程，即addWorker(Runnable firstTask, boolean core)

  private boolean addWorker(Runnable firstTask, boolean core) {
        retry:
        for (;;) {
            int c = ctl.get();
            int rs = runStateOf(c);

            // Check if queue empty only if necessary.
            if (rs >= SHUTDOWN &&
                ! (rs == SHUTDOWN &&
                   firstTask == null &&
                   ! workQueue.isEmpty()))
                return false;

            for (;;) {
                int wc = workerCountOf(c);
                if (wc >= CAPACITY ||
                    wc >= (core ? corePoolSize : maximumPoolSize))
                    return false;
                if (compareAndIncrementWorkerCount(c))
                    break retry;
                c = ctl.get();  // Re-read ctl
                if (runStateOf(c) != rs)
                    continue retry;
                // else CAS failed due to workerCount change; retry inner loop
            }
        }

        boolean workerStarted = false;
        boolean workerAdded = false;
        Worker w = null;
        try {
            w = new Worker(firstTask);
            final Thread t = w.thread;
            if (t != null) {
                final ReentrantLock mainLock = this.mainLock;
                mainLock.lock();
                try {
                    // Recheck while holding lock.
                    // Back out on ThreadFactory failure or if
                    // shut down before lock acquired.
                    int rs = runStateOf(ctl.get());

                    if (rs < SHUTDOWN ||
                        (rs == SHUTDOWN && firstTask == null)) {
                        if (t.isAlive()) // precheck that t is startable
                            throw new IllegalThreadStateException();
                        workers.add(w);
                        int s = workers.size();
                        if (s > largestPoolSize)
                            largestPoolSize = s;
                        workerAdded = true;
                    }
                } finally {
                    mainLock.unlock();
                }
                if (workerAdded) {
                    t.start();
                    workerStarted = true;
                }
            }
        } finally {
            if (! workerStarted)
                addWorkerFailed(w);
        }
        return workerStarted;
    }
复制代码

addWorker两个参数，firstTask表示线程创建成功后执行firstTask，core表示创建是否是核心线程池里的线程 1.有两层死循环可以理解为自旋，外层的自旋，先获取当前的线程池状态rs，如果当前rs的状态大于等于SHUTDOWN的或者SHUTDOWN状态且队列没有任务了，创建失败，如果rs的状态为RUNNING，则进入第二层自旋 2.获取当前的线程数wc，判断wc是否大于CAPACITY，CAPACITY为5亿多基本不可能，接着再根据core的值判断wc是要大于corePoolSize还是maximumPoolSize，满足其中一个条件，就创建失败。

                if (wc >= CAPACITY ||
                    wc >= (core ? corePoolSize : maximumPoolSize))
                    return false;
复制代码

3.如果线程数符合要求就会通过CAS的方式增加当前线程数量的值，如果CAS成功则跳出两层自旋

                if (compareAndIncrementWorkerCount(c))
                    break retry;
复制代码

，如果失败，再次判断下当前的状态是否跟之前的rs一致，如果不一致回到最外层的循环，重新走1步骤，如果一致，则重新走2步骤

                if (runStateOf(c) != rs)
                    continue retry;
复制代码

4.线程池中的线程都被包装成工作线程Worker，线程在Worker里面创建。创建一个工作线程，把firstTask传进去

    private final class Worker
        extends AbstractQueuedSynchronizer
        implements Runnable
    {
        /**
         * This class will never be serialized, but we provide a
         * serialVersionUID to suppress a javac warning.
         */
        private static final long serialVersionUID = 6138294804551838833L;

        /** Thread this worker is running in.  Null if factory fails. */
        final Thread thread;
        /** Initial task to run.  Possibly null. */
        Runnable firstTask;
        /** Per-thread task counter */
        volatile long completedTasks;

        /**
         * Creates with given first task and thread from ThreadFactory.
         * @param firstTask the first task (null if none)
         */
        Worker(Runnable firstTask) {
            setState(-1); // inhibit interrupts until runWorker
            this.firstTask = firstTask;
            this.thread = getThreadFactory().newThread(this);
        }
    }
复制代码

拿到Worker里的线程

final Thread t = w.thread;
复制代码

如果t不为空，就会获取锁，获取是为了一些操作能够安全的更新如工作集合的添加和largestPoolSize的修改，获取到锁后，获取当前线程池的状态rs，判断rs是否符合状态，符合的话再判断t是否已经启动，如果启动抛出异常，否则把该工作线程添加到工作线程的集合中，并修改largestPoolSize，紧接着释放锁。然后启动线程

 if (workerAdded) {
     t.start();
     workerStarted = true;
 }
复制代码

如果线程启动失败，则会把刚刚的工作线程从工作线程集合中删除，并把线程数量减一

finally {
            if (! workerStarted)
                addWorkerFailed(w);
 }
 ----------
 private void addWorkerFailed(Worker w) {
    final ReentrantLock mainLock = this.mainLock;
    mainLock.lock();
    try {
        if (w != null)
            workers.remove(w);
        decrementWorkerCount();
        tryTerminate();
    } finally {
        mainLock.unlock();
    }
}
复制代码

上面用的finally的原因是因为在之前的两层循环中，已经把线程数量加一了，所以无论如何只要线程没启动成功就得要把线程数量减一，以确保数量正确。

讲完了如何添加工作线程的，现在来讲一讲工作线程是如何运行起来的。工作线程中的运行是执行runWorker()

        public void run() {
            runWorker(this);
        }
----------------------------------
    final void runWorker(Worker w) {
        Thread wt = Thread.currentThread();
        Runnable task = w.firstTask;
        w.firstTask = null;
        w.unlock(); // allow interrupts
        boolean completedAbruptly = true;
        try {
            while (task != null || (task = getTask()) != null) {
                w.lock();
                // If pool is stopping, ensure thread is interrupted;
                // if not, ensure thread is not interrupted.  This
                // requires a recheck in second case to deal with
                // shutdownNow race while clearing interrupt
                if ((runStateAtLeast(ctl.get(), STOP) ||
                     (Thread.interrupted() &&
                      runStateAtLeast(ctl.get(), STOP))) &&
                    !wt.isInterrupted())
                    wt.interrupt();
                try {
                    beforeExecute(wt, task);
                    Throwable thrown = null;
                    try {
                        task.run();
                    } catch (RuntimeException x) {
                        thrown = x; throw x;
                    } catch (Error x) {
                        thrown = x; throw x;
                    } catch (Throwable x) {
                        thrown = x; throw new Error(x);
                    } finally {
                        afterExecute(task, thrown);
                    }
                } finally {
                    task = null;
                    w.completedTasks++;
                    w.unlock();
                }
            }
            completedAbruptly = false;
        } finally {
            processWorkerExit(w, completedAbruptly);
        }
    }        
复制代码

线程执行前先解锁，确保能够被中断，接着使用while一直循环从getTask()获取任务，获取到任务后，先把线程上锁，然后判断当前线程的状态是否是STOP，如果是STOP，则中断当前的线程。在运行真正的线程run时前有一个beforeExecute，运行完后有一个afterExecute，这两个方法都是空方法，留着给开发人员实现，可以对线程的运行前面做一下操作，当线程运行完后，当前工作线程的完成任务数加一，同时线程释放锁

finally {
                    task = null;
                    w.completedTasks++;
                    w.unlock();
                }
复制代码

如果获取不到任务，则回收工作线程

finally {
            processWorkerExit(w, completedAbruptly);
        }
复制代码

现在我们来看看，工作线程是如何获取到任务，即getTask() 方法

   private Runnable getTask() {
        boolean timedOut = false; // Did the last poll() time out?

        for (;;) {
            int c = ctl.get();
            int rs = runStateOf(c);

            // Check if queue empty only if necessary.
            if (rs >= SHUTDOWN && (rs >= STOP || workQueue.isEmpty())) {
                decrementWorkerCount();
                return null;
            }

            int wc = workerCountOf(c);

            // Are workers subject to culling?
            boolean timed = allowCoreThreadTimeOut || wc > corePoolSize;

            if ((wc > maximumPoolSize || (timed && timedOut))
                && (wc > 1 || workQueue.isEmpty())) {
                if (compareAndDecrementWorkerCount(c))
                    return null;
                continue;
            }

            try {
                Runnable r = timed ?
                    workQueue.poll(keepAliveTime, TimeUnit.NANOSECONDS) :
                    workQueue.take();
                if (r != null)
                    return r;
                timedOut = true;
            } catch (InterruptedException retry) {
                timedOut = false;
            }
        }
    }
复制代码

1.设置一个变量timeOut来判断上一次获取任务是否超时， 2.接着获取当前的线程池状态rs，判断当前线程池状态是否为STOP或者是SHUTDOWN且队列为空，如果是减少线程数量-1，返回null。 3.否则获取当前的线程数量wc，同时设置一个变量timed判断是否核心线程允许超时，紧接着判断 wc 是否大于 maximumPoolSize ，或者上一次获取任务超时且队列为空，符合就减少线程数量，返回null 4.最重要的一步

            try {
                Runnable r = timed ?
                    workQueue.poll(keepAliveTime, TimeUnit.NANOSECONDS) :
                    workQueue.take();
                if (r != null)
                    return r;
                timedOut = true;
            } catch (InterruptedException retry) {
                timedOut = false;
            }
复制代码

根据timed来判断是阻塞的获取任务一直等待直到有任务，还是根据keepAliveTime等待一定时间如果没有获取到任务就返回null，如果获取到任务就返回任务，获取不到任务任务就把timedOut 置为true进行下一次的循环，同时如果阻塞线程被中断了，这里会捕获到，把timedOut = false进行下一次的循环。

在这里额外介绍两点 1.为什么要判断wc > maximumPoolSize，这是因为有可能使用了 setMaximumPoolSize 把 maximumPoolSize调小了，导致目前的线程数量超过最大值，所以要把超出的部分减去同时回收超过的部分 2.timed = allowCoreThreadTimeOut || wc > corePoolSize;，在线程池中使用 allowCoreThreadTimeOut 来控制核心线程池的线程是否也应该被超时回收，如果为true则核心线程池也需要，为false则超过核心线程池数量的线程才会被超时回收

Java线程池以及线程池监控(II)

猜你喜欢