The fork() function

#include <unistd.h>

// Parameters: none (void)
// Returns: pid_t; in the parent, the ID of the newly created child process
pid_t fork(void);

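Return value: a successful fork() returns twice, once in the parent process and once in the child process.

In the parent it returns the child's process ID; in the child it returns 0. The return value therefore tells the parent and the child apart.
If fork() fails, it returns -1 in the parent, no child is created, and errno is set. Two common causes of failure:
- The number of processes has reached the system-imposed limit; errno is set to EAGAIN.
- The system is out of memory; errno is set to ENOMEM.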
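
The three possible return values can be handled as in the sketch below. It is only an illustration: the helper name run_in_child is made up for this example and is not part of any library.

```c
#include <errno.h>
#include <stdio.h>
#include <string.h>
#include <sys/types.h>
#include <sys/wait.h>
#include <unistd.h>

/* Hypothetical helper: fork a child that exits with `code`, then return the
 * exit status observed by the parent, or -1 if fork() or waitpid() fails. */
int run_in_child(int code) {
    pid_t pid = fork();
    if (pid < 0) {
        /* Only the parent gets here; no child was created. */
        fprintf(stderr, "fork failed: %s\n", strerror(errno));
        return -1;
    }
    if (pid == 0) {
        /* Child: fork() returned 0. */
        _exit(code);
    }
    /* Parent: fork() returned the child's PID. */
    int status = 0;
    if (waitpid(pid, &status, 0) < 0) {
        return -1;
    }
    return WIFEXITED(status) ? WEXITSTATUS(status) : -1;
}
```

Calling run_in_child(7) from main should return 7 in the parent. The child uses _exit rather than exit so that it does not flush stdio buffers inherited from the parent.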

Example program

 #include <unistd.h>
 #include <stdio.h>
 
 int main() {
     pid_t cld_pid;
     int a = 1, b = 2;
     for (int i = 0; i < 2; i++) {
         if ((cld_pid = fork()) == 0) {
             a += 1;
             printf("a=%d  b=%d\n", a, b);
         } else {
             b += 1;
             printf("a=%d  b=%d\n", a, b);
         }
     }
     return 0;
 }

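Execution flow:

The loop runs twice, and every process that reaches fork() splits in two, so four processes exist at the end and six lines are printed in total (assuming stdout is a terminal and therefore line-buffered). In the first iteration the original process prints a=1  b=3 and its child prints a=2  b=2. In the second iteration both of them fork again: the original process prints a=1  b=4 and its new child prints a=2  b=3, while the first child prints a=2  b=3 and its own child prints a=3  b=2. The relative order of the lines depends on scheduling.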

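How it works

On Linux, fork is implemented with copy-on-write (COW). When fork is called, the kernel does not copy the parent's entire address space; instead, parent and child initially share the same physical pages, which are marked read-only. Only when either process writes to a page does the kernel copy that page, so each process ends up with its own virtual address space and its writes are invisible to the other.

File resources are shared after fork. The child gets a copy of the parent's file descriptor table, and the descriptors in both tables refer to the same entries in the system file table. The reference counts of those entries are incremented, and parent and child share the file offset.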
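
Both effects described above can be observed with a small experiment. The sketch below is illustrative only: the helper names are invented here, and it assumes /tmp is writable.

```c
#include <stdio.h>
#include <stdlib.h>
#include <sys/types.h>
#include <sys/wait.h>
#include <unistd.h>

/* Hypothetical helper: returns the parent's value of `counter` after the
 * child has modified its own copy. */
int parent_value_after_child_write(void) {
    int counter = 1;
    pid_t pid = fork();
    if (pid < 0) return -1;
    if (pid == 0) {
        counter = 100;  /* the write lands in the child's private copy */
        _exit(0);
    }
    waitpid(pid, NULL, 0);
    return counter;     /* unchanged in the parent */
}

/* Hypothetical helper: returns the parent's file offset after the child
 * wrote 5 bytes through the inherited descriptor. */
long parent_offset_after_child_write(void) {
    char path[] = "/tmp/fork_demo_XXXXXX";  /* assumes /tmp is writable */
    int fd = mkstemp(path);
    if (fd < 0) return -1;
    unlink(path);  /* the file disappears once the last descriptor closes */

    pid_t pid = fork();
    if (pid < 0) { close(fd); return -1; }
    if (pid == 0) {
        ssize_t n = write(fd, "hello", 5);
        _exit(n == 5 ? 0 : 1);
    }
    waitpid(pid, NULL, 0);
    off_t pos = lseek(fd, 0, SEEK_CUR);  /* the offset is shared with the child */
    close(fd);
    return (long)pos;
}
```

The first helper should return 1, because the child's assignment never reaches the parent's memory. The second should return 5, because the child's write advanced the offset that both processes share.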

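Deadlock caused by fork

When fork creates a child from a multithreaded process, only the thread that called fork is duplicated in the child; the other threads are not.

Consider the following problem: the program has a global object, sGlobalInstance. A thread in the parent process locks it, the parent then calls fork, and the child process tries to lock it as well.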

#include <errno.h>
#include <pthread.h>
#include <stdio.h>
#include <stdlib.h>
#include <sys/syscall.h>
#include <unistd.h>

class Test {
public:
    Test() {
        pthread_mutex_init(&mMutex, nullptr);
        printf("Init test instance pid:%u tid:%u\n", getpid(), gettid());
    }

    ~Test() {
        pthread_mutex_destroy(&mMutex);
    }

    void lock() {
        pthread_mutex_lock(&mMutex);
    }

    void unlock() {
        pthread_mutex_unlock(&mMutex);
    }

private:
    pthread_mutex_t mMutex;
};

static Test* sGlobalInstance = nullptr;

void* func(void* arg) {
    if (sGlobalInstance == nullptr) {
        sGlobalInstance = new Test();
    }

    printf("Before get lock pid:%u tid:%u\n", getpid(), gettid());
    sGlobalInstance->lock();
    printf("After get lock pid:%u tid:%u\n", getpid(), gettid());

    pause();
    return nullptr;
}

int main() {
    printf("In parent process. pid:%u tid:%u\n", getpid(), gettid());
    sGlobalInstance = new Test();

    pthread_t id;
    pthread_create(&id, nullptr, func, nullptr);
    // Sleep so the thread has time to acquire the lock
    sleep(1);

    int pid = fork();
    if (pid < 0) {
        printf("Error occur while fork. errno:%d\n", errno);
        return errno;
    } else if (pid == 0) {
        // In child process
        printf("In child process. pid:%u tid:%u\n", getpid(), gettid());
        func(nullptr);
    } else {
        // In parent process
        pause();
    }
    
    return 0;
}

The program produces the output below. The child process never acquires the lock, so it deadlocks.

In parent process. pid:22287 tid:22287
Init test instance pid:22287 tid:22287
Before get lock pid:22287 tid:22288
After get lock pid:22287 tid:22288
In child process. pid:22293 tid:22293
Before get lock pid:22293 tid:22293

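The execution flow shows that the global object is initialized only once, in the parent, and that its mutex is already locked when fork is called. The child receives a copy of the parent's memory, so its copy of the mutex is also locked, but the thread that holds the lock does not exist in the child and nothing there can ever release it. Even if the parent later unlocks the mutex, copy-on-write means that write changes only the parent's copy; the child's copy stays locked.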

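Solution:

pthread_atfork can be used to handle this situation. Its prototype is:

int pthread_atfork(void (*prepare)(void), void (*parent)(void), void (*child)(void));

- The prepare callback is called before fork.
- The parent callback is called in the parent process after fork.
- The child callback is called in the child process after fork.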
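
A common way to use these callbacks is to take the lock in prepare and release it in both parent and child, so the mutex is never duplicated while another thread holds it. The sketch below shows that pattern in plain C. All names in it (g_lock, fork_with_atfork_demo and so on) are invented for illustration. Unlike func in the program above, the worker here releases the lock after a short while; if a thread held the lock forever, prepare would block forever too.

```c
#include <pthread.h>
#include <sys/types.h>
#include <sys/wait.h>
#include <time.h>
#include <unistd.h>

static pthread_mutex_t g_lock = PTHREAD_MUTEX_INITIALIZER;
static pthread_once_t g_once = PTHREAD_ONCE_INIT;

/* prepare: runs before fork() in the forking thread; wait for the lock so
 * that no other thread holds it when the address space is duplicated. */
static void on_prepare(void) { pthread_mutex_lock(&g_lock); }
/* parent: runs in the parent after fork(); release what prepare took. */
static void on_parent(void) { pthread_mutex_unlock(&g_lock); }
/* child: runs in the child after fork(); release the child's copy. */
static void on_child(void) { pthread_mutex_unlock(&g_lock); }

static void register_handlers(void) {
    pthread_atfork(on_prepare, on_parent, on_child);
}

static void sleep_ms(long ms) {
    struct timespec ts = { ms / 1000, (ms % 1000) * 1000000L };
    nanosleep(&ts, NULL);
}

/* Worker thread that holds the lock for a short while. */
static void *hold_lock_briefly(void *arg) {
    (void)arg;
    pthread_mutex_lock(&g_lock);
    sleep_ms(100);
    pthread_mutex_unlock(&g_lock);
    return NULL;
}

/* Hypothetical demo: returns 0 if the child could acquire the lock. */
int fork_with_atfork_demo(void) {
    /* Register the handlers once, even if the demo is called repeatedly. */
    pthread_once(&g_once, register_handlers);

    pthread_t tid;
    if (pthread_create(&tid, NULL, hold_lock_briefly, NULL) != 0) return -1;
    sleep_ms(20);  /* give the worker a chance to take the lock */

    pid_t pid = fork();  /* prepare blocks here until the worker unlocks */
    if (pid < 0) { pthread_join(tid, NULL); return -1; }
    if (pid == 0) {
        pthread_mutex_lock(&g_lock);  /* succeeds: on_child unlocked it */
        pthread_mutex_unlock(&g_lock);
        _exit(0);
    }
    int status = 0;
    pthread_join(tid, NULL);
    if (waitpid(pid, &status, 0) < 0) return -1;
    return (WIFEXITED(status) && WEXITSTATUS(status) == 0) ? 0 : -1;
}
```

On older glibc versions the program may need to be linked with -lpthread. Another option, not shown here, is for the child callback to reinitialize the mutex instead of unlocking it.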

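Execution order of parent and child after fork

The order in which the two processes run depends on how the operating system schedules them, that is, on the scheduling algorithm.

If the parent is scheduled first after fork, it is already active on the CPU, so no context switch is needed, which helps performance.

Running the child first pays off when the child calls exec immediately. After fork, all of the parent's pages are write-protected because of COW. The parent tends to keep modifying its data and stack pages, and each such write triggers a page fault and a page copy made only to preserve the child's view, even though the child may never need those pages. If the child runs first and calls exec, it gets a fresh address space and most of that copying is avoided. To force the child to run first, use vfork, which suspends the parent until the child calls exec or _exit.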
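
A minimal sketch of the vfork-then-exec pattern follows. The helper name run_shell_with_vfork is made up for this example, and it assumes a shell is available at /bin/sh.

```c
#include <sys/types.h>
#include <sys/wait.h>
#include <unistd.h>

/* Hypothetical helper: run `sh -c command` in a vfork()ed child and return
 * its exit status, or -1 on failure. */
int run_shell_with_vfork(const char *command) {
    pid_t pid = vfork();
    if (pid < 0) return -1;
    if (pid == 0) {
        /* The child borrows the parent's memory until it calls exec or
         * _exit, so it does nothing else here. */
        execl("/bin/sh", "sh", "-c", command, (char *)NULL);
        _exit(127);  /* reached only if exec failed */
    }
    /* The parent resumes only after the child has called exec or _exit. */
    int status = 0;
    if (waitpid(pid, &status, 0) < 0) return -1;
    return WIFEXITED(status) ? WEXITSTATUS(status) : -1;
}
```

For example, run_shell_with_vfork("exit 3") should return 3. Between vfork and exec the child must not modify variables or return from the function, because it is running on the parent's stack.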
