git-0.1版本源码分析

git是一个开源的分布式代码控制系统(SCM),由Linus在2005年开发.当时由于linux内核工程所使用的SCM工具BitKeeper的提供商不再提供免费使用,Linus没有寻找到能替代BitKeeper,满足需求的SCM工具,因此自己设计开发了git.为什么叫这个名字,在初始版本的README中,Linus是这样解释的, 也就是说,其实也没有特殊的含义-:)

    GIT - the stupid content tracker

"git" can mean anything, depending on your mood.

 - random three-letter combination that is pronounceable, and not
   actually used by any common UNIX command.  The fact that it is a
   mispronounciation of "get" may or may not be relevant.
 - stupid. contemptible and despicable. simple. Take your pick from the
   dictionary of slang.
 - "global information tracker": you're in a good mood, and it actually
   works for you. Angels sing, and a light suddenly fills the room. 
 - "goddamn idiotic truckload of sh*t": when it breaks

相比较而言,git有如下几个显著特点和优势:
1) 从仓库克隆以后,包含所有的历史修改记录.
2) 分支管理的高效性和高性能.
3) 分布式开发的高效率.
当然,git还有许多强大和便利的功能.git的源码工程请参考:https://github.com/git/git

git的初始版本为git-0.1,完成了对象数据库和cache的框架设计,并且只是实现了底层命令的操作,我们现在使用git时,比如git add 或者git rm等,都是以特定的参数调用某个底层命令.git的每个底层命令都被编译成为可执行文件. git初始版本的目录结构及文件如下图所示:
这里写图片描述

cache.h头文件中,包括cache处理流程中涉及到的几个类型和接口,在流程分析中会说明这些类型,该文件的内容如下:

#ifndef CACHE_H
#define CACHE_H

#include <stdio.h>
#include <sys/stat.h>
#include <fcntl.h>
#include <stddef.h>
#include <stdlib.h>
#include <stdarg.h>
#include <errno.h>
#include <sys/mman.h>

#include <openssl/sha.h>
#include <zlib.h>

/*
 * Basic data structures for the directory cache
 *
 * NOTE NOTE NOTE! This is all in the native CPU byte format. It's
 * not even trying to be portable. It's trying to be efficient. It's
 * just a cache, after all.
 */

#define CACHE_SIGNATURE 0x44495243  /* "DIRC" */
struct cache_header {
    unsigned int signature;
    unsigned int version;
    unsigned int entries;
    unsigned char sha1[20];
};

/*
 * The "cache_time" is just the low 32 bits of the
 * time. It doesn't matter if it overflows - we only
 * check it for equality in the 32 bits we save.
 */
struct cache_time {
    unsigned int sec;
    unsigned int nsec;
};

/*
 * dev/ino/uid/gid/size are also just tracked to the low 32 bits
 * Again - this is just a (very strong in practice) heuristic that
 * the inode hasn't changed.
 */
struct cache_entry {
    struct cache_time ctime;
    struct cache_time mtime;
    unsigned int st_dev;
    unsigned int st_ino;
    unsigned int st_mode;
    unsigned int st_uid;
    unsigned int st_gid;
    unsigned int st_size;
    unsigned char sha1[20];
    unsigned short namelen;
    unsigned char name[0];
};

const char *sha1_file_directory;
struct cache_entry **active_cache;
unsigned int active_nr, active_alloc;

#define DB_ENVIRONMENT "SHA1_FILE_DIRECTORY"
#define DEFAULT_DB_ENVIRONMENT ".dircache/objects"

#define cache_entry_size(len) ((offsetof(struct cache_entry,name) + (len) + 8) & ~7)
#define ce_size(ce) cache_entry_size((ce)->namelen)

#define alloc_nr(x) (((x)+16)*3/2)

/* Initialize the cache information */
extern int read_cache(void);

/* Return a statically allocated filename matching the sha1 signature */
extern char *sha1_file_name(unsigned char *sha1);

/* Write a memory buffer out to the sha file */
extern int write_sha1_buffer(unsigned char *sha1, void *buf, unsigned int size);

/* Read and unpack a sha1 file into memory, write memory to a sha1 file */
extern void * read_sha1_file(unsigned char *sha1, char *type, unsigned long *size);
extern int write_sha1_file(char *buf, unsigned len);

/* Convert to/from hex/sha1 representation */
extern int get_sha1_hex(char *hex, unsigned char *sha1);
extern char *sha1_to_hex(unsigned char *sha1);  /* static buffer! */

/* General helper functions */
extern void usage(const char *err);

#endif /* CACHE_H */

源码流程分析主要包括如下几部分:
init-db命令处理流程
update-cache命令处理流程
TODO: write-tree命令处理流程.
TODO: read-tree命令处理流程.
TODO: commit-tree命令处理流程.
TODO: show-diff命令处理流程.
TODO: cat-file命令处理流程.

猜你喜欢

转载自blog.csdn.net/azurelaker/article/details/81639034