同事在用hdfs api 写入hdfs文件，2年前没有成功，这次一起解决了这个问题。详细代码如下：

客户端需要指定ns名称，节点配置，ConfiguredFailoverProxyProvider等信息。

代码示例：

package cn.itacst.hadoop.hdfs;

import java.io.FileInputStream;
import java.io.InputStream;
import java.io.OutputStream;
import java.net.URI;

import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.fs.FileSystem;
import org.apache.hadoop.fs.Path;
import org.apache.hadoop.io.IOUtils;

public class HDFS_HA {

    
    public static void main(String[] args) throws Exception {
        Configuration conf = new Configuration();
        conf.set("fs.defaultFS", "hdfs://ns1");
        conf.set("dfs.nameservices", "ns1");
        conf.set("dfs.ha.namenodes.ns1", "nn1,nn2");
        conf.set("dfs.namenode.rpc-address.ns1.nn1", "hdfsname01:9000");
        conf.set("dfs.namenode.rpc-address.ns1.nn2", "hdfsname02:9000");
        //conf.setBoolean(name, value);
        conf.set("dfs.client.failover.proxy.provider.ns1", "org.apache.hadoop.hdfs.server.namenode.ha.ConfiguredFailoverProxyProvider");
        FileSystem fs = FileSystem.get(new URI("hdfs://ns1"), conf, "hadoop");
        InputStream in =new FileInputStream("c://test.rar");
        OutputStream out = fs.create(new Path("/test"));
        IOUtils.copyBytes(in, out, 4096, true);
    }
}

下面是调用api 官网说明

static FileSystem get(URI uri, Configuration conf, String user)
Get a filesystem instance based on the uri, the passed configuration and the user

网址：http://hadoop.apache.org/docs/r2.4.1/api/org/apache/hadoop/fs/FileSystem.html

喜欢追踪原来的朋友可以看第二部分。

二。原理解析

跟踪进入FileSystem

通过getDefaultUri(conf)获得前面代码中设置的主机的uri地址，然后再调用重载的get方法。

以下是重载的get方法内容：

①getScheme() scheme的值是hdfs，getAuthority是获得namenode的主机名和端口号。

如果scheme==null 以及authority==null 就返回default fs也就是本地文件系统。

②然后拼接disableCacheName，将scheme拼接进去，拼接为fs.hdfs.impl.disable.cache。然后从conf里面区get该参数值，如果有值就返回，如果没值就返回false。该参数的意思是禁用缓存，禁用缓存为false就意味着要用缓存，然后走到下面的程序。CACHE.get(uri,conf).

到这里，filesystem的get方法什么也没做，搞了一堆判断。然后返回的CACHE.get(uri,conf).

我们下面看一下CACHE.get干了什么。

③CACHE是filesystem的内部类