经验首页 前端设计 程序设计 Java相关 移动开发 数据库/运维 软件/图像 大数据/云计算 其他经验
当前位置:技术经验 » 数据库/运维 » Kubernetes » 查看文章
使用Kubeadm(1.13+)快速搭建Kubernetes集群
来源:cnblogs  作者:雨夜朦胧  时间:2018/12/10 9:25:37  对本文有异议

Kubeadm是管理集群生命周期的重要工具,从创建到配置再到升级,Kubeadm处理现有硬件上的生产集群的引导,并以最佳实践方式配置核心Kubernetes组件,以便为新节点提供安全而简单的连接流程并支持轻松升级。随着Kubernetes 1.13 的发布,现在Kubeadm正式成为GA。

准备

首先准备2台虚拟机(CPU最少2核),我是使用Hyper-V创建的2台Ubuntu18.04虚拟机,IP和机器名如下:

172.17.20.210 master

172.17.20.211 node1

禁用Swap

Kubernetes 1.8开始要求必须禁用Swap,如果不关闭,默认配置下kubelet将无法启动。

编辑/etc/fstab文件:

  1. raining@master:~$ sudo vim /etc/fstab
  2. UUID=8be04efd-f7c5-11e8-be8b-00155d000500 / ext4 defaults 0 0
  3. UUID=C0E3-6A72 /boot/efi vfat defaults 0 0
  4. #/swap.img none swap sw 0 0

如上,将/swap.img所在的行注释掉,然后运行:

  1. sudo swapoff -a

(可选)DNS配置

在Ubuntu18.04+版本中,DNS由systemd全面接管,接口监听在127.0.0.53:53,配置文件在/etc/systemd/resolved.conf中。

有时候会导致无法解析域名的问题,可使用如下2种方式来解决:

1.最简单的就是关闭systemd-resolvd服务

  1. sudo systemctl stop systemd-resolved
  2. sudo systemctl disable systemd-resolved

然后手动修改/etc/resolv.conf文件就可以了。

2.更加推荐的做法是修改systemd-resolv的设置:

  1. sudo vim /etc/systemd/resolved.conf
  2. # 修改为如下
  3. [Resolve]
  4. DNS=1.1.1.1 1.0.0.1
  5. #FallbackDNS=
  6. #Domains=
  7. LLMNR=no
  8. #MulticastDNS=no
  9. #DNSSEC=no
  10. #Cache=yes
  11. #DNSStubListener=yes

DNS=设置的是域名解析服务器的IP地址,这里分别设为1.1.1.1和1.0.0.1
LLMNR=设置的是禁止运行LLMNR(Link-Local Multicast Name Resolution),否则systemd-resolve会监听5535端口。

安装Docker

Kubernetes从1.6开始使用CRI(Container Runtime Interface)容器运行时接口。默认的容器运行时仍然是Docker,是使用kubelet中内置dockershim CRI来实现的。

Docker的安装可以参考之前的博客:Docker初体验

需要注意的是,Kubernetes 1.13已经针对Docker的1.11.1, 1.12.1, 1.13.1, 17.03, 17.06, 17.09, 18.06等版本做了验证,最低支持的Docker版本是1.11.1,最高支持是18.06,而Docker最新版本已经是18.09了,故我们安装时需要指定版本为18.06.1-ce

  1. sudo apt install docker-ce=18.06.1~ce~3-0~ubuntu

安装kubeadm, kubelet 和 kubectl

部署之前,我们需要安装三个包:

  • kubeadm: 引导启动k8s集群的命令行工具。

  • kubelet: 在群集中所有节点上运行的核心组件, 用来执行如启动pods和containers等操作。

  • kubectl: 操作集群的命令行工具。

首先添加apt-key:

  1. sudo apt update && sudo apt install -y apt-transport-https curl
  2. curl -s https://mirrors.aliyun.com/kubernetes/apt/doc/apt-key.gpg | sudo apt-key add -

添加kubernetes源:

  1. sudo vim /etc/apt/sources.list.d/kubernetes.list
  2. deb https://mirrors.aliyun.com/kubernetes/apt/ kubernetes-xenial main

安装:

  1. sudo apt update
  2. sudo apt install -y kubelet kubeadm kubectl
  3. sudo apt-mark hold kubelet kubeadm kubectl

使用kubeadm创建一个单Master集群

初始化Master节点

K8s的控制面板组件运行在Master节点上,包括etcd和API server(Kubectl便是通过API server与k8s通信)。

在执行初始化之前,我们还有一下3点需要注意:

1.选择一个网络插件,并检查它是否需要在初始化Master时指定一些参数,比如我们可能需要根据选择的插件来设置--pod-network-cidr参数。参考:Installing a pod network add-on

2.kubeadm使用eth0的默认网络接口(通常是内网IP)做为Master节点的advertise address,如果我们想使用不同的网络接口,可以使用--apiserver-advertise-address=<ip-address>参数来设置。如果适应IPv6,则必须使用IPv6d的地址,如:--apiserver-advertise-address=fd00::101

3.使用kubeadm config images pull来预先拉取初始化需要用到的镜像,用来检查是否能连接到Kubenetes的Registries。

Kubenetes默认Registries地址是k8s.gcr.io,很明显,在国内并不能访问gcr.io,因此在kubeadm v1.13之前的版本,安装起来非常麻烦,但是在1.13版本中终于解决了国内的痛点,其增加了一个--image-repository参数,默认值是k8s.gcr.io,我们将其指定为国内镜像地址:registry.aliyuncs.com/google_containers,其它的就可以完全按照官方文档来愉快的玩耍了。

其次,我们还需要指定--kubernetes-version参数,因为它的默认值是stable-1,会导致从https://dl.k8s.io/release/stable-1.txt下载最新的版本号,我们可以将其指定为固定版本(最新版:v1.13.0)来跳过网络请求。

现在,我们就来试一下:

  1. # 使用calico网络 --pod-network-cidr=192.168.0.0/16
  2. sudo kubeadm init --image-repository registry.aliyuncs.com/google_containers --kubernetes-version v1.13.0 --pod-network-cidr=192.168.0.0/16
  3. # 输出
  4. [init] Using Kubernetes version: v1.13.0
  5. [preflight] Running pre-flight checks
  6. [preflight] Pulling images required for setting up a Kubernetes cluster
  7. [preflight] This might take a minute or two, depending on the speed of your internet connection
  8. [preflight] You can also perform this action in beforehand using 'kubeadm config images pull'
  9. [kubelet-start] Writing kubelet environment file with flags to file "/var/lib/kubelet/kubeadm-flags.env"
  10. [kubelet-start] Writing kubelet configuration to file "/var/lib/kubelet/config.yaml"
  11. [kubelet-start] Activating the kubelet service
  12. [certs] Using certificateDir folder "/etc/kubernetes/pki"
  13. [certs] Generating "ca" certificate and key
  14. [certs] Generating "apiserver" certificate and key
  15. [certs] apiserver serving cert is signed for DNS names [master kubernetes kubernetes.default kubernetes.default.svc kubernetes.default.svc.cluster.local] and IPs [10.96.0.1 172.17.20.210]
  16. [certs] Generating "apiserver-kubelet-client" certificate and key
  17. [certs] Generating "front-proxy-ca" certificate and key
  18. [certs] Generating "front-proxy-client" certificate and key
  19. [certs] Generating "etcd/ca" certificate and key
  20. [certs] Generating "etcd/peer" certificate and key
  21. [certs] etcd/peer serving cert is signed for DNS names [master localhost] and IPs [172.17.20.210 127.0.0.1 ::1]
  22. [certs] Generating "etcd/server" certificate and key
  23. [certs] etcd/server serving cert is signed for DNS names [master localhost] and IPs [172.17.20.210 127.0.0.1 ::1]
  24. [certs] Generating "etcd/healthcheck-client" certificate and key
  25. [certs] Generating "apiserver-etcd-client" certificate and key
  26. [certs] Generating "sa" key and public key
  27. [kubeconfig] Using kubeconfig folder "/etc/kubernetes"
  28. [kubeconfig] Writing "admin.conf" kubeconfig file
  29. [kubeconfig] Writing "kubelet.conf" kubeconfig file
  30. [kubeconfig] Writing "controller-manager.conf" kubeconfig file
  31. [kubeconfig] Writing "scheduler.conf" kubeconfig file
  32. [control-plane] Using manifest folder "/etc/kubernetes/manifests"
  33. [control-plane] Creating static Pod manifest for "kube-apiserver"
  34. [control-plane] Creating static Pod manifest for "kube-controller-manager"
  35. [control-plane] Creating static Pod manifest for "kube-scheduler"
  36. [etcd] Creating static Pod manifest for local etcd in "/etc/kubernetes/manifests"
  37. [wait-control-plane] Waiting for the kubelet to boot up the control plane as static Pods from directory "/etc/kubernetes/manifests". This can take up to 4m0s
  38. [kubelet-check] Initial timeout of 40s passed.
  39. [apiclient] All control plane components are healthy after 42.003645 seconds
  40. [uploadconfig] storing the configuration used in ConfigMap "kubeadm-config" in the "kube-system" Namespace
  41. [kubelet] Creating a ConfigMap "kubelet-config-1.13" in namespace kube-system with the configuration for the kubelets in the cluster
  42. [patchnode] Uploading the CRI Socket information "/var/run/dockershim.sock" to the Node API object "master" as an annotation
  43. [mark-control-plane] Marking the node master as control-plane by adding the label "node-role.kubernetes.io/master=''"
  44. [mark-control-plane] Marking the node master as control-plane by adding the taints [node-role.kubernetes.io/master:NoSchedule]
  45. [bootstrap-token] Using token: 6pkrlg.8glf2fqpuf3i489m
  46. [bootstrap-token] Configuring bootstrap tokens, cluster-info ConfigMap, RBAC Roles
  47. [bootstraptoken] configured RBAC rules to allow Node Bootstrap tokens to post CSRs in order for nodes to get long term certificate credentials
  48. [bootstraptoken] configured RBAC rules to allow the csrapprover controller automatically approve CSRs from a Node Bootstrap Token
  49. [bootstraptoken] configured RBAC rules to allow certificate rotation for all node client certificates in the cluster
  50. [bootstraptoken] creating the "cluster-info" ConfigMap in the "kube-public" namespace
  51. [addons] Applied essential addon: CoreDNS
  52. [addons] Applied essential addon: kube-proxy
  53. Your Kubernetes master has initialized successfully!
  54. To start using your cluster, you need to run the following as a regular user:
  55. mkdir -p $HOME/.kube
  56. sudo cp -i /etc/kubernetes/admin.conf $HOME/.kube/config
  57. sudo chown $(id -u):$(id -g) $HOME/.kube/config
  58. You should now deploy a pod network to the cluster.
  59. Run "kubectl apply -f [podnetwork].yaml" with one of the options listed at:
  60. https://kubernetes.io/docs/concepts/cluster-administration/addons/
  61. You can now join any number of machines by running the following on each node
  62. as root:
  63. kubeadm join 172.17.20.210:6443 --token 6pkrlg.8glf2fqpuf3i489m --discovery-token-ca-cert-hash sha256:eebfe256113bee397b218ba832f412273ae734bd4686241fb910885d26efd222

这次非常顺利的就部署成功了,如果我们想使用非root用户操作kubectl,可以使用以下命令,这也是kubeadm init输出的一部分:

  1. mkdir -p $HOME/.kube
  2. sudo cp -i /etc/kubernetes/admin.conf $HOME/.kube/config
  3. sudo chown $(id -u):$(id -g) $HOME/.kube/config

安装网络插件

为了让Pods间可以相互通信,我们必须安装一个网络插件,并且必须在部署任何应用之前安装,CoreDNS也是在网络插件安装之后才会启动的。

网络的插件完整列表,请参考 Networking and Network Policy

在安装之前,我们先查看一下当前Pods的状态:

  1. kubectl get pods --all-namespaces
  2. # 输出
  3. NAMESPACE NAME READY STATUS RESTARTS AGE
  4. kube-system coredns-78d4cf999f-6pgfr 0/1 Pending 0 87s
  5. kube-system coredns-78d4cf999f-m9kgs 0/1 Pending 0 87s
  6. kube-system etcd-master 1/1 Running 0 47s
  7. kube-system kube-apiserver-master 1/1 Running 0 38s
  8. kube-system kube-controller-manager-master 1/1 Running 0 55s
  9. kube-system kube-proxy-mkg24 1/1 Running 0 87s
  10. kube-system kube-scheduler-master 1/1 Running 0 41s

如上,可以看到CoreDND的状态是Pending,这是因为我们还没有安装网络插件。

Calico是一个纯三层的虚拟网络方案,Calico 为每个容器分配一个 IP,每个 host 都是 router,把不同 host 的容器连接起来。与 VxLAN 不同的是,Calico 不对数据包做额外封装,不需要 NAT 和端口映射,扩展性和性能都很好。

默认情况下,Calico网络插件使用的的网段是192.168.0.0/16,在init的时候,我们已经通过--pod-network-cidr=192.168.0.0/16来适配Calico,当然你也可以修改calico.yml文件来指定不同的网段。

可以使用如下命令命令来安装Canal插件:

  1. kubectl apply -f https://docs.projectcalico.org/v3.3/getting-started/kubernetes/installation/hosted/rbac-kdd.yaml
  2. kubectl apply -f https://docs.projectcalico.org/v3.3/getting-started/kubernetes/installation/hosted/kubernetes-datastore/calico-networking/1.7/calico.yaml
  3. # 上面的calico.yaml会去quay.io拉取镜像,如果无法拉取,可使用下面的国内镜像
  4. kubectl apply -f http://mirror.faasx.com/k8s/calico/v3.3.2/rbac-kdd.yaml
  5. kubectl apply -f http://mirror.faasx.com/k8s/calico/v3.3.2/calico.yaml

关于更多Canal的信息可以查看Calico官方文档:kubeadm quickstart

稍等片刻,再使用kubectl get pods --all-namespaces命令来查看网络插件的安装情况:

  1. kubectl get pods --all-namespaces
  2. # 输出
  3. NAMESPACE NAME READY STATUS RESTARTS AGE
  4. kube-system calico-node-x96gn 2/2 Running 0 47s
  5. kube-system coredns-78d4cf999f-6pgfr 1/1 Running 0 54m
  6. kube-system coredns-78d4cf999f-m9kgs 1/1 Running 0 54m
  7. kube-system etcd-master 1/1 Running 3 53m
  8. kube-system kube-apiserver-master 1/1 Running 3 53m
  9. kube-system kube-controller-manager-master 1/1 Running 3 53m
  10. kube-system kube-proxy-mkg24 1/1 Running 2 54m
  11. kube-system kube-scheduler-master 1/1 Running 3 53m

如上,STATUS全部变为了Running,表示安装成功,接下来就可以加入其他节点以及部署应用了。

Master隔离

默认情况下,由于安全原因,集群并不会将pods部署在Master节点上。但是在开发环境下,我们可能就只有一个Master节点,这时可以使用下面的命令来解除这个限制:

  1. kubectl taint nodes --all node-role.kubernetes.io/master-
  2. ## 输出
  3. node/master untainted

加入工作节点

要为群集添加工作节点,需要为每台计算机执行以下操作:

  • SSH到机器
  • 成为root用户,(如: sudo su -)
  • 运行上面的kubeadm init命令输出的:kubeadm join --token <token> <master-ip>:<master-port> --discovery-token-ca-cert-hash sha256:<hash>

如果我们忘记了Master节点的加入token,可以使用如下命令来查看:

  1. kubeadm token list
  2. # 输出
  3. TOKEN TTL EXPIRES USAGES DESCRIPTION EXTRA GROUPS
  4. 6pkrlg.8glf2fqpuf3i489m 22h 2018-12-07T13:46:33Z authentication,signing The default bootstrap token generated by 'kubeadm init'. system:bootstrappers:kubeadm:default-node-token

默认情况下,token的有效期是24小时,如果我们的token已经过期的话,可以使用以下命令重新生成:

  1. kubeadm token create
  2. # 输出
  3. u2mt59.tyqpo0v5wf05lx2q

如果我们也没有--discovery-token-ca-cert-hash的值,可以使用以下命令生成:

  1. openssl x509 -pubkey -in /etc/kubernetes/pki/ca.crt | openssl rsa -pubin -outform der 2>/dev/null | openssl dgst -sha256 -hex | sed 's/^.* //'
  2. # 输出
  3. eebfe256113bee397b218ba832f412273ae734bd4686241fb910885d26efd222

现在,我们登录到工作节点服务器,然后运行如下命令加入集群(这也是上面init输出的一部分):

  1. sudo kubeadm join 172.17.20.210:6443 --token 6pkrlg.8glf2fqpuf3i489m --discovery-token-ca-cert-hash sha256:eebfe256113bee397b218ba832f412273ae734bd4686241fb910885d26efd222
  2. # 输出
  3. [sudo] password for raining:
  4. [preflight] Running pre-flight checks
  5. [discovery] Trying to connect to API Server "172.17.20.210:6443"
  6. [discovery] Created cluster-info discovery client, requesting info from "https://172.17.20.210:6443"
  7. [discovery] Requesting info from "https://172.17.20.210:6443" again to validate TLS against the pinned public key
  8. [discovery] Cluster info signature and contents are valid and TLS certificate validates against pinned roots, will use API Server "172.17.20.210:6443"
  9. [discovery] Successfully established connection with API Server "172.17.20.210:6443"
  10. [join] Reading configuration from the cluster...
  11. [join] FYI: You can look at this config file with 'kubectl -n kube-system get cm kubeadm-config -oyaml'
  12. [kubelet] Downloading configuration for the kubelet from the "kubelet-config-1.13" ConfigMap in the kube-system namespace
  13. [kubelet-start] Writing kubelet configuration to file "/var/lib/kubelet/config.yaml"
  14. [kubelet-start] Writing kubelet environment file with flags to file "/var/lib/kubelet/kubeadm-flags.env"
  15. [kubelet-start] Activating the kubelet service
  16. [tlsbootstrap] Waiting for the kubelet to perform the TLS Bootstrap...
  17. [patchnode] Uploading the CRI Socket information "/var/run/dockershim.sock" to the Node API object "node1" as an annotation
  18. This node has joined the cluster:
  19. * Certificate signing request was sent to apiserver and a response was received.
  20. * The Kubelet was informed of the new secure connection details.
  21. Run 'kubectl get nodes' on the master to see this node join the cluster.

等待一会,我们可以在Master节点上使用kubectl get nodes命令来查看节点的状态:

  1. kubectl get nodes
  2. # 输出
  3. NAME STATUS ROLES AGE VERSION
  4. master Ready master 17m v1.13.0
  5. node1 Ready <none> 15m v1.13.0

如上全部Ready,大功告成,我们可以运行一些命令来测试一下集群是否正常。

测试

首先验证kube-apiserver, kube-controller-manager, kube-scheduler, pod network 是否正常:

  1. # 部署一个 Nginx Deployment,包含两个Pod
  2. # https://kubernetes.io/docs/concepts/workloads/controllers/deployment/
  3. kubectl create deployment nginx --image=nginx:alpine
  4. kubectl scale deployment nginx --replicas=2
  5. # 验证Nginx Pod是否正确运行,并且会分配192.168.开头的集群IP
  6. kubectl get pods -l app=nginx -o wide
  7. # 输出如下:
  8. NAME READY STATUS RESTARTS AGE IP NODE NOMINATED NODE READINESS GATES
  9. nginx-54458cd494-p8jzs 1/1 Running 0 31s 192.168.1.2 node1 <none> <none>
  10. nginx-54458cd494-v2m4b 1/1 Running 0 24s 192.168.1.3 node1 <none> <none>

再验证一下kube-proxy是否正常:

  1. # 以 NodePort 方式对外提供服务 https://kubernetes.io/docs/concepts/services-networking/connect-applications-service/
  2. kubectl expose deployment nginx --port=80 --type=NodePort
  3. # 查看集群外可访问的Port
  4. kubectl get services nginx
  5. # 输出
  6. NAME TYPE CLUSTER-IP EXTERNAL-IP PORT(S) AGE
  7. nginx NodePort 10.110.49.49 <none> 80:31899/TCP 4s
  8. # 可以通过任意 NodeIP:Port 在集群外部访问这个服务,本示例中部署的2台集群IP分别是172.17.20.210和172.17.20.211
  9. curl http://172.17.20.210:31899
  10. curl http://172.17.20.211:31899

最后验证一下dns, pod network是否正常:

  1. # 运行Busybox并进入交互模式
  2. kubectl run -it curl --image=radial/busyboxplus:curl
  3. # 输入`nslookup nginx`查看是否可以正确解析出集群内的IP,已验证DNS是否正常
  4. [ root@curl-66959f6557-6sfqh:/ ]$ nslookup nginx
  5. # 输出
  6. Server: 10.96.0.10
  7. Address 1: 10.96.0.10 kube-dns.kube-system.svc.cluster.local
  8. Name: nginx
  9. Address 1: 10.110.49.49 nginx.default.svc.cluster.local
  10. # 通过服务名进行访问,验证kube-proxy是否正常
  11. [ root@curl-66959f6557-6sfqh:/ ]$ curl http://nginx/
  12. # 输出如下:
  13. # <!DOCTYPE html> ---省略
  14. # 分别访问一下2个Pod的内网IP,验证跨Node的网络通信是否正常
  15. [ root@curl-66959f6557-6sfqh:/ ]$ curl http://192.168.1.2/
  16. [ root@curl-66959f6557-6sfqh:/ ]$ curl http://192.168.1.3/

验证通过,集群搭建成功,接下来我们就可以参考官方文档来部署其他服务,愉快的玩耍了。

卸载集群

想要撤销kubeadm执行的操作,首先要排除节点,并确保该节点为空, 然后再将其关闭。

在Master节点上运行:

  1. kubectl drain <node name> --delete-local-data --force --ignore-daemonsets
  2. kubectl delete node <node name>

然后在需要移除的节点上,重置kubeadm的安装状态:

  1. sudo kubeadm reset

如果你想重新配置集群,使用新的参数重新运行kubeadm init或者kubeadm join即可。

参考资料

 友情链接:直通硅谷  点职佳  北美留学生论坛

本站QQ群:前端 618073944 | Java 606181507 | Python 626812652 | C/C++ 612253063 | 微信 634508462 | 苹果 692586424 | C#/.net 182808419 | PHP 305140648 | 运维 608723728

W3xue 的所有内容仅供测试,对任何法律问题及风险不承担任何责任。通过使用本站内容随之而来的风险与本站无关。
关于我们  |  意见建议  |  捐助我们  |  报错有奖  |  广告合作、友情链接(目前9元/月)请联系QQ:27243702 沸活量
皖ICP备17017327号-2 皖公网安备34020702000426号