Nginx 代理 varnish 的姿势

TerHandfiel 8年前
   <p>以前做网站的时候遇到了网站的访问量很大,而导致后端处理程序响应超时而导致的一些问题。当时采用的架构是nginx+php-fastcgi,同事想到了用nginx-proxycache来做页面缓存,效果也还行。下面我想介绍一下varnish的使用技巧</p>    <h2><strong>准备</strong></h2>    <p>varnish严格来说是可以当作一个代理服务器的软件,直接将 HTTP 请求转发到 php-cgi ,然后交给php处理,varnish会获取经过php处理后的数据,最后返回给浏览器。如图</p>    <p style="text-align:center"><img src="https://simg.open-open.com/show/444380fc92f288794f187323c69dd10d.jpg"></p>    <p>但是,现在 php-fastcgi 已经被逐渐淘汰了,也就是说我们一般情况下不会使用 php-fastcgi ,那么我们不能直接将varnish与php组合,因为php-fpm的交互方式为socket,而不再是监听本机的9000端口</p>    <p>所以我们必须找一个的媒介,连接 varnish 和 php-fpm , nginx 可以扮演这个媒介,如下图:</p>    <p style="text-align:center"><img src="https://simg.open-open.com/show/60071c1c861c29e6c1274502f785cf71.jpg"></p>    <p>那么问题来了,根据研究发现, varnish 处理 http 请求不如 nginx 那么高效。所以如果我们让 nginx 做前锋,这样就更完美了。那我们需要怎么才能达到这个目的呢,下面我们来整理一下流程</p>    <p style="text-align:center"><img src="https://simg.open-open.com/show/8d13afae02a727554b28811fd0b720bf.jpg"></p>    <p>下面就来实现一下图三的架构吧。</p>    <p>事先需要准备 nginx , varnish , php-fpm , php 这些软件,OS是 ubuntu ,所有软件都可以用 apt-get install 来安装,不了解包名全称的话可以先 apt-get update ,更新一下源,然后再用 apt-cache search xxx 来查找软件包名</p>    <p>安装完 varnish 后,可以使用 service varnish 回车,查看可操作选项 * Usage: /etc/init.d/varnish {start|stop|restart|reload|force-reload|configtest} ,一般安装完毕后,系统会自动启动 varnish 的, nginx 也是一样,便不赘述了</p>    <h2><strong>配置</strong></h2>    <p>安装完所需的软件后,下面需要配置这些软件,来实现这个架构</p>    <h3><strong>nginx部分</strong></h3>    <p>vi /etc/nginx/nginx.conf</p>    <pre>  <code class="language-groovy">http {      ## proxy global setting      proxy_connect_timeout 5;      proxy_read_timeout 60;      proxy_send_timeout 5;      proxy_buffer_size 16k;      proxy_buffers 4 64k;      proxy_busy_buffers_size 128k;      ##END      ## cache proxy pass      upstream cache {              server  127.0.0.1:6081;      }      ##END      ## php proxy pass      upstream php {               server  127.0.0.1:8080;      }      ##END      # Basic Settings      sendfile on;      tcp_nopush on;      tcp_nodelay on;      keepalive_timeout 65;      types_hash_max_size 2048;      server_tokens off;      #depend on nginx-extras 需要安装nginx-extras才能定义Server      more_set_headers 'Server: Bird-shark';      # server_names_hash_bucket_size 64;      # server_name_in_redirect off;      include /etc/nginx/mime.types;      default_type application/octet-stream;      ##      # SSL Settings      ##      ssl_protocols TLSv1 TLSv1.1 TLSv1.2; # Dropping SSLv3, ref: POODLE      ssl_prefer_server_ciphers on;      ##      # Logging Settings      ##      access_log /var/log/nginx/access.log;      error_log /var/log/nginx/error.log;      ##      # Gzip Settings      ##      gzip on;      gzip_disable "msie6";      gzip_vary on;      gzip_proxied any;      gzip_comp_level 6;      gzip_buffers 16 8k;      gzip_http_version 1.1;      gzip_types text/plain text/css application/json application/javascript text/xml application/xml application/xml+rss text/javascript;      ##      # Virtual Host Configs      ##      include /etc/nginx/conf.d/*.conf;      include /etc/nginx/sites-enabled/*;    }</code></pre>    <p>vi /etc/nginx/sites-available/default</p>    <pre>  <code class="language-groovy">server {      listen 80 default_server;      listen [::]:80 default_server;      index index.html index.htm index.php;      server_name localhost;      location ~ .*\.(gif|jpg|png|css|js|flv|ico|swf|html)$ {          proxy_set_header Host $host;          proxy_set_header X-Real-IP $remote_addr;          proxy_set_header X-Forwarded-For $proxy_add_x_forwarded_for;          proxy_pass http://cache;      }      # pass the PHP scripts to FastCGI server listening on 127.0.0.1:9000      #      location / {          proxy_pass http://php;          proxy_set_header Host $host;          proxy_set_header X-Real-IP $remote_addr;          proxy_set_header X-Forwarded-For $proxy_add_x_forwarded_for;          proxy_pass_header Server;      }  }  server {      listen 8080;      root /var/www/html;      index  index.html index.htm index.php;      location / {          if (!-e $request_filename){              rewrite  ^(.*)$  /index.php?s=$1  last;              break;          }          try_files $uri $uri/ =404;      }      location ~ ^(.+\.php)(.*)$ {          fastcgi_pass unix:/var/run/php5-fpm.sock;          fastcgi_intercept_errors on;          fastcgi_buffers 8 128k;          fastcgi_index  index.php;          fastcgi_param  SCRIPT_FILENAME $document_root$fastcgi_script_name;          include        fastcgi_params;      }  }</code></pre>    <h3><strong>varnish部分</strong></h3>    <p>vi /etc/varnish/default.vcl</p>    <pre>  <code class="language-groovy">vcl 4.0;    # Default backend definition. Set this to point to your content server.  backend default {      .host = "127.0.0.1";      .port = "8080";  }  acl purgers {      "localhost";      #"103.22.188.169";  }  sub vcl_recv {      # Happens before we check if we have this in cache already.      #      # Typically you clean up the request here, removing cookies you don't need,      # rewriting the request, etc.      if (req.restarts == 0) {          unset req.http.X-Purger;      }      if(req.method == "PURGE"){          if(!client.ip ~ purgers){              return(synth(405,"Not Allowed."));          }          return (purge);          #ban("obj.http.x-url ~ " + req.url);      }      if(req.method == "GET" && req.url ~ "\.(js|css|jpg|png|gif|swf|jpeg|ico)$"){          unset req.http.cookie;      }  }    sub vcl_backend_response {      #set beresp.http.x-url = bereq.url;      if (bereq.url ~ "\.(js|css|jpg|png|gif|swf|jpeg|ico)$") {          unset beresp.http.Cache-Control;          unset beresp.http.set-cookie;          set beresp.ttl = 10h;          set beresp.http.Cache-Control = "max-age=36000";          set beresp.do_gzip = true;      }      if(bereq.url ~ "\.html$"){          set beresp.ttl = 10m;          set beresp.do_gzip = true;          unset beresp.http.Cache-Control;          unset beresp.http.Pragma;          set beresp.http.Cache-Control = "max-age=600";          unset beresp.http.Expires;      }  }    sub vcl_deliver {      if (obj.hits > 0) {          set resp.http.X-Cache = "HIT from " + req.http.host;          #set resp.http.X-Cache-Hits = obj.hits;      } else {          set resp.http.X-Cache = "MISS from " + req.http.host;      }      if (req.http.X-Purger) {          set resp.http.X-Purger = req.http.X-Purger;      }      unset resp.http.X-Powered-By;      unset resp.http.Server;             unset resp.http.Via;      unset resp.http.X-Varnish;             unset resp.http.Age;      #unset resp.http.x-url; # Optional  }  sub vcl_hit {      if (req.method == "PURGE") {          return (synth(200,"Purged."));      }  }  sub vcl_miss {      if (req.method == "PURGE") {          return (synth(404,"Purged."));      }  }  sub vcl_purge {      if (req.method == "PURGE") {          #set req.http.X-Purge = "Purged";          ban("req.url ~ "+req.url);          #return (restart);          set req.method = "GET";          set req.http.X-Purger = "Purged";          return (restart);      }  }  sub vcl_hash {      hash_data(req.url);      if (req.http.host) {          hash_data(req.http.host);      } else {          hash_data(server.ip);      }      if (req.http.cookie) {          hash_data(req.http.cookie);      }      if (req.http.Accept-Encoding ~ "gzip") {          hash_data("gzip");      } elseif (req.http.Accept-Encoding ~ "deflate") {          hash_data("deflate");      }  }</code></pre>    <h2><strong>测试&分析</strong></h2>    <h3><strong>1. 在不使用缓存模块的情况下</strong></h3>    <p>vi /etc/nginx/sites-available/default</p>    <pre>  <code class="language-groovy">#location ~ .*\.(gif|jpg|png|css|js|flv|ico|swf|html)$ {  #    proxy_set_header Host $host;  #    proxy_set_header X-Real-IP $remote_addr;  #    proxy_set_header X-Forwarded-For $proxy_add_x_forwarded_for;  #    proxy_pass http://cache;  #}</code></pre>    <p>先用chrome浏览器访问查看请求头</p>    <p><img src="https://simg.open-open.com/show/4d8782a013b6b091325970266f828cb4.png"></p>    <p>我们再使用curl,在服务器上执行以下命令</p>    <pre>  <code class="language-groovy">curl -k -v 'http://192.168.99.1/Public/Home/images/t_navigation_logo.png' -H 'Pragma: no-cache' -H 'Accept-Encoding: gzip, deflate, sdch' -H 'Accept-Language: zh,en;q=0.8,zh-CN;q=0.6' -H 'Upgrade-Insecure-Requests: 1' -H 'User-Agent: Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/52.0.2743.116 Safari/537.36' -H 'Accept: text/html,application/xhtml+xml,application/xml;q=0.9,image/webp,*/*;q=0.8' -H 'Connection: keep-alive' --compressed</code></pre>    <p>发现有输出内容。</p>    <p>然后,反选disable cache</p>    <p><img src="https://simg.open-open.com/show/b72c9b38f0b016c9dd1d14faf1f73745.png"></p>    <p>然后在服务器上执行以下命令</p>    <pre>  <code class="language-groovy">curl  -k -v 'http://192.168.99.1/Public/Home/images/t_navigation_logo.png' -H 'If-None-Match: "57c6b733-1962"' -H 'Accept-Encoding: gzip, deflate, sdch' -H 'Accept-Language: zh,en;q=0.8,zh-CN;q=0.6' -H 'Upgrade-Insecure-Requests: 1' -H 'User-Agent: Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/52.0.2743.116 Safari/537.36' -H 'Accept: text/html,application/xhtml+xml,application/xml;q=0.9,image/webp,*/*;q=0.8' -H 'Connection: keep-alive' -H 'If-Modified-Since: Wed, 31 Aug 2016 10:53:39 GMT' --compressed</code></pre>    <p>发现只返回了头部信息,然而没有内容返回</p>    <p>然后我们比较两个命令 发现区别就在 -H 'Pragma: no-cache' 和 -H 'If-Modified-Since: Wed, 31 Aug 2016 10:53:39 GMT' -H 'If-None-Match: "57c6b733-1962"'</p>    <p>57c6b733-1962 这串字符对应的是服务器响应给浏览器的 ETag 部分的内容,然后我们修改一下部分的内容</p>    <pre>  <code class="language-groovy">curl  -k -v 'http://192.168.99.1/Public/Home/images/t_navigation_logo.png' -H 'If-None-Match: "57c6b733-1234"' -H 'Accept-Encoding: gzip, deflate, sdch' -H 'Accept-Language: zh,en;q=0.8,zh-CN;q=0.6' -H 'Upgrade-Insecure-Requests: 1' -H 'User-Agent: Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/52.0.2743.116 Safari/537.36' -H 'Accept: text/html,application/xhtml+xml,application/xml;q=0.9,image/webp,*/*;q=0.8' -H 'Connection: keep-alive' -H 'If-Modified-Since: Wed, 31 Aug 2016 10:53:39 GMT' --compressed</code></pre>    <p>在服务器端执行一下。发现有内容返回,所以这个 ETag 相当于 token ,它不是由 nginx 随便生成的,且跟请求链接应是一一对应的,用来标识缓存的,当服务器返回的状态为 304 的时候,这时候我们浏览器会直接找到本地的缓存数据</p>    <h3><strong>2. 在使用缓存模块的情况下</strong></h3>    <p>vi /etc/nginx/sites-available/default</p>    <pre>  <code class="language-groovy">location ~ .*\.(gif|jpg|png|css|js|flv|ico|swf|html)$ {      proxy_set_header Host $host;      proxy_set_header X-Real-IP $remote_addr;      proxy_set_header X-Forwarded-For $proxy_add_x_forwarded_for;      proxy_pass http://cache;  }</code></pre>    <p>用浏览器查看响应头</p>    <p>发现 X-Cache:MISS from 192.168.99.1 .这表示缓存未命中,然后我们刷新 X-Cache:HIT from 192.168.99.1 ,这时候发现已经命中了。</p>    <p>对于已经命中的资源文件,我们如果将其删除会出现什么效果呢,答案是,其依然可以访问,除非重启或者将缓存清除</p>    <p>但是对PURGE显然是不对外公开的,以下是服务器端用curl清除varnish缓存的命令</p>    <pre>  <code class="language-groovy">curl -v -k -X PURGE http://localhost/Public/Home/css/t_navigation.css</code></pre>    <h2><strong>结语</strong></h2>    <p>varnish 是一款内存类型的缓存软件,而非nginx扩展proxy_cache那种物理缓存类型的软件,存取速度比较快,但是也有弊端,重启后所有缓存得重写。不管怎么说,什么架子都适用的场景,要想满足业务需求还是得捣鼓透彻,而我也只是将我想到的给实现出来,毕竟资源和精力都是有限的,也就随便玩玩,诸位看客看看就好,别太认真,知道怎么回事儿就行。</p>    <p> </p>    <p>来自:https://segmentfault.com/a/1190000006853425</p>    <p> </p>