Why Google Reader survived the 2nd June ban of GFW and what can you learn from it

GFW is the national firewall implemented in backbone Internet of China. It has four types of block:

  • URL block: ie. if twitter.com is blocked, any HTTP request that has a twitter.com string appears between GET and rnrn is blocked. GFW will try RST to both you and the server many times.
  • DNS pollution. All UDP port 53 are censored in China. Your domain will resolve to fake IPs if polluted. Currently known fake IPs include:

    202.106.1.2, 211.95.129.161, 211.94.66.147, 220.250.64.23, 216.234.179.13, 4.36.66.178

  • IP ban.
  • content filter. All unencrypted TCP/UDP data are monitored. You might trigger a shit load of keywords during your daily surfing without knowing anything. Like URL block, GFW will try RST to both you and the server many times if your keywords matches a certain level and quantity. And your IP & client info will be logged for further investigation if needed.

During the great ban of 2009-06-02, *.live.com, bing.com, twitter.com, flickr.com, hotmail.com, along with previous banned youtube,com and *.blogspot.com, are no longer accessible within China directly. But Google Reader, one of the main anti-☭ propaganda source, survived. Why?

Let's look at one of the typical Google Reader HTTP request:

https://www.google.com/reader/api/0/stream/contents/user%2F1338082....
    |   |            |
    |   +-----+------+
    |         |
    |         +--- DNS pollution and IP ban are unlikely,
    |              unless GFW totally bans all www.google.com
    |
    +--- https, means data transfer between your IP
         and www.google.com IP (64.233.189.99) are encrypted,
         thus URL block and content filter are useless,
         except GFW implements some sort of MITM attack which is too costly

So what can we learn from it?

  • The bigger your website are, the sooner your website get unblocked. If your little known sites get blocked, who cares?
  • Don't use HTTP GET since it's very easily URL-blocked. This also helps reduce XSS & XSRF
  • Make your URL jumping httpS compatible. Even static files.
  • Use RFC 2068 compatible CDNs like squid. So you can route all your other nodes' traffic to a node where your clients could access(ie. youtube could be accessed within China if your proxy all your www.youtube.com/* requests to www.google.cn IP). But you have to make sure your CDN can deliver content accross GFW.
  • Use as few as subdomains as possible. This reduce DNS pollution damage if you just have dozens to recover instead of thousands. If Google Reader's URL is something like reader.google.com, it's very possible to get banned years ago.

标签: ,

5 条评论 发表在“Why Google Reader survived the 2nd June ban of GFW and what can you learn from it”上

  1. happydog 说到:

    我是来学英语的 :)

    • electronixtar 说到:

      其实你是来抢沙发的。

  2. chys 说到:

    我觉得,还有一个原因,被封的都是开放的系统,一个人写了,全internet都能看。greader相对是密闭的,看的是主动订阅的东西,分享也基本上只是点对点的信息传播

  3. keakon 说到:

    我用http也没被GFW

    话说Google.com都经常被block,不知道有什么好的解决办法

    之前还能用https或者改到jp、tw,但越来越乏力了

  4. 悉尼 说到:

    哈哈,那么多机密信息~~~
    这里会不会被东篱把酒黄昏后墙啊

留下回复