Nelze vybrat více než 25 témat Téma musí začínat písmenem nebo číslem, může obsahovat pomlčky („-“) a může být dlouhé až 35 znaků.

268 řádky
15 KiB

  1. $Id: README,v 1.12 2007/10/19 12:43:48 chandrusf Exp $
  2. ibrowse is a HTTP client. The following are a list of features.
  3. - RFC2616 compliant (AFAIK)
  4. - supports GET, POST, OPTIONS, HEAD, PUT, DELETE, TRACE,
  5. MKCOL, PROPFIND, PROPPATCH, LOCK, UNLOCK, MOVE and COPY
  6. - Understands HTTP/0.9, HTTP/1.0 and HTTP/1.1
  7. - Understands chunked encoding
  8. - Can generate requests using Chunked Transfer-Encoding
  9. - Named pools of connections to each webserver
  10. - Pipelining support
  11. - Download to file
  12. - Asynchronous requests. Responses are streamed to a process
  13. - Basic authentication
  14. - Supports proxy authentication
  15. - Can talk to Secure webservers using SSL
  16. - any other features in the code not listed here :)
  17. ibrowse is available under two different licenses. LGPL and the BSD license.
  18. Comments to : Chandrashekhar.Mullaparthi@t-mobile.co.uk
  19. CONTRIBUTIONS & CHANGE HISTORY
  20. ==============================
  21. 17-10-2007 - Matthew Reilly (matthew dot reilly _at_ sipphone dot com)
  22. sent a bug report and a fix. If the chunk trailer spans two TCP
  23. packets, then ibrowse fails to recognise that the chunked transfer
  24. has ended.
  25. 29-08-2007 - Bug report by Peter Kristensen(ptx _at_ daimi dot au dot dk).
  26. ibrowse crashes when the webserver returns just the Status line
  27. and nothing else.
  28. 28-06-2007 - Added host_header option to enable connection to secure sites
  29. via stunnel
  30. 20-04-2007 - Geoff Cant sent a patch to remove URL encoding for digits in
  31. ibrowse_lib:url_encode/1.
  32. ibrowse had a dependency on the inets application because the
  33. ibrowse_http_client.erl invoked httpd_util:encode_base64/1. This
  34. dependency is now removed and the encode_base64/1 has been
  35. implemented in ibrowse_lib.erl
  36. 06-03-2007 - Eric Merritt sent a patch to support WebDAV requests.
  37. 12-01-2007 - Derek Upham sent in a bug fix. The reset_state function was not
  38. behaving correctly when the transfer encoding was not chunked.
  39. 13-11-2006 - Youns Hafri reported a bug where ibrowse was not returning the
  40. temporary filename when the server was closing the connection
  41. after sending the data (as in HTTP/1.0).
  42. Released ibrowse under the BSD license
  43. 12-10-2006 - Chris Newcombe reported bug in dealing with requests where no
  44. body is expected in the response. The first request would succeed
  45. and the next request would hang.
  46. 24-May-2006 - Sean Hinde reported a bug. Async responses with pipelining was
  47. returning the wrong result.
  48. 08-Dec-2005 - Richard Cameron (camster@citeulike.org). Patch to ibrowse to
  49. prevent port number being included in the Host header when port
  50. 80 is intended.
  51. 22-Nov-2005 - Added ability to generate requests using the Chunked
  52. Transfer-Encoding.
  53. 08-May-2005 - Youns Hafri made a CRUX LINUX port of ibrowse.
  54. http://yhafri.club.fr/crux/index.html
  55. Here are some usage examples. Enjoy!
  56. 5> ibrowse:start().
  57. {ok,<0.94.0>}
  58. %% A simple GET
  59. 6> ibrowse:send_req("http://intranet/messenger/", [], get).
  60. {ok,"200",
  61. [{"Server","Microsoft-IIS/5.0"},
  62. {"Content-Location","http://intranet/messenger/index.html"},
  63. {"Date","Fri, 17 Dec 2004 15:16:19 GMT"},
  64. {"Content-Type","text/html"},
  65. {"Accept-Ranges","bytes"},
  66. {"Last-Modified","Fri, 17 Dec 2004 08:38:21 GMT"},
  67. {"Etag","\"aa7c9dc313e4c41:d77\""},
  68. {"Content-Length","953"}],
  69. "<html>\r\n\r\n<head>\r\n<title>Messenger</title>\r\n<meta name=\"GENERATOR\" content=\"Microsoft FrontPage 5.0\">\r\n<meta name=\"ProgId\" content=\"FrontPage.Editor.Document\">\r\n<meta name=\"description\" content=\"Messenger Home Page\">\r\n</head>\r\n\r\n<frameset border=\"0\" frameborder=\"0\" rows=\"60,*\">\r\n <frame src=\"/messenger/images/topnav.html\" name=\"mFrameTopNav\" scrolling=\"NO\" target=\"mFrameMain\">\r\n <frameset cols=\"18%,*\">\r\n <frameset rows=\"*,120\">\r\n <frame src=\"index-toc.html\" name=\"mFrameTOC\" target=\"mFrameMain\" scrolling=\"auto\" noresize=\"true\">\r\n <frame src=\"/shared/search/namesearch.html\" name=\"mFrameNameSearch\" scrolling=\"NO\" target=\"mFrameMain\">\r\n </frameset>\r\n <frame src=\"home/16-12-04-xmascardsmms.htm\" name=\"mFrameMain\" scrolling=\"auto\" target=\"mFrameMain\" id=\"mFrameMain\">\r\n </frameset>\r\n <noframes>\r\n <body>\r\n\r\n <p><i>This site requires a browser that can view frames.</i></p>\r\n\r\n </body>\r\n </noframes>\r\n</frameset>\r\n\r\n</html>"}
  70. %% =============================================================================
  71. %% A GET using a proxy
  72. 7> ibrowse:send_req("http://www.google.com/", [], get, [],
  73. [{proxy_user, "XXXXX"},
  74. {proxy_password, "XXXXX"},
  75. {proxy_host, "proxy"},
  76. {proxy_port, 8080}], 1000).
  77. {ok,"302",
  78. [{"Date","Fri, 17 Dec 2004 15:22:56 GMT"},
  79. {"Content-Length","217"},
  80. {"Content-Type","text/html"},
  81. {"Set-Cookie",
  82. "PREF=ID=f58155c797f96096:CR=1:TM=1103296999:LM=1103296999:S=FiWdtAqQvhQ0TvHq; expires=Sun, 17-Jan-2038 19:14:07 GMT; path=/; domain=.google.com"},
  83. {"Server","GWS/2.1"},
  84. {"Location",
  85. "http://www.google.co.uk/cxfer?c=PREF%3D:TM%3D1103296999:S%3Do8bEY2FIHwdyGenS&prev=/"},
  86. {"Via","1.1 netapp01 (NetCache NetApp/5.5R2)"}],
  87. "<HTML><HEAD><TITLE>302 Moved</TITLE></HEAD><BODY>\n<H1>302 Moved</H1>\nThe document has moved\n<A HREF=\"http://www.google.co.uk/cxfer?c=PREF%3D:TM%3D1103296999:S%3Do8bEY2FIHwdyGenS&amp;prev=/\">here</A>.\r\n</BODY></HTML>\r\n"}
  88. %% =============================================================================
  89. %% A GET response saved to file. A temporary file is created and the
  90. %% filename returned. The response will only be saved to file is the
  91. %% status code is in the 200 range. The directory to download to can
  92. %% be set using the application env var 'download_dir' - the default
  93. %% is the current working directory.
  94. 8> ibrowse:send_req("http://www.erlang.se/", [], get, [],
  95. [{proxy_user, "XXXXX"},
  96. {proxy_password, "XXXXX"},
  97. {proxy_host, "proxy"},
  98. {proxy_port, 8080},
  99. {save_response_to_file, true}], 1000).
  100. {error,req_timedout}
  101. %% =============================================================================
  102. 9> ibrowse:send_req("http://www.erlang.se/", [], get, [],
  103. [{proxy_user, "XXXXX"},
  104. {proxy_password, "XXXXX"},
  105. {proxy_host, "proxy"},
  106. {proxy_port, 8080},
  107. {save_response_to_file, true}], 5000).
  108. {ok,"200",
  109. [{"Transfer-Encoding","chunked"},
  110. {"Date","Fri, 17 Dec 2004 15:24:36 GMT"},
  111. {"Content-Type","text/html"},
  112. {"Server","Apache/1.3.9 (Unix)"},
  113. {"Via","1.1 netapp01 (NetCache NetApp/5.5R2)"}],
  114. {file,"/Users/chandru/code/ibrowse/src/ibrowse_tmp_file_1103297041125854"}}
  115. %% =============================================================================
  116. %% Setting size of connection pool and pipeline size. This sets the
  117. %% number of maximum connections to this server to 10 and the pipeline
  118. %% size to 1. Connections are setup a required.
  119. 11> ibrowse:set_dest("www.hotmail.com", 80, [{max_sessions, 10},
  120. {max_pipeline_size, 1}]).
  121. ok
  122. %% =============================================================================
  123. %% Example using the HEAD method
  124. 56> ibrowse:send_req("http://www.erlang.org", [], head).
  125. {ok,"200",
  126. [{"Date","Mon, 28 Feb 2005 04:40:53 GMT"},
  127. {"Server","Apache/1.3.9 (Unix)"},
  128. {"Last-Modified","Thu, 10 Feb 2005 09:31:23 GMT"},
  129. {"Etag","\"8d71d-1efa-420b29eb\""},
  130. {"Accept-ranges","bytes"},
  131. {"Content-Length","7930"},
  132. {"Content-Type","text/html"}],
  133. []}
  134. %% =============================================================================
  135. %% Example using the OPTIONS method
  136. 62> ibrowse:send_req("http://www.sun.com", [], options).
  137. {ok,"200",
  138. [{"Server","Sun Java System Web Server 6.1"},
  139. {"Date","Mon, 28 Feb 2005 04:44:39 GMT"},
  140. {"Content-Length","0"},
  141. {"P3p",
  142. "policyref=\"http://www.sun.com/p3p/Sun_P3P_Policy.xml\", CP=\"CAO DSP COR CUR ADMa DEVa TAIa PSAa PSDa CONi TELi OUR SAMi PUBi IND PHY ONL PUR COM NAV INT DEM CNT STA POL PRE GOV\""},
  143. {"Set-Cookie",
  144. "SUN_ID=X.X.X.X:169191109565879; EXPIRES=Wednesday, 31-Dec-2025 23:59:59 GMT; DOMAIN=.sun.com; PATH=/"},
  145. {"Allow",
  146. "HEAD, GET, PUT, POST, DELETE, TRACE, OPTIONS, MOVE, INDEX, MKDIR, RMDIR"}],
  147. []}
  148. %% =============================================================================
  149. %% Example of using Asynchronous requests
  150. 18> ibrowse:send_req("http://www.google.com", [], get, [],
  151. [{proxy_user, "XXXXX"},
  152. {proxy_password, "XXXXX"},
  153. {proxy_host, "proxy"},
  154. {proxy_port, 8080},
  155. {stream_to, self()}]).
  156. {ibrowse_req_id,{1115,327256,389608}}
  157. 19> flush().
  158. Shell got {ibrowse_async_headers,{1115,327256,389608},
  159. "302",
  160. [{"Date","Thu, 05 May 2005 21:06:41 GMT"},
  161. {"Content-Length","217"},
  162. {"Content-Type","text/html"},
  163. {"Set-Cookie",
  164. "PREF=ID=b601f16bfa32f071:CR=1:TM=1115327201:LM=1115327201:S=OX5hSB525AMjUUu7; expires=Sun, 17-Jan-2038 19:14:07 GMT; path=/; domain=.google.com"},
  165. {"Server","GWS/2.1"},
  166. {"Location",
  167. "http://www.google.co.uk/cxfer?c=PREF%3D:TM%3D1115327201:S%3DDS9pDJ4IHcAuZ_AS&prev=/"},
  168. {"Via",
  169. "1.1 hatproxy01 (NetCache NetApp/5.6.2)"}]}
  170. Shell got {ibrowse_async_response,{1115,327256,389608},
  171. "<HTML><HEAD><TITLE>302 Moved</TITLE></HEAD><BODY>\n<H1>302 Moved</H1>\nThe document has moved\n<A HREF=\"http://www.google.co.uk/cxfer?c=PREF%3D:TM%3D1115327201:S%3DDS9pDJ4IHcAuZ_AS&amp;prev=/\">here</A>.\r\n</BODY></HTML>\r\n"}
  172. Shell got {ibrowse_async_response_end,{1115,327256,389608}}
  173. ok
  174. %% =============================================================================
  175. %% Another example of using async requests
  176. 24> ibrowse:send_req("http://yaws.hyber.org/simple_ex2.yaws", [], get, [],
  177. [{proxy_user, "XXXXX"},
  178. {proxy_password, "XXXXX"},
  179. {proxy_host, "proxy"},
  180. {proxy_port, 8080},
  181. {stream_to, self()}]).
  182. {ibrowse_req_id,{1115,327430,512314}}
  183. 25> flush().
  184. Shell got {ibrowse_async_headers,{1115,327430,512314},
  185. "200",
  186. [{"Date","Thu, 05 May 2005 20:58:08 GMT"},
  187. {"Content-Length","64"},
  188. {"Content-Type","text/html;charset="},
  189. {"Server",
  190. "Yaws/1.54 Yet Another Web Server"},
  191. {"Via",
  192. "1.1 hatproxy01 (NetCache NetApp/5.6.2)"}]}
  193. Shell got {ibrowse_async_response,{1115,327430,512314},
  194. "<html>\n\n\n<h1> Yesssssss </h1>\n\n<h2> Hello again </h2>\n\n\n</html>\n"}
  195. Shell got {ibrowse_async_response_end,{1115,327430,512314}}
  196. %% =============================================================================
  197. %% Example of request which fails when using the async option. Here
  198. %% the {ibrowse_req_id, ReqId} is not returned. Instead the error code is
  199. %% returned.
  200. 68> ibrowse:send_req("http://www.earlyriser.org", [], get, [], [{stream_to, self()}]).
  201. {error,conn_failed}
  202. %% Example of request using both Proxy-Authorization and authorization by the final webserver.
  203. 17> ibrowse:send_req("http://www.erlang.se/lic_area/protected/patches/erl_756_otp_beam.README",
  204. [], get, [],
  205. [{proxy_user, "XXXXX"},
  206. {proxy_password, "XXXXX"},
  207. {proxy_host, "proxy"},
  208. {proxy_port, 8080},
  209. {basic_auth, {"XXXXX", "XXXXXX"}}]).
  210. {ok,"200",
  211. [{"Accept-Ranges","bytes"},
  212. {"Date","Thu, 05 May 2005 21:02:09 GMT"},
  213. {"Content-Length","2088"},
  214. {"Content-Type","text/plain"},
  215. {"Server","Apache/1.3.9 (Unix)"},
  216. {"Last-Modified","Tue, 03 May 2005 15:08:18 GMT"},
  217. {"ETag","\"1384c8-828-427793e2\""},
  218. {"Via","1.1 hatproxy01 (NetCache NetApp/5.6.2)"}],
  219. "Patch Id:\t\terl_756_otp_beam\nLabel:\t\t\tinets patch\nDate:\t\t\t2005-05-03\nTrouble Report Id:\tOTP-5513, OTP-5514, OTP-5516, OTP-5517, OTP-5521, OTP-5537\nSeq num:\t\tseq9806\nSystem:\t\t\totp\nRelease:\t\tR10B\nOperating System:\tall\nArchitecture:\t\tall\nErlang machine:\t\tBEAM\nApplication:\t\tinets-4.4\nFiles:\t\t\tall\n\nDescription:\n\n OTP-5513 The server did not handle HTTP-0.9 messages with an implicit\n\t version.\n\n OTP-5514 An internal server timeout killed the request handling\n\t process without sending a message back to the client. As this\n\t timeout only affects a single request it has been set to\n\t infinity (if the main server process dies the request\n\t handling process will also die and the client will receive an\n\t error). This might make a client that does not use a timeout\n\t hang for a longer period of time, but that is an expected\n\t behavior!\n\n OTP-5516 That a third party closes the http servers accept socket is\n\t recoverable for inets, hence intes will only produce an info\n\t report as there was no error in inets but measures where\n\t taken to avoid failure due to errors elsewhere.\n\n OTP-5517 The HTTP client proxy settings where ignored. Bug introduced\n\t in inets-4.3.\n\n OTP-5521 Inets only sent the \"WWW-Authenticate\" header at the first\n\t attempt to get a page, if the user supplied the wrong\n\t user/password combination the header was not sent again. This\n\t forces the user to kill the browser entirely after a failed\n\t login attempt, before the user may try to login again. Inets\n\t now always send the authentication header.\n\n OTP-5537 A major rewrite of big parts of the HTTP server code was\n\t performed. There where many things that did not work\n\t satisfactory. Cgi script handling can never have worked\n\t properly and the cases when it did sort of work, a big\n\t unnecessary delay was enforced. Headers where not always\n\t treated as expected and HTTP version handling did not work,\n\t all responses where sent as version HTTP/1.1 no matter what.\n\n\n"}
  220. %% =============================================================================
  221. %% Example of a TRACE request. Very interesting! yaws.hyber.org didn't
  222. %% support this. Nor did www.google.com. But good old BBC supports
  223. %% this.
  224. 35> 37> ibrowse:send_req("http://www.bbc.co.uk/", [], trace, [],
  225. [{proxy_user, "XXXXX"},
  226. {proxy_password, "XXXXX"},
  227. {proxy_host, "proxy"},
  228. {proxy_port, 8080}]).
  229. {ok,"200",
  230. [{"Transfer-Encoding","chunked"},
  231. {"Date","Thu, 05 May 2005 21:40:27 GMT"},
  232. {"Content-Type","message/http"},
  233. {"Server","Apache/2.0.51 (Unix)"},
  234. {"Set-Cookie",
  235. "BBC-UID=7452e72a29424c5b0b232c7131c7d9395d209b7170e8604072e0fcb3630467300; expires=Mon, 04-May-09 21:40:27 GMT; path=/; domain=bbc.co.uk;"},
  236. {"Set-Cookie",
  237. "BBC-UID=7452e72a29424c5b0b232c7131c7d9395d209b7170e8604072e0fcb3630467300; expires=Mon, 04-May-09 21:40:27 GMT; path=/; domain=bbc.co.uk;"},
  238. {"Via","1.1 hatproxy01 (NetCache NetApp/5.6.2)"}],
  239. "TRACE / HTTP/1.1\r\nHost: www.bbc.co.uk\r\nConnection: keep-alive\r\nX-Forwarded-For: 172.24.28.29\r\nVia: 1.1 hatproxy01 (NetCache NetApp/5.6.2)\r\nCookie: BBC-UID=7452e72a29424c5b0b232c7131c7d9395d209b7170e8604072e0fcb3630467300\r\n\r\n"}