uPortal IRC Logs-2007-06-08

[06:25:48 EDT(-0400)] * jayshao (n=jayshao@ool-44c59034.dyn.optonline.net) has joined ##uportal
[08:39:09 EDT(-0400)] * jayshao (n=jayshao@ool-44c59034.dyn.optonline.net) has joined ##uportal
[09:36:11 EDT(-0400)] * jayshao (n=jayshao@jshao.rutgers.edu) has joined ##uportal
[09:48:42 EDT(-0400)] * jayshao (n=jayshao@jshao.rutgers.edu) has joined ##uportal
[10:43:13 EDT(-0400)] * EricDalquist (n=dalquist@bohemia.doit.wisc.edu) has joined ##uportal
[12:09:10 EDT(-0400)] * athena7 (n=athena@lumina.its.yale.edu) has joined ##uportal
[12:10:03 EDT(-0400)] <athena7> apparently google's image server doesn't like the portal's proxy servlet
[12:10:20 EDT(-0400)] <EricDalquist> hrm
[12:10:21 EDT(-0400)] <athena7> all the images linked from their RSS feeds are returning 403 errors
[12:10:39 EDT(-0400)] <EricDalquist> I wonder what the proxy servlet is sending as the user-agent
[12:11:16 EDT(-0400)] <athena7> yeah i'm not sure if it's the user agent or if they blocked us after too many requests
[12:11:29 EDT(-0400)] <athena7> i'm kind of guessing the first though
[12:11:53 EDT(-0400)] <EricDalquist> that would be what I would check
[12:12:06 EDT(-0400)] <EricDalquist> it may be worth adding some logic to the proxy servlet to pass along the user-agen
[12:12:24 EDT(-0400)] <athena7> sounds reasonable
[12:12:28 EDT(-0400)] <EricDalquist> so it looks at the incoming request and passes it along
[12:12:33 EDT(-0400)] <athena7> any particular one you'd recommend?
[12:12:46 EDT(-0400)] <EricDalquist> it may need to be switched to using commons-httpclient though to set the user-agent
[12:14:12 EDT(-0400)] <EricDalquist> so what I would look at doing is adding logic along the lines of the following to the proxy servlet (this would be a great improvement to pass along to the rest of ja-sig too)
[12:14:37 EDT(-0400)] <EricDalquist> 1) Get the agent from the portal user's browser: agent = userRequest.getHeader("User-Agent");
[12:14:53 EDT(-0400)] <athena7> the current proxy servlet actually is using httpclient
[12:14:57 EDT(-0400)] <athena7> yeah, that makes sense
[12:15:08 EDT(-0400)] <athena7> actually, we just had to switch to the httpproxyservlet in trunk
[12:15:09 EDT(-0400)] <EricDalquist> when setting up the remote request set the User-Agent to what you got from the user's browser
[12:15:23 EDT(-0400)] <athena7> because it turned out ours was leaking file descriptors whenever it got a 403 error
[17:54:06 EDT(-0400)] * colinclark (n=atrcwrk2@a80-127-26-41.adsl.xs4all.nl) has joined ##uportal
[18:11:22 EDT(-0400)] * jayshao_ (n=jayshao@wg-206-hill012.rutgers.edu) has joined ##uportal
[20:04:37 EDT(-0400)] * colinclark (n=atrcwrk2@a80-127-26-41.adsl.xs4all.nl) has left ##uportal
[22:18:21 EDT(-0400)] * jayshao (n=jayshao@ool-44c59034.dyn.optonline.net) has joined ##uportal