author Corentin Chary <corentincj@iksaif.net> 2011-09-21 10:09:50 +0200
committer Corentin Chary <corentincj@iksaif.net> 2011-09-21 10:09:50 +0200
commit 14971584af4122cc65352bbf09c8c7b609457a67
tree 186df7e27f1b9de306f9141941127c7a3fb9dd02
parent euscan: blacklist art.gnome.org
euscan: robots.txt, timeout, user-agent, ...
- Add a blacklist for robots.txt, we *want* to scan sourceforge
- Set a user-agent that doesn't look like a browser
- Handle timeouts more carefully
- If brute force detects too many versions, avoid infinite loops
- Handle redirections more carefully

Signed-off-by: Corentin Chary <corentincj@iksaif.net>
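
The points in the commit message above can be sketched roughly as follows. This is only an illustration, not the code from this commit: the names `USER_AGENT`, `ROBOTS_TXT_BLACKLIST`, `MAX_BRUTEFORCE_HITS`, `may_fetch`, `build_request` and `brute_force` are hypothetical, and the values are made up. It assumes Python 3 and only the standard library.

```python
from urllib.parse import urlparse
from urllib.request import Request
from urllib.robotparser import RobotFileParser

# Hypothetical values for illustration; not taken from the euscan source.
USER_AGENT = "euscan-sketch/0.1"
ROBOTS_TXT_BLACKLIST = {"sourceforge.net"}  # hosts scanned regardless of robots.txt
MAX_BRUTEFORCE_HITS = 30
DEFAULT_TIMEOUT = 10  # seconds, meant to be passed to urlopen(..., timeout=...)


def host_is_blacklisted(url, blacklist=ROBOTS_TXT_BLACKLIST):
    """Return True if robots.txt should be ignored for this URL's host."""
    host = urlparse(url).hostname or ""
    return any(host == d or host.endswith("." + d) for d in blacklist)


def may_fetch(url, robots_lines, user_agent=USER_AGENT):
    """Decide whether `url` may be scanned.

    `robots_lines` is the already-downloaded robots.txt, split into lines.
    """
    if host_is_blacklisted(url):
        return True
    parser = RobotFileParser()
    parser.parse(robots_lines)
    return parser.can_fetch(user_agent, url)


def build_request(url, user_agent=USER_AGENT):
    """Build a request that identifies itself as a scanner, not a browser."""
    return Request(url, headers={"User-Agent": user_agent})


def brute_force(candidates, exists, limit=MAX_BRUTEFORCE_HITS):
    """Probe candidate versions, stopping once `limit` hits are collected.

    The cap guards against servers that answer positively for every URL,
    which would otherwise keep the scan going indefinitely.
    """
    found = []
    for version in candidates:
        if exists(version):
            found.append(version)
            if len(found) >= limit:
                break
    return found
```

A caller would typically pass the request to `urllib.request.urlopen(build_request(url), timeout=DEFAULT_TIMEOUT)` and treat a timeout as "nothing found" rather than as a fatal error; how euscan itself handles this is not shown on this page.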
Diffstat (limited to 'TODO')
-rw-r--r-- TODO 9
1 files changed, 4 insertions, 5 deletions
diff --git a/TODO b/TODO
index f7d4993..0c18672 100644
--- a/TODO
+++ b/TODO
@@ -4,16 +4,12 @@ TODO
euscan
------
-- respect robots.txt (portscout)
- check other distros (youri)
-- clean blacklist system
-- add a way to blacklist versions using standard package tokens
- - =x11-drivers/xf86-video-intel-2.14.90*
- - >=x11-base/xorg-server-1.10.900
Site Handlers
-------------
+- sourceforge: http://sourceforge.net/api/file/index/project-name/vboxgtk/mtime/desc/limit/20/rss http://sourceforge.net/api/release/index/project-id/264534/rss
- ftp.kde.org: doesn't scan the "unstable" tree
- mysql: should use http://downloads.mysql.com/archives/
- mariadb: should use http://downloads.askmonty.org/MariaDB/+releases/
@@ -22,3 +18,6 @@ euscanwww
---------
- add progress options for each command
+- add last scan in the footer
+- add json/xml for each page
+- rss scan world + post ?