FreshPorts -- The Place For Ports Tue, 9 Feb 2010 5:36 PM GMT
Do I have any twitter.com followers? I'd like to get control of twitter.com/pgcon, since I run pgcon...
How big is it

It was a few days ago that I was thinking about search engines crawling through this website. I began to wonder just how many web pages there are here. To calculate this total, it's not just a simple matter of counting files on disk. Most of the web pages are created from entries in the database. One recent evening, I started to design a formula to find out how many web pages there are. Roughly. This will not be 100% accurate, but it will be close.

Pages on disk

First, let's count the number of pages on disk:

$ ls *.php | wc -l
68
Number of categories

There is a page for each category:

# select count(*) from categories;
 count
-------
   106
(1 row)
Number of ports

There are ports, and there are deleted ports. I'll show both:

# select count(*) from ports_all where status = 'A';
 count
-------
 21261

# select count(*) from ports_all where status = 'D';
 count
-------
  5094
(1 row)
Number of files in the ports tree

There is a page for each file in the ports tree:

[dan@ngaio:/usr/ports] $ find . | wc -l
140148
[dan@ngaio:/usr/ports] $
Count last performed at Tue, 09 Feb 2010 00:10:51 GMT
Number of commits

There is a page for each commit:

# select count(*) from commit_log;
 count
-------
202845
(1 row)
Number of ports for each commit

For each commit, you can view the files modified by that commit for a particular port:

# select count(*) from commit_log_ports;
 count
-------
374381
(1 row)
How many days?

For each day, there is a page showing the commits for that day. How many days do we have?

# select count(distinct commit_date) from commit_log;
 count
-------
  3535
(1 row)
How many users?

Each user has a page:

# select count(*) from users;
 count
-------
  9879
(1 row)
How many watch lists?

For each watch list, there is a page:

# select count(*) from watch_list;
 count
-------
 10277
(1 row)
Estimated total

That gives a grand total of 767,594 pages. On my last count, that's about 0.009526% of the web pages on Google1

Notes

  • These statistics are updated daily.
  • 1The number of Google pages used in this calculation is 8,058,044,651.
How much diskspace?

The total space used by the FreshPorts database is:

# select pg_database_size('freshports.org');
 pg_database_size
------------------
9,739,958,032
(1 row)

That's bytes...

This value might be easier to parse: 9.1 GB

Login
User Login
Create account

Servers and bandwidth provided by
New York Internet
SuperNews

Search
Enter Keywords:
 
more...

Latest Vulnerabilities
otrsFeb 08
otrsFeb 08
apache13*Feb 03
apache13*Feb 03
apache13+ipv6*Feb 03
apache13-modperl*Feb 03
apache13-modssl*Feb 03
apache13-modssl*Feb 03
apache13-modssl+ipv6*Feb 03
apache13-ssl*Feb 03
squid*Feb 02
squid30*Feb 02
squid31*Feb 02
bugzillaFeb 01
ircd-ratboxJan 28

5 vulnerabilities affecting 16 ports have been reported in the past 14 days

* - modified, not new

All vulnerabilities


Ports
Home
Categories
Deleted ports
Sanity Test Failures
Newsfeeds


Statistics
Graphs
NEW Graphs (Javascript)
Traffic

Calculated hourly:
Port count 21261
Broken 172
Deprecated 36
Ignore 906
Forbidden 2
Restricted 386
No CDROM 142
Vulnerable 56
Expired 14
Set to expire 25
Interactive 79
new 24 hours 7
new 48 hours9
new 7 days29
new fortnight88
new month197

This site
What is FreshPorts?
About the Authors
FAQ
How big is it?
The latest upgrade!
Privacy
Forums
Blog
Contact

Add tab to Netscape 6

Servers and bandwidth provided by
New York Internet
SuperNews
Valid HTML, CSS, and RSS.
Copyright © 2000-2008 DVL Software Limited. All rights reserved.
This page created in 0.069 seconds.