This is an old revision of the document!
Full scan of March 2020 - are people self isolating for COVID-19?
Extract just March's data, then import it to the database:
grep -E "^2020-03" tcpdump.log > /tmp/march.wifi.log php trim.php /tmp/march.wifi.log > march-trimmed.csv
Raw data for March was 21Gb, approx 168 million lines.
Then import to the mySQL database:
LOAD DATA INFILE 'march-trimmed.csv' INTO TABLE wifi_data;
SELECT DATE(seen_time), COUNT(mac) FROM simple_data WHERE DATE(seen_time) >= '2020-03-01' AND seen_time <= '2020-04-01' GROUP BY DATE(seen_time) INTO OUTFILE '/tmp/march-rawdata.csv';
Date | Hits |
2020-03-01 | 119994 |
2020-03-02 | 122174 |
2020-03-03 | 127180 |
2020-03-04 | 138274 |
2020-03-05 | 142102 |
2020-03-06 | 150793 |
2020-03-07 | 142417 |
2020-03-08 | 135554 |
2020-03-09 | 137481 |
2020-03-10 | 125954 |
2020-03-11 | 131901 |
2020-03-12 | 135391 |
2020-03-13 | 127303 |
2020-03-14 | 118234 |
2020-03-15 | 113126 |
2020-03-16 | 106237 |
2020-03-17 | 110425 |
2020-03-18 | 106013 |
2020-03-19 | 108127 |
2020-03-20 | 115171 |
2020-03-21 | 113031 |
2020-03-22 | 108512 |
2020-03-23 | 118570 |
2020-03-24 | 110394 |
2020-03-25 | 109092 |
2020-03-26 | 113263 |
2020-03-27 | 117891 |
2020-03-28 | 114374 |
2020-03-29 | 111808 |
2020-03-30 | 116348 |
2020-03-31 | 119926 |