tangled
alpha
login
or
join now
dekker.one
/
Fourmi
0
fork
atom
A web scraper build to search specific information for a given compound (and its pseudonyms)
0
fork
atom
overview
issues
pulls
pipelines
Merge branch 'release/v0.5.1' into develop
dekker.one
11 years ago
71e5f46f
cd058cab
+107
-1
3 changed files
expand all
collapse all
unified
split
Changelog.md
SIGNED.md
fourmi.py
+3
Changelog.md
···
1
1
+
### v0.5.1
2
2
+
- UPDATED: Logging functionality from command line
3
3
+
- DEV: Code cleanup and extra tests
+103
SIGNED.md
···
1
1
+
##### Signed by https://keybase.io/jdekker
2
2
+
```
3
3
+
-----BEGIN PGP SIGNATURE-----
4
4
+
Version: GnuPG v1.4.11 (GNU/Linux)
5
5
+
6
6
+
iQIcBAABAgAGBQJTnfAAAAoJEJrQ9RIUCT6/KZIQAME07yzAG5hnqsQof5ESoeQs
7
7
+
5wBxAhiBIX/0yn3qIT/eMh0ubCKUZsqJ3/PzUljeMJ6CGtwxFYfTWkgjYlOoAz9G
8
8
+
fS7CjPmRPyiu+MFo5he+oVRmLUMqfuLUrCyuIxJwMXq5YbQvzyqiffvxr8VRULtV
9
9
+
3c0drWfQMX1ZeAWSIYN0xuMndzvaqIAQU6o4tSQf/rUiKlM2NnTDNUHu2PY9FED/
10
10
+
IJwM/IgAMAkJARyL7ltq6pHzORsu7sd2Nhv0esa0Gs2GSuRjKueeMZvJzpDAufy9
11
11
+
bWn9EqKhVwPR6zWnXRmNj9Ymj1w167hIUYcBdFhC7kie5zv9+pDE6d/s7pw/Rejd
12
12
+
L0k8LKBGtJ8o7SKYR9kcNLDWXEnHjfCraD+14FMYqQPcz2ekoV6Exv/mP8qRPwUc
13
13
+
b+FtjJtW8fEiOMAyjMOvLTzYbCVwjdErAqgNdHeSByi1nxfrphjajRiNUt7fVimJ
14
14
+
++QZzKCj6xN2MuTJ41KbZ8teiUXwQB4OKKij0fgoy0RBwW0vqH6MF7cCKm1zT1Qa
15
15
+
9FGlBU2jSybQqUu4lJ/eUjO/3tQMhJErQJU/i+6lwi7OMnS9J/g17Heghp5Hxyhc
16
16
+
VWvhR56pbWLIL2XQqDGGEqPDIzXohHnbRJ1N71b06akIvIIrTqc6Glu4PJeUG/Pe
17
17
+
EF8/jBwydxbKUOyKRSQS
18
18
+
=xWbc
19
19
+
-----END PGP SIGNATURE-----
20
20
+
21
21
+
```
22
22
+
23
23
+
<!-- END SIGNATURES -->
24
24
+
25
25
+
### Begin signed statement
26
26
+
27
27
+
#### Expect
28
28
+
29
29
+
```
30
30
+
size exec file contents
31
31
+
./
32
32
+
17591 .coverage 1dd1207846db74e407d3a4a1951b8e81934a4693385d39f6c337a224375bad39|1b7ead09cf213b5a9545557be982aaa30238b689bb54adf604f82b12ef521eb2
33
33
+
375 .gitignore d2e475a6a4fa51422cac0a07495914e776858fb9ab9c8937a4d491a3e042d6b1
34
34
+
464 .travis.yml 3063ba078607b8d16bd6467afc15fbbaa4b26c1e30be5ce7cef453cfccbaa95c
35
35
+
97 Changelog.md bcbce9a33bbbbcd18fd7788e6dc3a9c4b13dff7128ea99968994c1b290ddc931
36
36
+
FourmiCrawler/
37
37
+
0 __init__.py e3b0c44298fc1c149afbf4c8996fb92427ae41e4649b934ca495991b7852b855
38
38
+
304 items.py b00d49a3d53fa13306c7f8b023adb93ab88423c4fce46600689814f6b02bb806
39
39
+
2178 pipelines.py f9b7b84938060751e15e45de5133dffe50c798bff2a20019206fe7c9d677ad49
40
40
+
716 settings.py 37a8f63e123bccc77076d574617a522b30c1d7c5e893ec3d78cc40e1563dd8a6
41
41
+
sources/
42
42
+
9991 ChemSpider.py 847013e34c5c3683ec66a337837287512b4bab9fbea2ece12e4130ab0dbf264d
43
43
+
9898 NIST.py 97abc84fce85c47b789822715a1945ab84cc052a32340c861141c1af66bab644
44
44
+
6907 WikipediaParser.py 5d6de911c773129a34b76c40a9b547aafc67644a15f39cd0be6afc7a16fb0f97
45
45
+
0 __init__.py e3b0c44298fc1c149afbf4c8996fb92427ae41e4649b934ca495991b7852b855
46
46
+
1262 source.py 16c4cdfca849b7dc2bc89d7a6f7ad021f4aa1d04234394312f1d0edf0fd9c5a4
47
47
+
3026 spider.py 1ffba2512988b7a6b535a4a31a4ef688ece4f8c595c3d50355c34ef46b23e44a
48
48
+
1081 LICENSE 36951e5f1910bad3e008ab7228f35ad8933192e52d3c3ae6a5e875765e27192c
49
49
+
3965 README.md d21236d6a175be28ef8e2fee8a256e95b6a513163e3f1071c26c62e9093db7f3
50
50
+
3659 x fourmi.py 81781ed7299e447e6fc551fba69e62cd7a1d63f27dfa063927f4c5c10f5ac331
51
51
+
200850 log.txt d76e741f9e7b67c2574e9cdbbe499ea4861f6e0bd11e5962fdaf9d8720effef8
52
52
+
184692 results.csv 31132f7f394babeb5dfd249aaa714756017b2e1b314b6715f57e6ad9524e5be8|d0bb724f6d714ec7a4a1ad2052f70dd4510b5ac08d616e24b5e9a903dedab586
53
53
+
261 scrapy.cfg 624c068fd06303daa65b8e0d0d3ef88ac1f123be2694ef5b4f3f9a9dcd983f85
54
54
+
tests/
55
55
+
1 __init__.py 01ba4719c80b6fe911b091a7c05124b64eeece964e09c058ef8f9805daca546b
56
56
+
2837 test_configurator.py 4a0eb6e7121eb09a63ab5cb797570d1a42080c5346c3b8b365da56eefa599e80
57
57
+
1892 test_pipeline.py 387a336b0f36722a20e712aa033e5771c44f9e92561dd73acffd53d622c52031
58
58
+
1260 test_sourceloader.py b108b4b80adcdb7401273a9823b1f1a19eb5178776186eb5a9976aed8b1ee869
59
59
+
2113 test_spider.py 300f280377b522737be0d8e4a80031ab118a4011bdbb92131e9c400fcdab6299
60
60
+
utils/
61
61
+
0 __init__.py e3b0c44298fc1c149afbf4c8996fb92427ae41e4649b934ca495991b7852b855
62
62
+
3552 configurator.py e2b7e0ee6c1fef4373785dfe5df8ec6950f31ce6a5d9632b69a66ea3d1eaf921
63
63
+
2537 sourceloader.py f5a5ac2a6aba0658dbe11361f465caabcf3c06c5c8dc9a631874211cc19d2d37
64
64
+
```
65
65
+
66
66
+
#### Ignore
67
67
+
68
68
+
```
69
69
+
/SIGNED.md
70
70
+
```
71
71
+
72
72
+
#### Presets
73
73
+
74
74
+
```
75
75
+
git # ignore .git and anything as described by .gitignore files
76
76
+
dropbox # ignore .dropbox-cache and other Dropbox-related files
77
77
+
kb # ignore anything as described by .kbignore files
78
78
+
```
79
79
+
80
80
+
<!-- summarize version = 0.0.9 -->
81
81
+
82
82
+
### End signed statement
83
83
+
84
84
+
<hr>
85
85
+
86
86
+
#### Notes
87
87
+
88
88
+
With keybase you can sign any directory's contents, whether it's a git repo,
89
89
+
source code distribution, or a personal documents folder. It aims to replace the drudgery of:
90
90
+
91
91
+
1. comparing a zipped file to a detached statement
92
92
+
2. downloading a public key
93
93
+
3. confirming it is in fact the author's by reviewing public statements they've made, using it
94
94
+
95
95
+
All in one simple command:
96
96
+
97
97
+
```bash
98
98
+
keybase dir verify
99
99
+
```
100
100
+
101
101
+
There are lots of options, including assertions for automating your checks.
102
102
+
103
103
+
For more info, check out https://keybase.io/docs/command_line/code_signing
+1
-1
fourmi.py
···
69
69
70
70
# The start for the Fourmi Command Line interface.
71
71
if __name__ == '__main__':
72
72
-
arguments = docopt.docopt(__doc__, version='Fourmi - V0.5.0')
72
72
+
arguments = docopt.docopt(__doc__, version='Fourmi - V0.5.1')
73
73
loader = SourceLoader()
74
74
75
75
if arguments["--include"]: