Page Menu
Home
c4science
Search
Configure Global Search
Log In
Files
F65134364
pig2gdf.py
No One
Temporary
Actions
Download File
Edit File
Delete File
View Transforms
Subscribe
Mute Notifications
Award Token
Subscribers
None
File Metadata
Details
File Info
Storage
Attached
Created
Sat, Jun 1, 02:43
Size
574 B
Mime Type
text/x-python
Expires
Mon, Jun 3, 02:43 (2 d)
Engine
blob
Format
Raw Data
Handle
18009843
Attached To
R1473 warcbase
pig2gdf.py
View Options
#!/usr/bin/env python
# Converts pig output of format Time interval, Source, Target, Weight
# to GDF file
# -jrwiebe
import
csv
,
sys
node
=
set
()
edge
=
[]
with
open
(
sys
.
argv
[
1
],
'r'
)
if
len
(
sys
.
argv
)
>
1
else
sys
.
stdin
as
f
:
reader
=
csv
.
reader
(
f
,
delimiter
=
'
\t
'
)
for
row
in
reader
:
node
.
add
(
row
[
1
])
node
.
add
(
row
[
2
])
edge
.
append
([
row
[
1
],
row
[
2
],
row
[
3
],
row
[
0
]])
print
"nodedef> name VARCHAR"
for
n
in
node
:
print
n
print
"edgedef> source VARCHAR, target VARCHAR, weight DOUBLE, timeint VARCHAR"
for
e
in
edge
:
print
"
%s
,
%s
,
%s
,
%s
"
%
(
e
[
0
],
e
[
1
],
e
[
2
],
e
[
3
])
Event Timeline
Log In to Comment