At Forward we have been using Hive for a while and started out with the default table type (uncompressed text) and wanted to see if we could save some space and not lose too much performance.
The wiki page HiveCompressedStorage lists the possibilities
I was curious about our Hive tables total usage on HDFS and what the average filesize was with the current partitioning scheme so wrote this ruby script to calculate it.
current = ''
file_count = 0
total_size = 0
output = File.open('output.csv','w')
Yesterday we had an issue with the different behaviour of “kill ” and “kill -9 ” and in the process I had to refresh my knowledge of Unix signals, learn how you handle them in Ruby and properly learn Rubys exception hierarchy.
To -9 or not to -9?
While moving our VM deployment site written in Sinatra to a Windows
machine with the VMware PowerCLI toolkit installed the only snag was
where we forked a process to do the preparation of the machines. Both
Kernel.fork and Process.detach seemed to have
While migrating an automated VM deployment page using a combination of Sinatra on Linux and Bash scripts using the Perl toolkit with a simpler script using the VMWare PowerCLI that I love so much I needed to create a windows service from the Sinatra App
I have been meaning to learn ruby for a while and the place I am working now uses a lot so I had another look at it. I read Learn To Program, a simple but good book and found the bit on blocks and procs etc pretty good and wanted to see if I could do