473,406 Members | 2,847 Online
Bytes | Software Development & Data Engineering Community
Post Job

Home Posts Topics Members FAQ

Join Bytes to post your question to a community of 473,406 software developers and data experts.

PASS for parallel processing

Searching for a PASS(or may be IASS) that allow to have virtual environment for parallel processing. I have a large dataset(1.5 GB in csv format), need to process this data and also visualize it again and again. My PC configuration is 2 GB RAM, and processor is Intel(R) Pentium(R) Dual CPU E2180; with this configuration I am unable to even open whole dataset in Excel or in R, using Gephi for visualization; so need any platform that will allow to process data and use Gephi too.

Please suggest some solution to deal with massive data specially in context of infrastructure support.

My requirement is similar to Hadoop but it doesn't work for me because I need to use Gephi and Python programs. In case it can work kindly point out how.
Jan 28 '14 #1
4 2375
sicarie
4,677 Expert Mod 4TB
I don't believe there is a preset program to take "any" CSV data and put it into a visualization that is useful to "every" user.

There are utilities that can split the file into smaller files so they can be opened on your computer, but you'll either need to learn how, or get someone who knows how to use a language to open the file, read in the relevant data, and then create the visualization you require.
Jan 30 '14 #2
Yes there is no program(in my knowledge too) that can accept "any" CSV data. Programs need some specified formate of input for their work; due to this reason first I have to configure the file as per Gephi/R or any S/W or program that I will use for processing and visualization.

If you are saying that I should first try all my efforts on small dataset(a part of big csv file) and then with conclusion from that, should move for full data for final conclusion, that's what acctually I am doing; Using Ubuntu(linux based) OS I can read top/end n lines of a file using head and tail commands. Working on that small part is almost done so searching for some palteform that can help me to work with whole data.

Well probably got solution http://aws.amazon.com/ec2/ thier Free Tier will work for me though not explored it more yet.
Jan 30 '14 #3
sicarie
4,677 Expert Mod 4TB
You mentioned that the file was too big to open on your desktop. If you were looking for a quick load into a tool, you could split the file and then load each view separately. However, if all the data needs to be compiled together (ie, is not historic) to get an accurate picture of what is going on.
Feb 4 '14 #4
Well yes, at present my requirement is only to work on large data file all together(complied together).

Have used split/head/tail.

Thanks for your suggestions.
Feb 4 '14 #5

Sign in to post your reply or Sign up for a free account.

Similar topics

2
by: Andrei D. | last post by:
Hello Python newsgroup, In the process of developing a big ssh wrapper for sending commands to multiple hosts over the last few months, I (almost accidentally, considering I'm really just an...
6
by: Alexandre MELARD | last post by:
HI, I have a tricky problem about parallel processing using JavaScript. a script makes use of classes. 2 objects A and B are created at intialisation. The two objects make use of the same...
6
by: Schroeder, AJ | last post by:
Group, I have googled this topic and I know this has been asked before, but there seems to be no good answer except "PHP doesn't support multi-threading" I am new to PHP so I might be using the...
0
by: nickraj | last post by:
Hi I am working on a Multithreaded client and server program . The client request multiple files to server and server has to responce them simultaneously. I am making threads at clients as per no....
14
by: Dave Booker | last post by:
I'm doing some analysis that is readily broken up into many independent pieces, on a multicore machine. I thought it would be best to just queue like 1000 of these pieces in the ThreadPool, and...
0
by: dunleav1 | last post by:
I have a raid array that I have configured as Raid 1/0 and I have DB2 tablespaces across that array. Instead of multiple containers for each tablespace I am utilzing DB2_PARALLEL_IO=*:3. All of...
1
by: nicky123 | last post by:
hi everyone, could someone tell me why is Linux required for execution of parallel processing algorithms ?
4
by: Rklawton | last post by:
I'm running Access 2003, and I've got a CPU intensive application that takes hours (or more) to run. Fortunately, I can logically split up the task so it can run in smaller pieces independently. ...
0
by: vscarberry | last post by:
Hi, I have a question concerning SQL 2005 and Ad Hoc queries. For performance due to omitting unnecessary joins based on parameters. I have created an Ad Hoc SQL statement to increase...
0
by: Charles Arthur | last post by:
How do i turn on java script on a villaon, callus and itel keypad mobile phone
1
by: nemocccc | last post by:
hello, everyone, I want to develop a software for my android phone for daily needs, any suggestions?
1
by: Sonnysonu | last post by:
This is the data of csv file 1 2 3 1 2 3 1 2 3 1 2 3 2 3 2 3 3 the lengths should be different i have to store the data by column-wise with in the specific length. suppose the i have to...
0
by: Hystou | last post by:
There are some requirements for setting up RAID: 1. The motherboard and BIOS support RAID configuration. 2. The motherboard has 2 or more available SATA protocol SSD/HDD slots (including MSATA, M.2...
0
by: Hystou | last post by:
Most computers default to English, but sometimes we require a different language, especially when relocating. Forgot to request a specific language before your computer shipped? No problem! You can...
0
Oralloy
by: Oralloy | last post by:
Hello folks, I am unable to find appropriate documentation on the type promotion of bit-fields when using the generalised comparison operator "<=>". The problem is that using the GNU compilers,...
0
by: Hystou | last post by:
Overview: Windows 11 and 10 have less user interface control over operating system update behaviour than previous versions of Windows. In Windows 11 and 10, there is no way to turn off the Windows...
0
tracyyun
by: tracyyun | last post by:
Dear forum friends, With the development of smart home technology, a variety of wireless communication protocols have appeared on the market, such as Zigbee, Z-Wave, Wi-Fi, Bluetooth, etc. Each...
0
isladogs
by: isladogs | last post by:
The next Access Europe User Group meeting will be on Wednesday 1 May 2024 starting at 18:00 UK time (6PM UTC+1) and finishing by 19:30 (7.30PM). In this session, we are pleased to welcome a new...

By using Bytes.com and it's services, you agree to our Privacy Policy and Terms of Use.

To disable or enable advertisements and analytics tracking please visit the manage ads & tracking page.