Bytes | Software Development & Data Engineering Community

How to SORT a large XML file?

3 New Member
I am searching for a way to sort XML data in very large files in a UNIX environment. I am also very new to XML and UNIX, as I am an IBM mainframe person. Any help or examples would be appreciated.

Thanks
DOLE
Feb 27 '07 #1
Varun Gupta
13 New Member
Hi

You can store XML in ordinary flat files on Unix. The file extension means very little there, because Unix treats devices and directories as files too, although the type of file can differ.
To sort a file you can use the sort command. To sort on the basis of particular patterns in the file, the sed/awk commands are a better fit.

sed 'script' filename

where the script contains:
/pattern/ action
Here /pattern/ is the text you want to match for sorting, and action is whatever you want to perform on the matching lines. (sed has no sort action of its own, so in practice the action extracts or rearranges the key and the result is piped to sort.)
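
The pattern-then-sort idea can be sketched in Python as well. This is only an illustration under strong assumptions: every record sits on a single line, the key appears in an `<id>` element, and the names `sort_record_lines` and `sample` are made up for the example. Lines that do not match the pattern (such as a wrapper tag) are dropped, and keys are compared as text, not as numbers.

```python
import re


def sort_record_lines(lines, key_pattern):
    """Sort one-record-per-line XML text by the first regex group found in each line."""
    key_re = re.compile(key_pattern)
    # Keep only lines that contain a key; wrapper lines are left out of the result.
    records = [line for line in lines if key_re.search(line)]
    return sorted(records, key=lambda line: key_re.search(line).group(1))


# Hypothetical sample data, one record per line.
sample = [
    "<records>",
    "<rec><id>30</id><name>Carol</name></rec>",
    "<rec><id>10</id><name>Alice</name></rec>",
    "<rec><id>20</id><name>Bob</name></rec>",
    "</records>",
]

sorted_lines = sort_record_lines(sample, r"<id>(\d+)</id>")
for line in sorted_lines:
    print(line)
```

This only works while the file keeps the one-record-per-line layout; for anything less regular, a real XML parser is the safer route.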
Mar 3 '07 #2
ghostdog74
511 Recognized Expert Contributor
For XML manipulation, you can use a powerful language such as Perl or Python; both come with XML parsers that make parsing XML easy. I am sure you have these tools available on your machine,
e.g. the Perl XML parser. If not, sed/awk and similar Unix tools can also be used, although that is a bit tedious.
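
As a rough sketch of the parser route in Python, assuming a flat file whose records are direct children of the root element: the records are streamed with `xml.etree.ElementTree.iterparse`, sorted in chunks, spilled to temporary files, and merged with `heapq.merge`, so the whole file never has to fit in memory. The function name `sorted_records`, the tag names `person` and `name`, and the chunk size are all assumptions for illustration. Keys are compared as plain text.

```python
import heapq
import io
import json
import tempfile
import xml.etree.ElementTree as ET


def _write_run(chunk):
    """Sort one chunk in memory and spill it to a temp file, one JSON pair per line."""
    chunk.sort()
    run = tempfile.TemporaryFile(mode="w+", encoding="utf-8")
    for pair in chunk:
        run.write(json.dumps(pair) + "\n")
    run.seek(0)
    return run


def _read_run(run):
    """Yield the [key, xml_text] pairs of one sorted run."""
    for line in run:
        yield json.loads(line)


def sorted_records(source, record_tag, key_tag, chunk_size=100_000):
    """Yield serialized record elements ordered by the text of their key_tag child."""
    context = ET.iterparse(source, events=("start", "end"))
    _event, root = next(context)  # the first event is the start of the root element
    runs, chunk = [], []
    for event, elem in context:
        if event == "end" and elem.tag == record_tag:
            elem.tail = None  # do not carry inter-record whitespace into the output
            key = elem.findtext(key_tag, default="")
            chunk.append([key, ET.tostring(elem, encoding="unicode")])
            root.clear()  # drop finished records (assumes they are children of the root)
            if len(chunk) >= chunk_size:
                runs.append(_write_run(chunk))
                chunk = []
    if chunk:
        runs.append(_write_run(chunk))
    try:
        readers = [_read_run(run) for run in runs]
        for _key, xml_text in heapq.merge(*readers, key=lambda pair: pair[0]):
            yield xml_text
    finally:
        for run in runs:
            run.close()


# Hypothetical sample; chunk_size=2 forces two sorted runs and a merge.
sample = io.BytesIO(
    b"<people>"
    b"<person><name>Carol</name></person>"
    b"<person><name>Alice</name></person>"
    b"<person><name>Bob</name></person>"
    b"</people>"
)
result = list(sorted_records(sample, "person", "name", chunk_size=2))
for record in result:
    print(record)
```

The generator yields only the sorted records, so the caller still has to write the root element's opening and closing tags around them. For a real file, pass a path or an open binary file in place of the `io.BytesIO` sample.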
Mar 3 '07 #3
davidf2007
1 New Member
CoSort for Unix, Version 9, now sorts (and converts to/from) large, flat XML files, among other mainframe (index) and sequential (csv, ldif, etc.) file formats. CoSort also has JCL sort parm conversion tools for MVS and VSE migrations.

Please see
May 17 '07 #4
Motoma
3,237 Recognized Expert Specialist
Regular expressions are always a solid way to go.
May 18 '07 #5
ghostdog74
511 Recognized Expert Contributor
Regexps are powerful, but it depends. If the code is full of regexps and something breaks, they make it difficult for the administrator or code maintainer to debug and troubleshoot.
May 19 '07 #6
CoSort 9.1.3 also runs on Windows for large, flat XML file sorts.
Nov 12 '10 #7
