Parsing Abstract Files.

I have been charged with parsing the data from many abstract files and then entering this information into a SQL database.
The file format is rather unusual and certainly not delimited in any reasonable fashion.
I was wondering if anyone would be able to assist or provide an idea of how I may go about parsing this data for insertion.

The plan is to write a small application in VB that will let me run through each text file and later search the database by selected criteria.

Below is an example of the file that we are required to parse and store:

Title       : Photochemistry of Charge Transfer Excited States
Type        : Award
NSF Org     : CHE
Latest
Amendment
Date        : February 26,  1992
File        : a9123000

Award Number: 9123000
Award Instr.: Standard Grant
Prgm Manager:
          CHE  DIVISION OF CHEMISTRY
          MPS  DIRECT FOR MATHEMATICAL & PHYSICAL SCIEN
Start Date  : April 1,  1992
Expires     : March 31,  1995      (Estimated)
Expected
Total Amt.  : $143300             (Estimated)
Investigator: Kirk S. Schanze kschanze@chem.ufl.edu  (Principal Investigator current)
Sponsor     : University of Florida
          219 Grinter Hall
          Gainesville, FL  32611    352/392-1582

NSF Program : 1966      SYNTHETIC INORGANIC
Fld Applictn: 0306000   Energy Research & Resources
              12        Chemistry
Program Ref :
Abstract    :
              This award, from the Inorganic, Bioinorganic and Organo-
              metallic Chemistry Program, is for the support of studies of
              the photochemistry of excited states of two types of rhenium
              complexes in which the rhenium atom has six d-electrons.  In
              the first case the complex will contain a bidentate diimine
              ligand that serves as an electron acceptor in the excited state
              and a monodentate ligand that serves as an electron donor in
              the excited state, and which is reactive in its electron
              deficient form.  Photochemical excitation of these molecules
              will generate a ligand-ligand charge transfer (LLCT) excited
              state.  Objectives of this phase of the investigation include:
              1) delineation of the photochemical reactions that occur
              andidentification of reactive intermediates produced via LLCT
              excitation, 2) determination of the kinetics of rearrangements
              and bond fragmentations of the reactive groups, and 3)
              utilization of the rate constants obtained in 2) to determine
              the effects of structural and energetic factors on the lifetime
              of the LLCT excited state.  In the second case, sigma-to-pi
              antibonding excitation processes in rhenium(I) complexes that
              contain bidentate diimine and alkyl ligands will be examined.
              Here the results of bond scission in the excited state will
              produce metal- and carbon-based radicals.  A primary objective
              of this work is to relate rhenium-carbon bond strength to the
              rate of excited state bond fragmentation.
              %%%
              Charge transfer in photochemically generated excited states is
              a fundamental aspect of biological light energy harvesting
              systems.  How charge transfer states are sufficiently
              stabilized in biological systems so that productive reactions
              occur faster than charge recombination is a mystery that
              continues to stimulate exciting research.  Successful emulation
              of biological systems could mean more efficient utilization of
              solar energy for technological purposes; determination of the
              reactivity of the excited state in systems such as those to be
              examined here may be useful in this regard.  The chemical
              systems under study may also be useful as photochemically
              activated catalysts for polymerization reactions.
Jul 31 '07 #1
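Although the sample in post #1 is not delimited, it does seem to follow a regular layout: a label, a colon, and a value, with indented lines continuing the previous field and a few labels ("Latest / Amendment / Date", "Expected / Total Amt.") split over several lines. What follows is only a rough sketch of a line-by-line parser built on those assumptions. It is written in Python rather than VB for brevity, the function name `parse_award` is made up for illustration, and it assumes the forum's display line numbers are not part of the real files. The same loop should port to VB with little change.

```python
# Abridged copy of the record from post #1 (display line numbers removed).
SAMPLE = """\
Title       : Photochemistry of Charge Transfer Excited States
Type        : Award
Latest
Amendment
Date        : February 26,  1992
Award Number: 9123000
Sponsor     : University of Florida
          219 Grinter Hall
Expected
Total Amt.  : $143300             (Estimated)
Abstract    :
              Charge transfer in photochemically generated excited states is
              a fundamental aspect of biological light energy harvesting
"""


def parse_award(text):
    """Parse one record into a dict mapping field label -> value string.

    Assumptions (not confirmed by the thread): labels start in column 0,
    continuation lines start with whitespace, and a label line without a
    colon is the first part of a multi-line label.
    """
    fields = {}
    key = None      # label of the field currently being filled
    pending = []    # label fragments such as "Latest", "Amendment"
    for raw in text.splitlines():
        line = raw.rstrip()
        if not line:
            continue  # blank separator line
        if line[0].isspace():
            # Indented line: continuation of the most recent field.
            if key is not None:
                fields[key].append(line.strip())
            continue
        if ":" in line:
            name, value = line.split(":", 1)
            key = " ".join(pending + [name.strip()])
            pending = []
            fields[key] = [value.strip()]
        else:
            pending.append(line.strip())
    # Join continuation lines and collapse runs of spaces.
    return {k: " ".join(" ".join(v).split()) for k, v in fields.items()}


record = parse_award(SAMPLE)
print(record["Latest Amendment Date"])
print(record["Sponsor"])
```

Two limitations to keep in mind: a label that repeats within one file (several "Investigator" lines, for example) would overwrite the earlier value in this sketch, and words hyphenated across abstract lines are joined with a space instead of being re-joined.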
azimmer
The format closely resembles that of a Lotus Notes text export. If that is indeed what it is, I recommend trying Goedeke's Notes converter (I like it a lot): http://www.goedeke.net/nte_conv_eng.html
Jul 31 '07 #2
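For the second half of the question (loading the parsed fields into a SQL database and searching them later), here is a minimal sketch using SQLite through Python's standard `sqlite3` module. The table name `awards`, its columns, and the helper names `store_award` and `search_by_title` are all hypothetical, and the original poster's actual database and schema are not stated in the thread. The record values are taken from the sample in post #1.

```python
import sqlite3

# Parsed fields for one file; keys mirror the labels in the sample record.
record = {
    "Award Number": "9123000",
    "Title": "Photochemistry of Charge Transfer Excited States",
    "NSF Org": "CHE",
    "Sponsor": "University of Florida",
}

# In-memory database for illustration; a real run would use a file or server.
conn = sqlite3.connect(":memory:")
conn.execute(
    """CREATE TABLE awards (
           award_number TEXT PRIMARY KEY,
           title        TEXT,
           nsf_org      TEXT,
           sponsor      TEXT
       )"""
)


def store_award(conn, rec):
    """Insert one parsed record, using placeholders so values are quoted safely."""
    conn.execute(
        "INSERT INTO awards (award_number, title, nsf_org, sponsor) "
        "VALUES (?, ?, ?, ?)",
        (
            rec.get("Award Number"),
            rec.get("Title"),
            rec.get("NSF Org"),
            rec.get("Sponsor"),
        ),
    )
    conn.commit()


def search_by_title(conn, term):
    """Return (award_number, title) rows whose title contains the given term."""
    cur = conn.execute(
        "SELECT award_number, title FROM awards "
        "WHERE title LIKE ? ORDER BY award_number",
        (f"%{term}%",),
    )
    return cur.fetchall()


store_award(conn, record)
rows = search_by_title(conn, "Charge Transfer")
print(rows)
```

Parameterized queries matter here because abstract text and titles can contain quotes and other characters that would break hand-built SQL strings.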
