How can I integrate my code so that it reads the text files in a parent folder that contains subfolders and files? For example, the parent folder is named cars and its subfolders are Toyota, Honda and BMW. Toyota contains the files Camry and Corolla, Honda contains the folder Accord, and BMW contains the file X5.
Is there a way to enter the name of the parent folder (cars) and search all of its subfolders (Toyota, Honda and BMW) and their files?
Please help ASAP.
My code finds the most frequent words in one text file and prints them in decreasing order of frequency,
and I want it to find the most frequent words across all text files (together) under a specific folder:

# count words in a text and show the first ten items
# by decreasing frequency

# sample text for testing

import sys
import string
import re

file = open("arb.txt", "r")
text = file.read()
file.close()

word_freq = {}

word_list = text.split()

for word in word_list:
    # word all lower case
    word = word.lower()
    # strip any trailing period or comma
    word = word.rstrip('.,/"-_;\\[]()')
    # build the dictionary
    count = word_freq.get(word, 0)
    word_freq[word] = count + 1

# create a list of (freq, word) tuples
freq_list = [(freq, word) for word, freq in word_freq.items()]

# sort the list by the first element in each tuple (default)
freq_list.sort(reverse=True)

for n, tup in enumerate(freq_list):
    # print the first ten items
    if n < 10:
        freq, word = tup
        print freq, word
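For comparison, here is a minimal sketch of the same single-text count written for Python 3 (the code above is Python 2). The function name `top_words` is made up for illustration, and stripping punctuation from both ends of each word is an assumption; the original only strips trailing characters.

```python
import string
from collections import Counter


def top_words(text, n=10):
    """Return the n most common words in text as (word, count) pairs."""
    counts = Counter()
    for raw in text.split():
        # Lower-case the token, then strip punctuation from both ends.
        word = raw.lower().strip(string.punctuation)
        if word:  # skip tokens that were nothing but punctuation
            counts[word] += 1
    # most_common() orders the pairs by count, highest first.
    return counts.most_common(n)
```

Calling `top_words(open("arb.txt").read())` would give roughly what the loop above prints, as a list of pairs instead of printed lines.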
I'm sorry, but it's very difficult to understand what it is that you are asking. I can provide you with some direction however...
Perhaps what you're looking for is os.walk. Here is a sample:

>>> import os
>>> for root, dirs, files in os.walk(os.getcwd()):
...     print 'Looking into %s' % root.split('\\')[-1]
...     print 'Found %d dirs and %d files' % (len(dirs), len(files))
...     for idx, dir in enumerate(dirs):
...         print 'Directory #%d: %s' % (idx + 1, dir)
...     for idx, file in enumerate(files):
...         print 'File #%d: %s' % (idx + 1, file)
...
Looking into pythtests
Found 2 dirs and 16 files
Directory #1: graphics
Directory #2: Question
File #1: bckmch.py
File #2: cmdtest.py
File #3: cobyla.py
File #4: elseerr.py
File #5: fileio.py
File #6: ldict.py
File #7: lid
File #8: mainbody
File #9: matrixprint.py
File #10: matrx_print.py
File #11: test.py
File #12: test2.py
File #13: topload
File #14: totalbottle
File #15: trivgame.py
File #16: wxtemplate.py
Looking into graphics
Found 0 dirs and 8 files
File #1: Buttons.py
File #2: dice_class.py
File #3: ghostchars.py
File #4: graphics.py
File #5: graphics.pyc
File #6: graphics22.py
File #7: graphics22.pyc
File #8: hw6-template.py
Looking into Question
Found 0 dirs and 0 files
>>>
Hope that helps a little bit
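To start from a named parent folder such as cars rather than the current directory, the folder name can be passed straight to os.walk. Below is a rough Python 3 sketch; the helper name `iter_files` and the `.txt` filter are assumptions for illustration, not something from the question.

```python
import os


def iter_files(parent, extensions=(".txt",)):
    """Yield the full path of every matching file under parent.

    Pass extensions=None to yield every file regardless of its name.
    """
    for root, dirs, files in os.walk(parent):
        for name in sorted(files):
            if extensions is None or name.lower().endswith(tuple(extensions)):
                # os.walk returns bare file names, so join each one with
                # root to get a path that open() can actually use.
                yield os.path.join(root, name)
```

With the layout from the question, `for path in iter_files("cars", extensions=None): print(path)` would list the files under Toyota, Honda and BMW, assuming cars sits in the current directory.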
To clarify: an example parent directory (folder) is cars, and its subdirectories (folders) are BMW, Honda and Toyota. I want to walk the parent directory and all of its subdirectories
to find the most frequent words across all text files (together) under that folder.
I did not understand what your code does.
Thanks, Mr. jlm699, your reply was helpful,
but it does not do exactly what I want.
The modified code is:

# count words in a text and show the first ten items
# by decreasing frequency

# sample text for testing

import sys
import string
import re
import os

for root, dirs, files in os.walk(os.getcwd()):
    print 'Looking into %s' % root.split('\\')[-1]
    print 'Found %d dirs and %d files' % (len(dirs), len(files))
    for idx, dir in enumerate(dirs):
        print 'Directory #%d: %s' % (idx + 1, dir)
    for idx, file in enumerate(files):
        print 'File #%d: %s' % (idx + 1, file)
        # join with root so that files inside subfolders can be opened
        ff = open(os.path.join(root, file), "r")
        text = ff.read()
        ff.close()

        # a new dictionary is created for each file
        word_freq = {}

        word_list = text.split()

        for word in word_list:
            # word all lower case
            word = word.lower()
            # strip any trailing period or comma
            word = word.rstrip('.,/"-_;\\[]()')
            # build the dictionary
            count = word_freq.get(word, 0)
            word_freq[word] = count + 1

        # create a list of (freq, word) tuples
        freq_list = [(freq, word) for word, freq in word_freq.items()]

        # sort the list by the first element in each tuple (default)
        freq_list.sort(reverse=True)

        for n, tup in enumerate(freq_list):
            # print the first ten items
            if n < 10:
                freq, word = tup
                print freq, word

The output looks like this:
File #12: listtoDict.py
14 with
6 python
6 for
File #13: parseAddresses
3 python
1 with
1 will
I need the frequency of each word across all the text files, not separately per file. For example, the previous output should be:
15 with
9 python
6 for
1 will
That is, add the word frequencies from (File #12: listtoDict.py) to those from (File #13: parseAddresses) and print them as one list.
Just move your word_freq dictionary declaration to before the for loop begins, and move the sorting and printing of that structure to after the for loop. That gives you one set of counts covering every file.
Here are the modifications I suggest above, followed by the resulting output:
import sys, os

word_freq = {}

for root, dirs, files in os.walk(os.getcwd()):
    print 'Looking into %s' % root.split('\\')[-1]
    print 'Found %d dirs and %d files' % (len(dirs), len(files))

    for idx, file in enumerate(files):
        ff = open(os.path.join(root, file), "r")
        text = ff.read()
        ff.close()

        word_list = text.strip().split()

        for word in word_list:
            word = word.lower().rstrip('.,/"-_;\\[]()')
            if word.isalpha():
                # build the dictionary
                count = word_freq.get(word, 0)
                word_freq[word] = count + 1

# create a list of (freq, word) tuples
freq_list = [(freq, word) for word, freq in word_freq.items()]

# sort the list by the first element in each tuple (default)
freq_list.sort(reverse=True)

for n, tup in enumerate(freq_list):
    # print the first ten items
    if n < 10:
        freq, word = tup
        print freq, word
Output:

Microsoft Windows XP [Version 5.1.2600]
(C) Copyright 1985-2001 Microsoft Corp.

C:\Documents and Settings\Administrator>cd Desktop\pythtests

C:\Documents and Settings\Administrator\Desktop\pythtests>python walkncount.py
Looking into pythtests
Found 2 dirs and 17 files
Looking into graphics
Found 0 dirs and 8 files
Looking into Question
Found 0 dirs and 0 files
46 the
17 and
14 of
14 a
12 is
10 to
10 in
8 you
8 this
8 that

C:\Documents and Settings\Administrator\Desktop\pythtests>
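The same idea (one counter created before the loop, sorting left until after it) might look like this in Python 3. This is only a sketch: the function name `count_words_in_tree` is hypothetical, and reading every file as UTF-8 while ignoring undecodable bytes is an assumption about the data.

```python
import os
from collections import Counter


def count_words_in_tree(parent):
    """Return a single Counter covering every file under parent."""
    totals = Counter()  # created once, before any file is read
    for root, dirs, files in os.walk(parent):
        for name in files:
            path = os.path.join(root, name)
            with open(path, "r", encoding="utf-8", errors="ignore") as fh:
                for word in fh.read().split():
                    # Same clean-up as the code above: lower-case, strip
                    # trailing punctuation, keep alphabetic words only.
                    word = word.lower().rstrip('.,/"-_;\\[]()')
                    if word.isalpha():
                        totals[word] += 1
    return totals  # ordering is left to the caller, after the loop
```

Something like `for word, freq in count_words_in_tree("cars").most_common(10): print(freq, word)` would then print the ten most frequent words across the whole folder.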
Thanks a lot,
but it actually reads all the files yet prints the frequencies for only one of them.
It does not print the word frequencies across all the files, which is what I want.
OK... I'm not sure exactly what you mean by that, but I think you're saying that you only want to display the word frequencies for the file whose most frequent word has the highest count?
import sys, os

highest_freq = [(0,'Blank')]
high_file_name = ''

for root, dirs, files in os.walk(os.getcwd()):
    # print 'Looking into %s' % root.split('\\')[-1]
    # print 'Found %d dirs and %d files' % (len(dirs), len(files))

    for idx, file in enumerate(files):
        # print 'File #%d: %s' % (idx + 1, file)
        ff = open(os.path.join(root, file), "r")
        text = ff.read()
        ff.close()

        word_freq = {}
        word_list = text.strip().split()

        for word in word_list:
            word = word.lower().rstrip('.,/"-_;\\[]()')
            if word.isalpha():
                # build the dictionary
                word_freq[word] = word_freq.get(word, 0) + 1

        # create a list of (freq, word) tuples
        freq_list = [(freq, word) for word, freq in word_freq.items()]

        # sort the list by the first element in each tuple (default)
        freq_list.sort(reverse=True)
        if freq_list:
            if freq_list[0][0] > highest_freq[0][0]:
                highest_freq = freq_list
                high_file_name = file

print 'Highest frequency file: %s' % high_file_name
for n, tup in enumerate(highest_freq):
    if n < 10:
        freq, word = tup
        print freq, word

raw_input('\nHit enter to exit')

Output:

Highest frequency file: graphics.py
93 def
44 return
36 the
31 in
29 of
26 for
25 if
23 to
19 class
19 a

Hit enter to exit

This is a crude example, so I apologize; however, I don't understand what you're trying to do or why. Working with what you've given, this is the most I can make of your question.
I really appreciate your trying to help,
but unfortunately that is not what I meant.
What I meant is: read all the files in the directory, combine all the words from all the files into a new file, then find the frequency of each word in that new file.
So basically, you're saying you want to combine the contents of all the files into a new file, and then find the frequency of the words in that file?
Well, doing that without creating a new file takes only a very slight change from a previous post:
import sys, os

word_freq = {}

for root, dirs, files in os.walk(os.getcwd()):
    print 'Looking into %s' % root.split('\\')[-1]
    print 'Found %d dirs and %d files' % (len(dirs), len(files))

    for idx, file in enumerate(files):
        ff = open(os.path.join(root, file), "r")
        text = ff.read()
        ff.close()

        word_list = text.strip().split()

        for word in word_list:
            word = word.lower().rstrip('.,/"-_;\\[]()')
            if word.isalpha():
                # build the dictionary
                count = word_freq.get(word, 0)
                word_freq[word] = count + 1

# create a list of (freq, word) tuples
freq_list = [(freq, word) for word, freq in word_freq.items()]

# sort the list by the first element in each tuple (default)
freq_list.sort(reverse=True)

for n, tup in enumerate(freq_list):
    # print the first ten items
    if n < 10:
        print "%s times: %s" % tup

raw_input('\nHit enter to exit')
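
If a combined file really is wanted on disk, the two steps could be sketched as follows in Python 3. The names `combine_files` and `word_frequencies` are invented for this example, and UTF-8 text is assumed. Note that the combined file is skipped while walking, in case it lives inside the folder being read.

```python
import os
import string
from collections import Counter


def combine_files(parent, combined_path):
    """Write the text of every file under parent into combined_path."""
    combined_abs = os.path.abspath(combined_path)
    with open(combined_abs, "w", encoding="utf-8") as out:
        for root, dirs, files in os.walk(parent):
            for name in sorted(files):
                path = os.path.abspath(os.path.join(root, name))
                if path == combined_abs:
                    continue  # never read the output file back into itself
                with open(path, "r", encoding="utf-8", errors="ignore") as fh:
                    out.write(fh.read())
                # A newline keeps the last word of one file apart from
                # the first word of the next.
                out.write("\n")
    return combined_abs


def word_frequencies(path):
    """Count the words in one text file, ignoring case and punctuation."""
    counts = Counter()
    with open(path, "r", encoding="utf-8", errors="ignore") as fh:
        for raw in fh.read().split():
            word = raw.lower().strip(string.punctuation)
            if word:
                counts[word] += 1
    return counts
```

A call such as `word_frequencies(combine_files("cars", "all_words.txt")).most_common(10)` would then give the top ten words of the combined file. The counts should match the in-memory approach above, so the extra file mainly helps if the merged text is needed for something else.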