Forums

Skip to content

Advanced search
  • Quick links
    • Unanswered topics
    • Active topics
    • Search
  • FAQ
  • Login
  • Register
  • Board index Discussion & Documentation Gentoo Forums Feedback
  • Search

new search stopwords list

Having a problem with the forums? Have a suggestion? Post here!
Post Reply
  • Print view
Advanced search
138 posts
  • Previous
  • 1
  • 2
  • 3
  • 4
  • 5
  • 6
  • Next
Author
Message
Albert_Alligator
Apprentice
Apprentice
User avatar
Posts: 193
Joined: Wed May 12, 2004 3:09 pm
Location: Okefenokee Swamp
Contact:
Contact Albert_Alligator
Website

  • Quote

Post by Albert_Alligator » Mon Jan 16, 2006 6:16 am

Hi guys and gals,

I'm not here to add fuel to the fire, but I believe that it would be prudent to have the list of omitted words on your quick search page. This will at least save searchers some time and frustration wondering why half or all of their search terms were omitted.

Gentoo can't scale vertically because the expense is much to great...the secret is to scale horizontally. GASLAMP servers or (Grid Application Server on Linux Apache MySQL and PHP) can answer your search problems. All you need to do is balance your search load over several very low cost systems. You don't need a dual or quad processor system to handle all the queries. You need around 5 to 10 300 dollar systems put together as a GASLAMP. The computers in this system do not have to be located in the same facility, hence you can reduce the expense by asking partners or already cooperating Universities to pitch in.

Before you ask, I can't help just yet (however, in this year, my budget could very well quadruple which would give me much more discretionary spending cash and open the door to such a project....just not yet.)

I do know that there are plenty of people out there with the experience and know-how that would probably help if someone in charge here with Gentoo would just ask. Open source it!!! :wink:

Just some random thoughts.

Al
As Socrates once said "I drank what?"
Top
[Lx]-=Mystify=-
Apprentice
Apprentice
Posts: 180
Joined: Tue Mar 16, 2004 3:25 pm

  • Quote

Post by [Lx]-=Mystify=- » Tue Jan 17, 2006 1:19 pm

hm get's me to the problem that i can't search for 'config.guess' while i have a problem with that file during compilation...


sorry, but this function caused me more troubles than i see benefits in it...
DARK IS NOT THE OPPISITE OF LIGHT
IT'S THE ABSENCE OF LIGHT
Top
arnolde
n00b
n00b
User avatar
Posts: 32
Joined: Thu Dec 29, 2005 12:20 am
Location: Frankfurt, Germany

  • Quote

Post by arnolde » Tue Jan 17, 2006 1:40 pm

http://www.google.de/search?q=config.gu ... gentoo.org

I wish they would add next to the quicksearch a googlequicksearch field... ("seerch this site with google") cant be that complicated...
Top
[Lx]-=Mystify=-
Apprentice
Apprentice
Posts: 180
Joined: Tue Mar 16, 2004 3:25 pm

  • Quote

Post by [Lx]-=Mystify=- » Tue Jan 17, 2006 1:59 pm

@arnolde:

yeah, very good idea...

wish i had it myself :?
DARK IS NOT THE OPPISITE OF LIGHT
IT'S THE ABSENCE OF LIGHT
Top
arnolde
n00b
n00b
User avatar
Posts: 32
Joined: Thu Dec 29, 2005 12:20 am
Location: Frankfurt, Germany

  • Quote

Post by arnolde » Tue Jan 17, 2006 2:08 pm

[Lx]-=Mystify=- wrote:wish i had it myself :?
So do I :P
Top
ZamZing
n00b
n00b
User avatar
Posts: 14
Joined: Fri Feb 17, 2006 11:06 pm
Location: Phoenix, AZ

  • Quote

Post by ZamZing » Sat Feb 18, 2006 12:08 am

I moved to Gentoo a couple of months ago and I must say that the most frustrating thing for me as a noob was searching through the forums for answers. Gobs of precious time was consumed parsing through hundreds of irrelevant posts because I could not search for an exact phrase. I thought I needed special codes to search but seeking help on that by searching for phrases similar to "Forum search" was of no help, and there was no obvious indication anywhere that the system was purposely crippled.

I hope the Gentoo folks will remedy this situation soon. Its hard enough for new users to learn the intricacies of Gentoo without the additional frustration of ineffective forum search tools.

Ian... that beta still on the way? I wholeheartedly agree with an earlier post that a Google sitesearch button is an excellent fix until a useable permanent solution if found.

C'mon guys, help us make better use of the awesome store of information in these forums!

...ZamZing
Top
Earthwings
Bodhisattva
Bodhisattva
User avatar
Posts: 7753
Joined: Mon Apr 14, 2003 8:13 pm
Location: Germany

  • Quote

Post by Earthwings » Sat Feb 18, 2006 10:59 am

Yes, it's still being worked on, but it takes time because a) it has to be tested thoroughly and b) recreating indices / converting stuff (which is part of it) alone takes days.
KDE
Top
PeGa!
Tux's lil' helper
Tux's lil' helper
User avatar
Posts: 104
Joined: Fri May 13, 2005 3:25 pm
Location: Capital Federal, Buenos Aires, Argentina
Contact:
Contact PeGa!
Website

  • Quote

Post by PeGa! » Mon Apr 03, 2006 7:10 pm

I accidentally :cry: saw that "screen" word is blocked (listed in the stoplist). I suggest for this word to be released, because i had a problem with screen command, and really it caused a great pain to solve my problem without searching that word.

Thanks in advance,

Paul.
If it moves, it needs an ebuild.

Adopt an unanswered post.
Top
DarkMind
Guru
Guru
User avatar
Posts: 525
Joined: Thu Dec 18, 2003 5:44 pm
Location: Santiago, Chile

  • Quote

Post by DarkMind » Wed Apr 26, 2006 1:51 am

very bad the filter, i search for "package.provided" and filters this and search for "provided" only... this is NOT util for me :(
Top
NaiL
Apprentice
Apprentice
User avatar
Posts: 228
Joined: Thu Feb 13, 2003 2:28 pm
Location: Spain/BCN

  • Quote

Post by NaiL » Wed May 03, 2006 12:41 pm

there are some words usefull for search.. like "default".

i think that
We've analyzed the most commonly occurring words on the forums
is not a valid mechanism.
Top
tomk
Bodhisattva
Bodhisattva
User avatar
Posts: 7221
Joined: Tue Sep 23, 2003 1:41 pm
Location: Sat in front of my computer

  • Quote

Post by tomk » Wed May 03, 2006 6:44 pm

NaiL wrote:there are some words usefull for search.. like "default".

i think that
We've analyzed the most commonly occurring words on the forums
is not a valid mechanism.
When the new stopword list was added we had to remove the most common words as the size of the search index was a massive hit on search performance, to the point that you couldn't search. When we add the new search engine we'll be having another look at which words will be in the stopwords list.
Search | Read | Answer | Report | Strip
Top
tedj
n00b
n00b
Posts: 49
Joined: Thu Jul 10, 2003 7:33 pm

  • Quote

Post by tedj » Sun May 07, 2006 4:04 pm

Can you at least tell us what it would take to have a unfiltered search? Something along the lines of "if someone donates $500 for more memory" or whatnot would be nice.
Top
devsk
Advocate
Advocate
User avatar
Posts: 3039
Joined: Fri Oct 24, 2003 1:16 am
Location: Bay Area, CA

  • Quote

Post by devsk » Fri May 12, 2006 3:08 pm

I hope someone realises that this is counter-productive. What's happening because of this filtering is that no relevant search results are being returned to the users. Hence number of duplicate threads with the same keywords are getting increased everyday, hence putting more load on the database. In this case it has become counter-productive. I would rather have all the results returned in a slow fashion to me than to find out that I posted and someone flamed me for not searching hard enough.
Top
Chaosite
Guru
Guru
User avatar
Posts: 540
Joined: Sat Dec 13, 2003 3:54 pm
Location: Right over here.

  • Quote

Post by Chaosite » Fri Jun 02, 2006 10:41 pm

tedj wrote:Can you at least tell us what it would take to have a unfiltered search? Something along the lines of "if someone donates $500 for more memory" or whatnot would be nice.
I think "If someone donates a good search algorithm that can be easily integrated with phpbb"... =/
Top
yaneurabeya
Veteran
Veteran
User avatar
Posts: 1754
Joined: Thu May 13, 2004 10:47 pm
Location: Seattle

  • Quote

Post by yaneurabeya » Wed Jun 21, 2006 5:08 pm

What database method do you guys use-assuming that you run MySQL-MyISAM or Innodb?
Top
tomk
Bodhisattva
Bodhisattva
User avatar
Posts: 7221
Joined: Tue Sep 23, 2003 1:41 pm
Location: Sat in front of my computer

  • Quote

Post by tomk » Wed Jun 21, 2006 6:03 pm

yaneurabeya wrote:What database method do you guys use-assuming that you run MySQL-MyISAM or Innodb?
Currently we're using InnoDB, although the new search engine will have some MyISAM tables.
Search | Read | Answer | Report | Strip
Top
renrutal
Tux's lil' helper
Tux's lil' helper
User avatar
Posts: 135
Joined: Sat Mar 26, 2005 3:28 am
Location: Brazil

  • Quote

Post by renrutal » Mon Jul 24, 2006 4:19 am

Are 2-3 years old posts that aren't in gentoo-wiki any useful for anyone searching for recent problems?

Ate least try to make the 1-year option in the search the default one.
Top
GentooMik
Tux's lil' helper
Tux's lil' helper
User avatar
Posts: 141
Joined: Thu Nov 25, 2004 12:17 am

  • Quote

Post by GentooMik » Mon Sep 04, 2006 4:40 am

Their is an issue where typing in exact error messages do not return any good results. It is understandable that not enough resources. What is not understandable is how to fix this? That is having relevant results and same time minimize the load. So far it is getting harder and harder to find answers without duplicate posting. Personally, I would try to avoid double posting. But if using the search engine does not bring relevant results it has to be posted again. This may have been rehashed and sorry for adding more. However, this is not really whining just re-bringing
reemphasizing" attention for the admins. That I think their is relevancy issues with more stop words being added. Although looking at that list some words are amateurish and OK. But sometimes simple words just add up. Hope others agree with me that the relevance problems exist and not my imagination.

P.S: Using googles site search for the forums is not always updated and not intusive.

See ya around in the gentoo land. 8)
Attempt to Treat others, the way you would like to be treated yourself.
Top
manther
n00b
n00b
Posts: 1
Joined: Tue Sep 19, 2006 11:50 pm
Location: Hartford, CT

  • Quote

Post by manther » Wed Sep 20, 2006 12:07 am

masseya wrote:
cokehabit wrote:I think "compile" and "error(s)" should be taken out, after all this is gentoo not SuSE
The reason these words are on the list is that they are too commonly appearing to actually be of use in identifying a particular thread. There are so many posts with the words 'compile' or 'error' that it's not a useful descriptor. If I were trying to describe myself to you so you could pick me out of a crowd at an amusement park I would want to avoid a description such as "medium height with blue jeans, sneakers and a tshirt" because it wouldn't really tell you anything that would set me apart from virtually everyone else. This is essentially the kind of description you get when searching for the words 'compile' and 'error'.
I'm sure the guys at google would have something else to say about that. Leaving words out like 'compile' just makes other words useless. If I want to know how to 'compile kernel 2.6.18' I don't want to see every entry in the database having to do with the kernel, without filtering out the ones that don't even mention the word 'compile'. You're just turning it around, so instead of searching for a guy in 'blue jeans', you're just searching for the guy in a 'red polo shirt', of which there could be just as many, only most of them are wearing khaki's or something. It's still just as hard for the user to search through.... There has got to be a better solution to your problem.

How about working on a solution that uses certain key words/phrases from a users new post and automatically searches for them, bringing up a set of possible matches for what the user is looking for on the preview page. Then if the user sees something useful, they won't hit the submit button, and won't post the duplicate.
Top
DaysSinceTheDoor
n00b
n00b
Posts: 7
Joined: Sat Oct 30, 2004 1:08 am

  • Quote

Post by DaysSinceTheDoor » Wed Oct 11, 2006 4:29 pm

I would just like to say I have used gentoo for since 2003 when I was in college. I have moved on to a nice job doing web application development. It took me about four months to convince my boss that Gentoo was the perfect platform to run our systems on. Now it is about three months later, and we are running into Gentoo specific problems. Searching these forums for error messages turns up jack because half of the words get dropped from the search. Now I can understand the whole search index DB getting huge problem as I have set up site crawlers in the past. However in this case I would say get a better server and get a better search algorithm. I know that there are ones out there that do not index on words like "a" , "but" , "the" , etc but when it gets back the search for the terms not on the black list it searches through the search results for the dropped words and moves those to the top. The lack of good result for errors is basically causing these forums two major problems. First it is causing duplicate posts which inturn leads to monitors having to move posts and point out that issues have been discused and possibly solved elsewhere wasting their time and resources. And second it is removing Gentoo's crediblity as a good plateform for business. My boss is seriously thinking of switching the servers over to SuSE :? because of the lack of good support. My boss (like i hope many others who use them) is usually very good about supporting open source projects with donations of both time and money but only when he has faith in a project. Luckily someone here pointed out that google lets you do site:forums.gentoo.org and I am now getting good search results for our errors. I seriously suggest rethinking the search on site and finding a better option, or your problems will likely get worse.
Top
Element Dave
Tux's lil' helper
Tux's lil' helper
Posts: 82
Joined: Fri Nov 10, 2006 5:54 pm

  • Quote

Post by Element Dave » Sat Nov 11, 2006 8:39 pm

I am in utter disbelief at the decision to exclude terms from the search index based on their frequency. Only the USER himself can determine relevancy of the results returned; administrators cannot. A far better solution to the problem would have been to disable searching altogether until it can be resolved without limiting functionality. The current method only reinforces the stereotype that Gentoo users are "penny wise and pound foolish": search engine speed may have improved, but it takes much longer to find the desired information.

As others have pointed out, the current implementation is also deceptive to its users because they are not informed that their search terms have been ignored. At the very least, it should be made clear on the search form that various words are ignored, and a link to the list should be provided.
Top
marijuana
n00b
n00b
Posts: 24
Joined: Tue Jan 16, 2007 1:03 pm

yes but.. sucks..

  • Quote

Post by marijuana » Wed Jan 17, 2007 12:39 pm

i dunno why, probably this things to make unsearchable some keywords does it's part, but.. the search engine of this forums.. really SUCKS.
i don't wanna flame.. but all we know that this is true.
is this directly riconducible to this keywords?
please.. do a second or a third or an n-th thougt.. in this forums there are very important information.. but often unreachable. i can use keywords site:forums.gentoo.org on google but more updated posts may be missing.
is important that if all we spend our time posting asking and answering questions our work help more people as possible.
if the problem isn't just this.. please.. do something. it's frustrating the message: no result found while the forum is full of answers.
-> the account is virgin.. but i'm a long time gentoo user and i always found forum search unproductive.
http://sconosciuto.wordpress.com - > my blog (in italiano)
spam.nonemail@gmail.com -> spambot honeypot
Top
devsk
Advocate
Advocate
User avatar
Posts: 3039
Joined: Fri Oct 24, 2003 1:16 am
Location: Bay Area, CA

  • Quote

Post by devsk » Wed Jan 17, 2007 4:01 pm

there are two problems with the current approach:

1. the relevancy of a stopped word in the context of the other search words is doomed this way. Its plain wrong to exclude any search word just because they occur too many times.
2. the duplicate posts are adding to the server load and index sizes, not reducing them.

Someone needs to restore sanity and bring the usefulness of these forums back.
Top
Satori80
Tux's lil' helper
Tux's lil' helper
Posts: 137
Joined: Tue Feb 24, 2004 6:33 pm

  • Quote

Post by Satori80 » Mon Feb 19, 2007 3:20 pm

devsk wrote: the relevancy of a stopped word in the context of the other search words is doomed this way. Its plain wrong to exclude any search word just because they occur too many times.
That's pretty much what I said two years ago. It hasn't gotten any better since then.

I never thought I'd hear myself say this but; God, I wish these forums were slow! If that's the downside of bringing back the functionality, so be it. At least this would be useful again.

Admins: I understand you are looking for an alternative, and I really do appreciate the significant amount of work that entails. But this stop word list is completely unreasonable even as stopgap measure.

And I still don't see a warning near any search input area that one's search isn't likely to produce anything viable. When a user sees the words "Search for all terms" they really do expect the engine to "search for all terms." Strangely enough. Common guys. It's been more than long enough for someone to at least issue a prominent warning for the poor user.
Top
Raenk
n00b
n00b
User avatar
Posts: 64
Joined: Thu Mar 04, 2004 6:27 am

  • Quote

Post by Raenk » Fri Mar 23, 2007 8:49 pm

I just wanna add my voice to those petitions. Search engine works terrible. Hence duplicated threads or no solutions found whatsoever.

Lots of information are being hidden and wasted. Gentoo community information has been always top notch, but now is getting more and more unreachable.
Linux Registered User #319092
Gentoo User since v1.4
Top
Post Reply
  • Print view

138 posts
  • Previous
  • 1
  • 2
  • 3
  • 4
  • 5
  • 6
  • Next

Return to “Gentoo Forums Feedback”

Jump to
  • Assistance
  • ↳   News & Announcements
  • ↳   Frequently Asked Questions
  • ↳   Installing Gentoo
  • ↳   Multimedia
  • ↳   Desktop Environments
  • ↳   Networking & Security
  • ↳   Kernel & Hardware
  • ↳   Portage & Programming
  • ↳   Gamers & Players
  • ↳   Other Things Gentoo
  • ↳   Unsupported Software
  • Discussion & Documentation
  • ↳   Documentation, Tips & Tricks
  • ↳   Gentoo Chat
  • ↳   Gentoo Forums Feedback
  • ↳   Duplicate Threads
  • International Gentoo Users
  • ↳   中文 (Chinese)
  • ↳   Dutch
  • ↳   Finnish
  • ↳   French
  • ↳   Deutsches Forum (German)
  • ↳   Diskussionsforum
  • ↳   Deutsche Dokumentation
  • ↳   Greek
  • ↳   Forum italiano (Italian)
  • ↳   Forum di discussione italiano
  • ↳   Risorse italiane (documentazione e tools)
  • ↳   Polskie forum (Polish)
  • ↳   Instalacja i sprzęt
  • ↳   Polish OTW
  • ↳   Portuguese
  • ↳   Documentação, Ferramentas e Dicas
  • ↳   Russian
  • ↳   Scandinavian
  • ↳   Spanish
  • ↳   Other Languages
  • Architectures & Platforms
  • ↳   Gentoo on ARM
  • ↳   Gentoo on PPC
  • ↳   Gentoo on Sparc
  • ↳   Gentoo on Alternative Architectures
  • ↳   Gentoo on AMD64
  • ↳   Gentoo for Mac OS X (Portage for Mac OS X)
  • Board index
  • All times are UTC
  • Delete cookies

© 2001–2026 Gentoo Foundation, Inc.

Powered by phpBB® Forum Software © phpBB Limited

Privacy Policy

 

 

magic