<Weird unicode characters thread issue>
<Weird unicode characters thread issue>
[Moderator note: this user was given a warning on his account and then decided to bail and delete all his previous entries. What follows are the replies to a site issue with odd characters.]
Last edited by Orca on Fri May 05, 2017 4:45 pm, edited 3 times in total.
Re: "БеÑÐ¿Ð»Ð°Ñ‚Ð½Ð°Ñ Ð²ÐµÑ€ÑÐ¸Ñ pc tools antivirus":
It seems like you all are popular with foreigners then. I think that box should support unicode if that's the case.
Re: "БеÑÐ¿Ð»Ð°Ñ‚Ð½Ð°Ñ Ð²ÐµÑ€ÑÐ¸Ñ pc tools antivirus":
Already brought up here. It's the lack of Unicode support causing the mangled characters, and since the box now displays most popular for the day not just all time (since that was rather static).
I do wonder about the potential for abuse though if spammers game the search queries to place random software/ads in the popular search items.
I do wonder about the potential for abuse though if spammers game the search queries to place random software/ads in the popular search items.
- Andrew Lee
- Posts: 3083
- Joined: Sat Feb 04, 2006 9:19 am
- Contact:
Re: �-zip?
Will look into this to find out what's going on...
Weird unicode characters thread issue
This topic not visible on main page and not clickable:
that's because of "< >" around title?- Andrew Lee
- Posts: 3083
- Joined: Sat Feb 04, 2006 9:19 am
- Contact:
Re: <Weird unicode characters thread issue>
Fixed. Thanks for pointing this out.This topic not visible on main page and not clickable:
Re: <Weird unicode characters thread issue>
Sounds believable to me. TPFC is such a small site that a handful of Russian visitors can probably bring such a search near the top.Orca wrote:At TPFC, "Бесплатная версия pc tools antivirus" is a popular search.
Really? I wouldn't have thought so.
My YouTube channel | Release date of my 13th playlist: August 24, 2020
-
- Posts: 26
- Joined: Sat Jan 07, 2017 8:27 pm
Re: <Weird unicode characters thread issue>
Such generic URLs can turn out to be a popular "search term" because other than text queries via the search box, it seems that TPFC's search engine is also capturing user clicks on hyperlinks & buttons.Orca wrote:At TPFC, "[markb /forums/ucp.php?mode=login" is a popular search. Really? I wouldn't have thought so.
Note: If interested, just mouseover the below sample generic URLs to view them. Don't click on them to make them even more "popular" !
The URL syntax in question suggests that a TPFC registered user "markb" might have been trying to login today &/or yesterday from TPFC's homepage or forum index page. (And TPFC does have a user named MarkB who had previously commented on various TPFC's software pages.) Perhaps he encountered repeated login failures (eg. wrong password, or forgot to allow session cookies), & kept clicking the login button upon every page refresh. And TPFC's search engine duly captured all his clicks.
Google Search likewise captured one of MarkB's login clicks on 09 Jan 2017 (this time from pg 5 of TPFC's software pages). Screenshot: Furthermore, Google Search also captured MarkB's clicks on various occasions when he rated different software (Eg 1: 29 Dec 2016 | Eg 2: 11 Jan 2017 | Eg 3: 15 Jan 2017) whilst browsing through TPFC's software index pages.
To be fair, TPFC's & Google's search engines appear to capture all visitors' & registered users' clicks on hyperlinks & buttons. Except that most URLs don't receive numerous clicks within a short span of time, so these URLs (eg. the aforementioned software rating clicks) are deemed "not popular" by the search algorithm, are buried way down in search results, & hence don't usually come to attention.
Re: <Weird unicode characters thread issue>
That's not a problem.HairyPorter wrote: Note: If interested, just mouseover the below sample generic URLs to view them. Don't click on them to make them even more "popular" !
https://www.portablefreeware.com/forums ... 614#p83614
Andrew Lee wrote:The "u=0" parameter in those links ensure that they do not count towards the stats. I even double-checked again to make sure there isn't a bug in the code.
My YouTube channel | Release date of my 13th playlist: August 24, 2020
-
- Posts: 26
- Joined: Sat Jan 07, 2017 8:27 pm
Re: <Weird unicode characters thread issue>
@SYSTEM -- Thanks for the info about TPFC's "u=0" parameter. Based on above description, I assume "u=0" is supposed to work the same way as the standard rel="nofollow" HTML attribute.SYSTEM wrote:https://www.portablefreeware.com/forums ... 614#p83614Andrew Lee wrote:The "u=0" parameter in those links ensure that they do not count towards the stats. I even double-checked again to make sure there isn't a bug in the code.
1) But why is TPFC's search engine apparently ignoring the "u=0" parameter currently hardcoded into TPFC's 'Popular Searches' links, as well as functional links such as those related to Login/ Rate/ Register etc. ? As implied by the recently-indexed "user login" URL example (which does have "u=0" appended to it), TPFC's search engine seems to be following & indexing user clicks on Login/ Rate buttons, to the point that the functional clicks of a persistent TPFC user managed to get ranked highly in TPFC's 'Popular Searches'.
In contrast, it is understandable that Google Search & other search engines are ignoring "u=0", since it is not the HTML standard. Hence the long list of TPFC functional-click URLs stored in their search indices.
2) Based on brief research, other phpBB-powered forums & websites appear to be using rel="nofollow" instead to make their internal &/or external links automatically obey that directive. Egs:-
- How to Make Specific Links Nofollow In PHPBB Using BBCode? (24x7 Forum)
- How to Make phpbb3 Forum Links Automatically rel=”nofollow” (EddieOnEverything)
Related Issue: Use rel="nofollow" for Specific Links (Google Webmasters)
On the other hand, I can't find any phpBB documentation, examples of phpBB-powered sites, or any non-phpBB website using "u=0" for this purpose.Google Webmasters Search Console Help Center wrote: Crawl Prioritization: Search engine robots can't sign in or register as a member on your forum, so there's no reason to invite Googlebot to follow "register here" or "sign in" links. Using nofollow on these links enables Googlebot to crawl other pages you'd prefer to see in Google's index.
How did this "u=0" parameter come about ? Is it some special code unique to TPFC's backend ? More importantly, is it working as it should ?
Re: <Weird unicode characters thread issue>
Yes, "u=0" is unique to the TPFC backend written in PHP.HairyPorter wrote:How did this "u=0" parameter come about ? Is it some special code unique to TPFC's backend ? More importantly, is it working as it should ?
In the post I quoted, Andrew said that he had double-checked that "u=0" works correctly.
No, not really. rel="nofollow" advises search engines not to index the link. "u=0" tells TPFC code not to count the search towards the search popularity statistics.HairyPorter wrote:@SYSTEM -- Thanks for the info about TPFC's "u=0" parameter. Based on above description, I assume "u=0" is supposed to work the same way as the standard rel="nofollow" HTML attribute.SYSTEM wrote:https://www.portablefreeware.com/forums ... 614#p83614Andrew Lee wrote:The "u=0" parameter in those links ensure that they do not count towards the stats. I even double-checked again to make sure there isn't a bug in the code.
What is happening is that the Popular Searches box appends the "u=0" parameter. The searches the TPFC code indexes are without "u=0". However, when the Popular Searches box shows the most popular searches, then it adds "u=0" to prevent a feedback loop.HairyPorter wrote: 1) But why is TPFC's search engine apparently ignoring the "u=0" parameter currently hardcoded into TPFC's 'Popular Searches' links, as well as functional links such as those related to Login/ Rate/ Register etc. ? As implied by the recently-indexed "user login" URL example (which does have "u=0" appended to it), TPFC's search engine seems to be following & indexing user clicks on Login/ Rate buttons, to the point that the functional clicks of a persistent TPFC user managed to get ranked highly in TPFC's 'Popular Searches'.
Following and indexing clicks on login and rate buttons sounds like a believable (although very strange) explanation.
TPFC can't use rel="nofollow" here. It's an HTML attribute. TPFC search code (written in PHP) can't know if the visitor triggered the search by clicking a link that has a rel="nofollow" attribute.HairyPorter wrote: 2) Based on brief research, other phpBB-powered forums & websites appear to be using rel="nofollow" instead to make their internal &/or external links automatically obey that directive. Egs:-
- How to Make Specific Links Nofollow In PHPBB Using BBCode? (24x7 Forum)
- How to Make phpbb3 Forum Links Automatically rel=”nofollow” (EddieOnEverything)
Related Issue: Use rel="nofollow" for Specific Links (Google Webmasters)On the other hand, I can't find any phpBB documentation, examples of phpBB-powered sites, or any non-phpBB website using "u=0" for this purpose.Google Webmasters Search Console Help Center wrote: Crawl Prioritization: Search engine robots can't sign in or register as a member on your forum, so there's no reason to invite Googlebot to follow "register here" or "sign in" links. Using nofollow on these links enables Googlebot to crawl other pages you'd prefer to see in Google's index.
My YouTube channel | Release date of my 13th playlist: August 24, 2020
- Andrew Lee
- Posts: 3083
- Joined: Sat Feb 04, 2006 9:19 am
- Contact:
Re: <Weird unicode characters thread issue>
I have verified again that "u=0" works as intended, and clicking on the "Popular searches" links does not create a feedback loop.
Maybe the current window of 1 day is too short and create all kinds of spurious results. Markb's careless romping with an hour is enough to skew the stats.
Should we increase the stats window to 3 or 5 days to smooth things out?
[EDIT] Stats window increased to 3 days.
Maybe the current window of 1 day is too short and create all kinds of spurious results. Markb's careless romping with an hour is enough to skew the stats.
Should we increase the stats window to 3 or 5 days to smooth things out?
[EDIT] Stats window increased to 3 days.
Re: <Weird unicode characters thread issue>
Sounds good to me.Andrew Lee wrote:Should we increase the stats window to 3 or 5 days to smooth things out?
My YouTube channel | Release date of my 13th playlist: August 24, 2020
- __philippe
- Posts: 687
- Joined: Wed Jun 26, 2013 2:09 am
Weird results in the Popular Searches box
Improbable current "Popular Searches"...
[markb /forums/ucp.php?mode=login
[markb /forums/forums/ucp.php?
mode=login
[markb /forums/?p=2[markb /forums/?p=5
[markb /forums/?p=4[markb /forums/?p=3
...