seo,search-engine,robots.txt,google-crawlers , Disallow specific folders in robots.txt with wildcards


Disallow specific folders in robots.txt with wildcards

Question:

Tag: seo,search-engine,robots.txt,google-crawlers

Can i hide specific folders from crawlers with wildcards like:

 User-agent: *
 Disallow: /system/
 Disallow: /v*

I want to hide all folders starts with "v" character. It will work this way?


Answer:

You don't need wildcards at all for this. Your example will work, but it would work just as well without the wildcard. Trailing wildcards do not do anything useful.

For example, this:

Disallow: /x

means: "Block any path that starts with '/x', followed by zero or more characters."

And this:

Disallow: /x*

means: "Block any path that starts with '/x', followed by zero or more characters, followed by zero or more characters."

This is redundant, and it blocks all of the same things the first one blocks. The only practical difference is that the second version will fail to work on crawlers that don't support wildcards.


Related:


ERROR: index 'products': too many string attributes (current index format allows up to 4 GB)


database,indexing,full-text-search,search-engine,sphinx
Got this when tried to index table in database with 25GB of data. Sphinx contains index declaration with following fields: sql_field_string = field_indexer #some keywords sql_field_string = product_name sql_field_string = description sql_attr_float = price sql_field_string = product_url sql_field_string = image_url sql_field_string = sku sql_attr_uint = merchant_id sql_attr_uint = network_id All...

proper use link hreflang


html,seo,multiple-languages
I use below link tag for multiple language, have two question should I add <link href="http://domain.com/" rel="canonical"> in url http://domain.com/?hl=en or http://domain.com/?hl=es? should I add <html lang='en'> in url http://domain.com/?hl=en and <html lang='es'> in http://domain.com/?hl=es? <head> <!-- … --> <link href="http://domain.com/" hreflang="x-default" rel="alternate"> <link href="http://domain.com/?hl=en" hreflang="en" rel="alternate"> <link href="http://domain.com/?hl=es"...

Canonical url for google to prevent duplicate meta?


seo,meta-tags,google-webmaster-tools
Today i went to Google Webmaster Tools to check for duplicate meta description. On almost all my news pages, this is true cause my rss feeds links to the news piece with a parameter (?rs=rss) so i can track my traffic from rss feeds. I thought the following snippet would...

SEO with single page application


node.js,seo,single-page-application
I built a node.js social networking web site for noders but I get some serious problems to improve my SEO factors. How can I make it SEO friendly? And yes, given that it is a single page app, it is a little harder!

Disallow specific folders in robots.txt with wildcards


seo,search-engine,robots.txt,google-crawlers
Can i hide specific folders from crawlers with wildcards like: User-agent: * Disallow: /system/ Disallow: /v* I want to hide all folders starts with "v" character. It will work this way?...

Can anyone help me make the search bar work as I now have the JS prompt? [on hold]


javascript,html5,search,youtube-api,search-engine
I have created a small program that pulls from the YouTube API which allows you to search for a random video for whatever title you enter when prompted. My goal is to have this work like a search engine. I would like to make my search bar the input instead...

Heading order in HTML5


html5,seo,semantic-markup
This is a webpage example of my site: <html> <title> artilce header </title> <body> <header> <h1> nme of website</h1></header> <section> <h2> name of section</h2> <article> <h3>article header</h3> </article> </section> </body> </html> I want to know if this order is correct? Or does it maybe have a bad effect on SEO?...

Wordpress - Robotx.txt allows admin login?


wordpress,seo,robots.txt
First, i've searched by robots.txt for Wordpress, but, no one told me where is this file. So, I read that the robots.txt in Wordpress is virtual. Ok, no problem. But, where i find this to edit? My Wordpress is allowing the /author/admin and i don't want this. In dashboard, the...

Removing the number of first page in Yii2 Pagination from the URL


.htaccess,pagination,seo,yii2
For SEO purposes I need to remove the first page number from the URL. i.e I have the following: example.com/pages/view/1 and example.com/pages/view the two URLs points to the same contents of the view action. I want to make the pagination free from 1 in the URL. i.e first Page link...

SEO and tags with JavaScript functionality


javascript,html,twitter-bootstrap,seo
Since we are diving into SEO guidelines the past weeks we came across a question for which we didn't find a satisfying answer. (We simply didn't agree on this topic). We would like more opinions on this. Since many projects use jQuery and Bootstrap lately, anchor tags often get used...

disqus SEO google crawler doesn't load comments


seo,disqus
I see in google webmaster We were unable to load Disqus. If you are a moderator please see our troubleshooting guide. instead of comments. But i read in the Internet, disqus comments are readable by google Crawler. As i understand to show "We were unable to load..." google had to...

How do I strip out ?_escaped_fragment_= using .htaccess


ajax,.htaccess,mod-rewrite,seo
Google discovered that I'm allowing end users to navigate my content using ajax loading, and is loading my pages as a user client rather than requesting them as new page loads. So instead of trying to index www.mysite.com/page, it's requesting www.mysite.com/?_escaped_fragment_=/page Which is not at all what I want it...

How can I get better google indexing results?


seo,google-search,pagerank
I have just launched a new domain, www.nextlevelsmf.com and it's not showing for some keywords I would like it to. Can anyone give me some advice to help it rank better please? I'd like it to show on the first 2 pages for: Managed SMF hosting SMF host/hosting Managed SMF...

si​tem​ap-​tax​-po​st_​tag​.xm​l not found - webmaster tools


wordpress,seo
I'm a newbie in webmaster tools. I get 3 errors in webmaster tools: 1.2: We encountered an error while trying to access your Sitemap. Please ensure your Sitemap follows our guidelines and can be accessed at the location you provided and then resubmit. *General HTTP error: 404 not found Sitemap:...

How Google “distinguishes” website articles from news? [closed]


html,seo,google-search
When I search keywords from Google, it shows all articles related to these words and It has separate tab called "News", where Google shows related news. How Goolge "Knows" that article from site is about News? I have opened source codes of multiple news websites and they has "itemprops" in...

How do I retain search engine accessibility when loading external file into html


javascript,jquery,html,seo,cross-platform
I plan to: $('#main.sidecontent').load('/sidecontent.html'); and insert: <div class="sidecontent" id="main"></div> a div on several html pages since the content of this div is the same and I try to straighten my workflow, basically avoiding editing every single html page when this content div changes. My questions: Is this method recommended for...

Search box/field design with multiple search locations


python,search,design,search-engine,pyramid
Not sure if this question is better suited for a different StackExchange site but, here goes: I have a search page that searches a number of different type of things. All (at the moment) requiring a different input field for each type of search. For example, one might search for...

Convert MVC Website to be Responsive


mobile,model-view-controller,seo
While investigating the Google SEO issue with respect to not having a mobile web site. I've learned recently that lack of proper mobile site would reduce the search ratings, and a different content for a mobile and a desktop sites might hurt your SEO as well. Currently we have a...

Natural Language Search (user intent search)


nlp,search-engine,keyword,voice-recognition,naturallyspeaking
I'm trying to build a search engine that allows my users to search with natural language commands, just like Google Now. Except, my search engine is slightly more constrained, in that it is mainly going to be used within an e-commerce site, and allow the users to search for certain...

Block “cloner” servers rendering content from our server


apache,seo,clone,cracking
I have a website of mine (freeofficefinder.com) that is being cloned (see here: thelawyerserviceratings.org) There are actually over 25 websites that are currently cloning our website. Obviously this is hurting our SEO ranking greatly due to "duplicate content". Is there something that I could add to the Apache config file...

Schema.org mandatory fields and the time needed until Google shows changes


seo,schema.org,google-rich-snippets
I have implemented Schema.org (using Microdata) inside my product pages and when I check Google Webmaster Tools it is crawled by Googlebot and interpreted successfully. The point is I have not implemented some properties inside Product type like brand. I need to know whether there is some subset of all...

How can I tell if prerender.io is running correctly on modulus.io?


meteor,seo,prerender,modulus.io
UPDATE I am able to install prerender on the modulus server now. BUT there is a problem with where to place the prerender token: app.use(require('prerender-node').set('prerenderToken', 'YOUR_TOKEN')); Where in the .demeteorized node app does this line go? I am running a meteor app on modulus.io I have installed the https://github.com/prerender/prerender-node package....

Robots.txt file in MVC.NET 4


asp.net,asp.net-mvc-4,seo,robots.txt
I have read an article about ignoring the robots from some url in my ASP MVC.NET project. In his article author said that we should add some action in some off controllers like this. In this example he adds the action to the Home Controller: #region -- Robots() Method --...

Where can I find a corpus of search engine queries?


nlp,search-engine,google-search,bing
I'm interested in training a question-answering system on top of user-generated search queries but so far it looks like such data is not made available. Are there some research centers or industry labs that have compiled corpora of search-engine queries?

WooRank Touchscreen Readiness Failing


responsive-design,seo,responsiveness
I'm currently using the WooRank website to improve my sites SEO> However, it keeps failing on "Touchscreen Readiness". It states my buttons should be 40px in height/width which they appear to be.. but it's failing. Any advice? URL: http://www.answer-hut.com...

How seo implemented for distancebetween.com website?


seo,google-search
When I Search for distance between bangalore to mumbai in Google, distancebetween.com comes up in the search results. I mean if I search for distance between any source to destination they have results for that. They have one dynamic page where user can enter source and destination and those inputs...

How to avoid the multiple path to same file in php using htaccess?


php,html,apache,.htaccess,seo
My url is www.abc.com/cbn/either/index.php and I want to access this by only current url. When I change the path of the index.php file, i.e. www.abc.com/cbn/index.php Still, I am accessing index.php file which is a bad approach for SEO point of view because now google will index two URLs of the...

Site name in Google search results for multi-language websites


html,seo,schema.org,google-rich-snippets
Using Schema.org, I can set the name for my website so it’s visible in Google Search: https://developers.google.com/structured-data/site-name Example: <script type="application/ld+json"> { "@context" : "http://schema.org", "@type" : "WebSite", "name" : "Your WebSite Name", "alternateName" : "An alternative name for your WebSite", "url" : "http://www.your-site.com" } </script> What if I have multi-language...

how to create direct link to cms articles


php,mysql,seo,content-management-system
I'm creating a simple CMS with a table called 'articles' that all the articles are stored in it. In my AddArticle.php page on the website I have a form that creates the articles and feeds the table. Now I'm confused with article linking. As far as I know a CMS...

Change the unique generated title names of friendly-id using attribute of another table


ruby-on-rails,ruby,seo,friendly-url,friendly-id
I have a Company Model, and i am using friendly_id like this friendly_id :name, use: :slugged But since there can be many Company with the same name (different branches). I am trying to handle that case by using city attribute from the address of the Company. But the Company address...

Multiple modals with galleries vs. a single dynamic one


javascript,dom,seo,image-gallery,bootstrap-modal
Lets say we have a long list of posts on a single page. Each of those posts has a hidden div with multiple img tags inside it. When a user clicks on the post, the images inside the hidden div should be showcased in a modal gallery. Which approach is...

fullPage.js: Make all slides and sections visible in search engine results


jquery,seo,web-crawler,single-page-application,fullpage.js
I'm using fullpage.js jQuery plugin for a Single page application. I'm using mostly default settings and the plugin works like a charm. When I got to the SEO though I couldn't properly make Google crawl my website on a "per slide" basis. All my slides are loaded at the page...

My website Images not indexed by Google, Yahoo and Bing [closed]


php,codeigniter,seo
I'm using codeigniter framework. why Search Engine's not indexed my website images ? My website has been made since 2013. My website is : www.shadyab.com. It likes groupon website(Offering daily deals at restaurants, retailers and service providers.). An image url : http://www.shadyab.com/assests/images/upload/kaktoos4.jpg What should I do to tell search engines...

Slidershow jquery and convert to css


jquery,css,html5,seo,slider
I downloaded script for slider show and it work without problems but after implemented this slide show i have problems with seo optimalization in HTML5. Because this code using this <div u=""> or <img u=""> and its still write me that i cant use this combination div with tag "u"....

Server side vs client side website


javascript,html,ajax,html5,seo
We have to build a website and we have to chose where to manage the content, in the server (PHP or JSP) or on the client (JavaScript). This article: http://searchenginewatch.com/sew/how-to/2358775/seo-strategies-for-javascript-heavy-single-page-applications-or-ajax-sites enlighted me a bit but Im still doubting. Good SEO is the most important thing to achieve. Can anyone relate...

What should be the name of the sitemap file for Google SEO?


seo,sitemap,google-search
I created a sitemap for my website that contains the below code: <?xml version="1.0" encoding="UTF-8"?> <urlset xmlns="http://www.sitemaps.org/schemas/sitemap/0.9" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xsi:schemaLocation="http://www.sitemaps.org/schemas/sitemap/0.9 http://www.sitemaps.org/schemas/sitemap/0.9/sitemap.xsd"> <url> <loc>http://www.example.com/</loc> </url> <url> <loc>http://www.example.com/aboutus.html</loc> </url> <url>...

Wordpress - customized pages with blocks - prohibit google seo index of blocks


wordpress,seo,woocommerce,robots.txt,google-sitemap
I'm using Wordpress and WooCommerce for my online shop. With the theme I'm using you can customize the product-category pages by adding "blocks". So if I want to have a text on the top of a product category page I simply create a block page, lets say its called "category-info"....

Best JSON-LD practices: using multiple