Loading ...
Sorry, an error occurred while loading the content.

site: syntax broken

Expand Messages
  • Desilets, Alain
    Some time last week, it seems Yahoo BOSS stopped processing the site: syntax properly. Basically, it returns hits from sites that contain the requested string,
    Message 1 of 4 , Feb 18, 2013
    • 1 Attachment
    • 9 KB
    Some time last week, it seems Yahoo BOSS stopped processing the site: syntax properly. Basically, it returns hits from sites that contain the requested string, but NOT NECESSARILY IN THE DOMAIN NAME part of the URL.
    For example, the following URL, which used to only return hits from hc-sc.gc.ca, now returns hits from a number of other sites.

    URL: http://yboss.yahooapis.com/ysearch/web?count=50&format=json&oauth_consumer_key=XXXXXX&oauth_nonce=XXXXX&oauth_signature=XXXXX&oauth_signature_method=HMAC-SHA1&oauth_timestamp=XXXX&oauth_version=1.0&q=%22Multi-dialectal%22%20language%3Aen&sites=hc-sc.gc.ca&start=0&type=html%2Cpdf

    Below is the list of sites for which Yahoo returned hits for the above query:

    ': { # HASH: : <*** DIFF FOUND HERE ***>foxwebspy.com => 1: : hc-sc.gc.ca => 1: : localwebitext => 1: : nerdydata.com => 1: : pppics.com => 1: : sec.ras-sad.hc-sc.gc.ca => 1: : torontohomesnow.beautifultorontohomes.com => 1: : webprod.hc-sc.gc.ca => 1: : webprod3.hc-sc.gc.ca => 1: : webstats-ranks.com => 1: : www.city32.com => 1: : www.consultations.hc-sc.gc.ca => 1: : www.ewhois.com => 1: : www.gserp.com => 1: : www.hc-sc.gc.ca => 1: : www.hc-sc.gc.ca.ipaddress.com => 1: : www.mywot.com => 1: : www.pageglimpse.com => 1: : www.seoprofiler.com => 1: : www.showsiteinfo.net => 1: : www.siteadvisor.com => 1: : www.siteglimpse.com => 1: : www.websecurityguard.com => 1: : www.xomreviews.com => 1: }'

    Looking at the content of the response, I notice that the hits found by Yahoo BOSS do have hc-sc.gc.ca in their URL. It's just not in the domain name part of the URL.

    For example, it returned the following URL:

    http:\/\/foxwebspy.com\/www\/hc-sc.gc.ca

    While I can imagine that some people might want to search for URLs that contain a particular string, I think this should be specified with another syntax, say inurl:. The site: syntax should only look in the domain name part of the URL no?

    Can you fix that as soon as possible? It's introducing serious bugs in our application.

    Thanks


    Alain Désilets

    Agent de recherche
    Technologies de l'information et des communications
    Conseil national de recherches Canada
    Tél. : 613-993-0610 | Téléc. : 613-952-0215
    alain.desilets@...<mailto:alain.desilets@...>

    Research Officer
    Information and Communication Technology
    National Research Council Canada
    Telephone: 613-993-0610 | Fax: 613-952-0215
    alain.desilets@...<mailto:alain.desilets@...>
  • Rahul Hampole
    Hi Alain I am running this query /ysearch/web?q=Multi Dialectal&format=xml&sites=hc-sc.gc.ca and only getting results from the requested domains Do you have
    Message 2 of 4 , Feb 20, 2013
    • 0 Attachment
      Hi Alain
      I am running this query /ysearch/web?q=Multi Dialectal&format=xml&sites=hc-sc.gc.ca and only getting results from the requested domains

      Do you have more examples where you are seeing this? I am not able to replicate it. 

      Thanks
      BOSS Team


      From: <Desilets>, Alain <alain.desilets@...>
      Reply-To: "ysearchboss@yahoogroups.com" <ysearchboss@yahoogroups.com>
      Date: Monday, February 18, 2013 6:48 AM
      To: "ysearchboss@yahoogroups.com" <ysearchboss@yahoogroups.com>
      Cc: "Stojanovic, Marta" <Marta.Stojanovic@...>, "Mathieu White (mwhite@...)" <mwhite@...>
      Subject: [ysearchboss] site: syntax broken [1 Attachment]

       

      Some time last week, it seems Yahoo BOSS stopped processing the site: syntax properly. Basically, it returns hits from sites that contain the requested string, but NOT NECESSARILY IN THE DOMAIN NAME part of the URL.
      For example, the following URL, which used to only return hits from hc-sc.gc.ca, now returns hits from a number of other sites.

      URL: http://yboss.yahooapis.com/ysearch/web?count=50&format=json&oauth_consumer_key=XXXXXX&oauth_nonce=XXXXX&oauth_signature=XXXXX&oauth_signature_method=HMAC-SHA1&oauth_timestamp=XXXX&oauth_version=1.0&q=%22Multi-dialectal%22%20language%3Aen&sites=hc-sc.gc.ca&start=0&type=html%2Cpdf

      Below is the list of sites for which Yahoo returned hits for the above query:

      ': { # HASH: : <*** DIFF FOUND HERE ***>foxwebspy.com => 1: : hc-sc.gc.ca => 1: : localwebitext => 1: : nerdydata.com => 1: : pppics.com => 1: : sec.ras-sad.hc-sc.gc.ca => 1: : torontohomesnow.beautifultorontohomes.com => 1: : webprod.hc-sc.gc.ca => 1: : webprod3.hc-sc.gc.ca => 1: : webstats-ranks.com => 1: : www.city32.com => 1: : www.consultations.hc-sc.gc.ca => 1: : www.ewhois.com => 1: : www.gserp.com => 1: : www.hc-sc.gc.ca => 1: : www.hc-sc.gc.ca.ipaddress.com => 1: : www.mywot.com => 1: : www.pageglimpse.com => 1: : www.seoprofiler.com => 1: : www.showsiteinfo.net => 1: : www.siteadvisor.com => 1: : www.siteglimpse.com => 1: : www.websecurityguard.com => 1: : www.xomreviews.com => 1: }'

      Looking at the content of the response, I notice that the hits found by Yahoo BOSS do have hc-sc.gc.ca in their URL. It's just not in the domain name part of the URL.

      For example, it returned the following URL:

      http:\/\/foxwebspy.com\/www\/hc-sc.gc.ca

      While I can imagine that some people might want to search for URLs that contain a particular string, I think this should be specified with another syntax, say inurl:. The site: syntax should only look in the domain name part of the URL no?

      Can you fix that as soon as possible? It's introducing serious bugs in our application.

      Thanks

      Alain Désilets

      Agent de recherche
      Technologies de l'information et des communications
      Conseil national de recherches Canada
      Tél. : 613-993-0610 | Téléc. : 613-952-0215
      alain.desilets@...alain.desilets@...>

      Research Officer
      Information and Communication Technology
      National Research Council Canada
      Telephone: 613-993-0610 | Fax: 613-952-0215
      alain.desilets@...alain.desilets@...>

    • Alan
      Hi Rahul, We have had a similar issue with the query http://yboss.yahooapis.com/ysearch/limitedweb?q=bangalore&sites=wikipedia.org,quora.com The results are as
      Message 3 of 4 , Feb 26, 2013
      • 0 Attachment
        Hi Rahul,

        We have had a similar issue with the query http://yboss.yahooapis.com/ysearch/limitedweb?q=bangalore&sites=wikipedia.org,quora.com

        The results are as follows
        http://dictionary.reference.com/browse/site
        - http://sites.google.com/
        - http://www.siteglobal.com/
        - http://site.aace.org/conf/
        - http://www.site.com/
        - http://en.wikipedia.org/wiki/Website
        - http://site.aace.org/
        - http://www.google.com/sites/
        - http://www.stanford.edu/group/SITE/
        - http://www.sitemeter.com/
        - http://www.web.com/
        - http://www.dia.mil/contracting/site/
        - http://your-site.com/

        Kindly advise.

        --- In ysearchboss@yahoogroups.com, Rahul Hampole <rhampole@...> wrote:
        >
        > Hi Alain
        > I am running this query /ysearch/web?q=Multi Dialectal&format=xml&sites=hc-sc.gc.ca and only getting results from the requested domains
        >
        > Do you have more examples where you are seeing this? I am not able to replicate it.
        >
        > Thanks
        > BOSS Team
        >
        >
        > From: <Desilets>, Alain <alain.desilets@...<mailto:alain.desilets@...>>
        > Reply-To: "ysearchboss@yahoogroups.com<mailto:ysearchboss@yahoogroups.com>" <ysearchboss@yahoogroups.com<mailto:ysearchboss@yahoogroups.com>>
        > Date: Monday, February 18, 2013 6:48 AM
        > To: "ysearchboss@yahoogroups.com<mailto:ysearchboss@yahoogroups.com>" <ysearchboss@yahoogroups.com<mailto:ysearchboss@yahoogroups.com>>
        > Cc: "Stojanovic, Marta" <Marta.Stojanovic@...<mailto:Marta.Stojanovic@...>>, "Mathieu White (mwhite@...<mailto:mwhite@...>)" <mwhite@...<mailto:mwhite@...>>
        > Subject: [ysearchboss] site: syntax broken [1 Attachment]
        >
        >
        > [Attachment(s) from Desilets, Alain included below]
        >
        > Some time last week, it seems Yahoo BOSS stopped processing the site: syntax properly. Basically, it returns hits from sites that contain the requested string, but NOT NECESSARILY IN THE DOMAIN NAME part of the URL.
        > For example, the following URL, which used to only return hits from hc-sc.gc.ca, now returns hits from a number of other sites.
        >
        > URL: http://yboss.yahooapis.com/ysearch/web?count=50&format=json&oauth_consumer_key=XXXXXX&oauth_nonce=XXXXX&oauth_signature=XXXXX&oauth_signature_method=HMAC-SHA1&oauth_timestamp=XXXX&oauth_version=1.0&q=%22Multi-dialectal%22%20language%3Aen&sites=hc-sc.gc.ca&start=0&type=html%2Cpdf
        >
        > Below is the list of sites for which Yahoo returned hits for the above query:
        >
        > ': { # HASH: : <*** DIFF FOUND HERE ***>foxwebspy.com => 1: : hc-sc.gc.ca => 1: : localwebitext => 1: : nerdydata.com => 1: : pppics.com => 1: : sec.ras-sad.hc-sc.gc.ca => 1: : torontohomesnow.beautifultorontohomes.com => 1: : webprod.hc-sc.gc.ca => 1: : webprod3.hc-sc.gc.ca => 1: : webstats-ranks.com => 1: : www.city32.com => 1: : www.consultations.hc-sc.gc.ca => 1: : www.ewhois.com => 1: : www.gserp.com => 1: : www.hc-sc.gc.ca => 1: : www.hc-sc.gc.ca.ipaddress.com => 1: : www.mywot.com => 1: : www.pageglimpse.com => 1: : www.seoprofiler.com => 1: : www.showsiteinfo.net => 1: : www.siteadvisor.com => 1: : www.siteglimpse.com => 1: : www.websecurityguard.com => 1: : www.xomreviews.com => 1: }'
        >
        > Looking at the content of the response, I notice that the hits found by Yahoo BOSS do have hc-sc.gc.ca in their URL. It's just not in the domain name part of the URL.
        >
        > For example, it returned the following URL:
        >
        > http:\/\/foxwebspy.com\/www\/hc-sc.gc.ca
        >
        > While I can imagine that some people might want to search for URLs that contain a particular string, I think this should be specified with another syntax, say inurl:. The site: syntax should only look in the domain name part of the URL no?
        >
        > Can you fix that as soon as possible? It's introducing serious bugs in our application.
        >
        > Thanks
        >
        > Alain Désilets
        >
        > Agent de recherche
        > Technologies de l'information et des communications
        > Conseil national de recherches Canada
        > Tél. : 613-993-0610 | Téléc. : 613-952-0215
        > alain.desilets@...<mailto:alain.desilets%40nrc-cnrc.gc.ca>alain.desilets@...<mailto:alain.desilets%40nrc-cnrc.gc.ca>>
        >
        > Research Officer
        > Information and Communication Technology
        > National Research Council Canada
        > Telephone: 613-993-0610 | Fax: 613-952-0215
        > alain.desilets@...<mailto:alain.desilets%40nrc-cnrc.gc.ca>alain.desilets@...<mailto:alain.desilets%40nrc-cnrc.gc.ca>>
        >
      • Paymon
        Hi Alen, using your query: http://yboss.yahooapis.com/ysearch/limitedweb?q=bangalore&sites=wikipedia.org,quora.com we get the desired results for the domains
        Message 4 of 4 , Feb 26, 2013
        • 0 Attachment
          Hi Alen,

          we get the desired results for the domains that are site restricted. 

          below is a snippet of the results:

          http://en.wikipedia.org/wiki/Bangalore
          http://en.wikipedia.org/wiki/Whitefield,_India
          http://en.wikipedia.org/wiki/Jayanagar,_Bangalore

          http://www.quora.com/Bengaluru-Karnataka-India
          http://india-bangalore.quora.com/What-are-some-really-good-places-to-go-for-a-dinner-date-in-Bangalore-2
          http://blrstartups.quora.com


          Thanks
          BOSS Team





          From: Alan <alan@...>
          To: ysearchboss@yahoogroups.com
          Sent: Tuesday, February 26, 2013 4:08 AM
          Subject: [ysearchboss] Re: site: syntax broken

          Hi Rahul,

          We have had a similar issue with the query http://yboss.yahooapis.com/ysearch/limitedweb?q=bangalore&sites=wikipedia.org,quora.com

          The results are as follows
          http://dictionary.reference.com/browse/site
          - http://sites.google.com/
          - http://www.siteglobal.com/
          - http://site.aace.org/conf/
          - http://www.site.com/
          - http://en.wikipedia.org/wiki/Website
          - http://site.aace.org/
          - http://www.google.com/sites/
          - http://www.stanford.edu/group/SITE/
          - http://www.sitemeter.com/
          - http://www.web.com/
          - http://www.dia.mil/contracting/site/
          - http://your-site.com/

          Kindly advise.

          --- In ysearchboss@yahoogroups.com, Rahul Hampole <rhampole@...> wrote:
          >
          > Hi Alain
          > I am running this query /ysearch/web?q=Multi Dialectal&format=xml&sites=hc-sc.gc.ca and only getting results from the requested domains
          >
          > Do you have more examples where you are seeing this? I am not able to replicate it.
          >
          > Thanks
          > BOSS Team
          >
          >
          > From: <Desilets>, Alain <alain.desilets@...<mailto:alain.desilets@...>>
          > Reply-To: "ysearchboss@yahoogroups.com<mailto:ysearchboss@yahoogroups.com>" <ysearchboss@yahoogroups.com<mailto:ysearchboss@yahoogroups.com>>
          > Date: Monday, February 18, 2013 6:48 AM
          > To: "ysearchboss@yahoogroups.com<mailto:ysearchboss@yahoogroups.com>" <ysearchboss@yahoogroups.com<mailto:ysearchboss@yahoogroups.com>>
          > Cc: "Stojanovic, Marta" <Marta.Stojanovic@...<mailto:Marta.Stojanovic@...>>, "Mathieu White (mwhite@...<mailto:mwhite@...>)" <mwhite@...<mailto:mwhite@...>>
          > Subject: [ysearchboss] site: syntax broken [1 Attachment]
          >
          >
          > [Attachment(s) from Desilets, Alain included below]
          >
          > Some time last week, it seems Yahoo BOSS stopped processing the site: syntax properly. Basically, it returns hits from sites that contain the requested string, but NOT NECESSARILY IN THE DOMAIN NAME part of the URL.
          > For example, the following URL, which used to only return hits from hc-sc.gc.ca, now returns hits from a number of other sites.
          >
          > URL: http://yboss.yahooapis.com/ysearch/web?count=50&format=json&oauth_consumer_key=XXXXXX&oauth_nonce=XXXXX&oauth_signature=XXXXX&oauth_signature_method=HMAC-SHA1&oauth_timestamp=XXXX&oauth_version=1.0&q=%22Multi-dialectal%22%20language%3Aen&sites=hc-sc.gc.ca&start=0&type=html%2Cpdf
          >
          > Below is the list of sites for which Yahoo returned hits for the above query:
          >
          > ': { # HASH: : <*** DIFF FOUND HERE ***>foxwebspy.com => 1: : hc-sc.gc.ca => 1: : localwebitext => 1: : nerdydata.com => 1: : pppics.com => 1: : sec.ras-sad.hc-sc.gc.ca => 1: : torontohomesnow.beautifultorontohomes.com => 1: : webprod.hc-sc.gc.ca => 1: : webprod3.hc-sc.gc.ca => 1: : webstats-ranks.com => 1: : www.city32.com => 1: : www.consultations.hc-sc.gc.ca => 1: : www.ewhois.com => 1: : www.gserp.com => 1: : www.hc-sc.gc.ca => 1: : www.hc-sc.gc.ca.ipaddress.com => 1: : www.mywot.com => 1: : www.pageglimpse.com => 1: : www.seoprofiler.com => 1: : www.showsiteinfo.net => 1: : www.siteadvisor.com => 1: : www.siteglimpse.com => 1: : www.websecurityguard.com => 1: : www.xomreviews.com => 1: }'
          >
          > Looking at the content of the response, I notice that the hits found by Yahoo BOSS do have hc-sc.gc.ca in their URL. It's just not in the domain name part of the URL.
          >
          > For example, it returned the following URL:
          >
          > http:\/\/foxwebspy.com\/www\/hc-sc.gc.ca
          >
          > While I can imagine that some people might want to search for URLs that contain a particular string, I think this should be specified with another syntax, say inurl:. The site: syntax should only look in the domain name part of the URL no?
          >
          > Can you fix that as soon as possible? It's introducing serious bugs in our application.
          >
          > Thanks
          >
          > Alain Désilets
          >
          > Agent de recherche
          > Technologies de l'information et des communications
          > Conseil national de recherches Canada
          > Tél. : 613-993-0610 | Téléc. : 613-952-0215
          > alain.desilets@...<mailto:alain.desilets%40nrc-cnrc.gc.ca>alain.desilets@...<mailto:alain.desilets%40nrc-cnrc.gc.ca>>
          >
          > Research Officer
          > Information and Communication Technology
          > National Research Council Canada
          > Telephone: 613-993-0610 | Fax: 613-952-0215
          > alain.desilets@...<mailto:alain.desilets%40nrc-cnrc.gc.ca>alain.desilets@...<mailto:alain.desilets%40nrc-cnrc.gc.ca>>
          >



          ------------------------------------

          Yahoo! Groups Links

          <*> To visit your group on the web, go to:
              http://groups.yahoo.com/group/ysearchboss/

          <*> Your email settings:
              Individual Email | Traditional

          <*> To change settings online go to:
              http://groups.yahoo.com/group/ysearchboss/join
              (Yahoo! ID required)

          <*> To change settings via email:
              ysearchboss-digest@yahoogroups.com
              ysearchboss-fullfeatured@yahoogroups.com

          <*> To unsubscribe from this group, send an email to:
              ysearchboss-unsubscribe@yahoogroups.com

          <*> Your use of Yahoo! Groups is subject to:
              http://docs.yahoo.com/info/terms/



        Your message has been successfully submitted and would be delivered to recipients shortly.