63 posts categorized "Microsoft"

November 22, 2011

Webinar: Improving SharePoint search with the FAST indexing pipeline

For those of you still at your desks this short Thanksgiving week, you might be interested in a webinar we'll be doing with our partner SurfRay early next month.

"Everyone knows that great metadata is key to a great user search experience, but what can you do if your existing content falls short? The FAST Search for SharePoint pipeline provides a way to enhance document metadata during the indexing process so your content has better metadata and users will experience better search results.

During the webinar we’ll talk about what the pipeline is, give examples of how it can improve your metadata, and describe some real-world scenarios where having access to the pipeline resulted in better search quality and happier users."

How can the indexing pipeline improve search quality? You'll have to come to the webinar to hear our take, but a hint: you can add and improve metadata to the document during the indexing process - which means better search.

The webinar is planned for Friday, December 9 at 2PM Eastern/11AM Pacific.  You can register for the event now.

7 things GMail Search needs to change

My General Complaint:

If you've had a gmail account for many years, either for work or personal, it's getting large enough that GMail's search is starting to break.

Anything word you can think of to type in will match tons of useless results.  Eventually, as you try to think of more words to add, your results count goes to zero.

If you were lucky enough to have starred the email when you saw it, or can remember who might have sent it, or maybe the approximate timeframe, or maybe you think you might have sent the email in question from this account, you *might* have a chance.

A Tough Problem:

I realize this seems like classic precision and recall troubles, but Google is pretty smart, and they a fair amount of metadata, and a lot of context about me, so there's some potential fixes to hang a hat on.

And some of my ideas involve making labels/tags (Gmail's equivalent of folders), but that assumes that people are using labels, which I suspect many folks don't, or at least not beyond the default ones you get.  Well... sure, but they DO have them, and there's an automated rules engine in Gmail to set them, so presumably a few people use tags / labels?  (or maybe nobody does and, in hindsight, maybe a legacy feature!?) So, if you're going to have labels, and you've got even a few users who both with them, then make them as useful as possible.  AND maybe make Labels more visible, maybe easier to set, more powerful, etc.

On To The Ideas:

1: Make it easier to refine search results.

Let's face it, as you accumulate more and more email, the odds of finding the email you want on the first screen of search results goes WAY down.

Google wisely uses most-recent-first sorting in search results, vs. their normal relevancy, in the GMail search UI.  I'm not sure why, this seems like an odd choice for them given all the bravado about Google's relevancy, but I'm guessing it was too weird to have email normally sorted by date in most parts of the UI, but have it switch back and forth between relevancy and date as you alternate between search and normal browsing.  Also, maybe they found it's more likely you're looking f or a very recent email.  You could fold "freshness" into relevancy calculations, but just respecting date keeps it more consistent.

Yes, GMail does have some search options... I'll get to those, but suffice to say they are very "non iterative".

Other traditional filters should be facets as well.  "Sent" emails, date ranges, "has attachments" (maybe even how many, sizes, or types)

2: Promote form-based "Search options" to FULL Facets

You can limit your search to a subset of your email if you've Labeled it - this is the GMail equivalent of Folders.  But doing this is a hassle (see item 3), and you can't do this after the fact, once you're looking at results.

So, if you do normal text search, and then remember you labeled it, you can't just click on the tags on the left of the results.  Those are for browsing, and will actually clear out you search terms.  These should be clickable drilldown facets, perhaps even with match counts in the parenthesis, and maybe some stylizing to make it clear that they will affect the current search results.

Yes, there's a syntax you can use:

lebal:your-label regular search terms

It's a nice option for advanced users who are accurate touch typists and remember the tag name they want, but this should also be easy from the UI.  Yes, there is an advanced search / search options forms, but this brings me to item 3...

(read the rest of the ideas after the break)

Continue reading "7 things GMail Search needs to change" »

November 08, 2011

Are you spending too much on enterprise search?

If your organization uses enterprise search, or if you are in the market for a new search platform, you may want to attend our webinar next week "Are you spending too much for search?". The one hour session will address:

  • What do users expect?
  • Why not just use Google?
  • How much search do you need?
  • Is an RFI a waste of time?   

Date: Wednesday, November 16 2011

Time: 11AM Pacific Standard Time / 1900 UTC

Register today!

August 22, 2011

Searching for Sarah at SharePoint Conference 2011

Just noticed one of the most interesting sessions at last May's Enterprise Search Summit is coming to the October Microsoft SharePoint Conference! We blogged about it back in May.

Basically, Booz & Company did an evaluation of SharePoint 2010 search - FAST Search for SharePoint as I recall - versus the Google Search Appliance they had been using. At one point, the search business owner was trying to find the last name of a woman she had met in the firm; and when she searched for 'Sarah', hoping to find her in the directory, the GSA returned 60 men in the result list. Can you guess why? A hint: metadata (check the earlier article, or come to SPC 2011 to find out).

Now in fact, we think the GSA could have been tuned to emulate this OOB behavior by SharePoint; but this is a reminder that not every search platform works great in every environment. Buyer beware!

Ever had a similar experience? Let us know about it!

 

August 02, 2011

Connecting Google to SharePoint 2010: White Paper

Ba_insight_logo NIE partner BA Insight will soon be releasing a white paper highlighting key differences between SharePoint 2010 search and the Google Search Appliance.

The early draft we've seen of Google & Microsoft Enterprise Search Product Comparison, and can talk about some of the discussion points.

Update: The white paper is now available! Get it now.

Generally, the research paper discusses the following topics:

1. Content: Crawling, indexing, security, and connectors

2. Query processing: Manual and automatic relevance tuning; actionable results; and layout design

3. Vendor 'intangibles': Maintenance, support, vendor stability, licensing and the partner eco-system

BA-Insight is a large Microsoft partner, and their DNA reflects it. But many customers use Google Search Appliances with SharePoint, and this comparison is nicely done. Keep an eye on their site or stay tuned for updates here when the research paper is available.

And let us know what's on your mind with respect to enterprise search - leave a comment!

 

 

July 27, 2011

Great 'site documentation' toolkit for SharePoint

As in much of the IT world, when you build a new system or farm, you need to document everything for posterity - and to save potentially hours of effort when things stop working in the future. SharePoint 2010 is no different.

Doc_kit I just learned of a tool that will document your SharePoint farm(s) for you, greatly easing the problem most of us will face at one point or another. From the folks at Acceleratio Ltd based in Croatia but with  US sales and support, the folks who created the Documentation Toolkit for SharePoint understand both the problem - and their audience. Pricing is available per farm ($299 US) or, for consulting firms that maintain multiple farms for their users, for $499.

There is a 30 day free evaluation/download; and using the claim code 'summer2011' you can get 50% of the purchase price through the end of August.

Normally we would have posted this on SearchComponentsOnline.com, another site we run; but it's pretty new, and not as many people know to look there for free and low cost tools for those interested in enterprise search.

 

July 15, 2011

Musings on FAST and Microsoft

I started the draft for this post a week ago yesterday morning, but between interruptions and such, had not finished it by late afternoon, so I put it away with the intent to finish it later. At 530PM or so, I got a text message from a very good friend of mine who happens to work for FAST back east, telling me about the layoff. When we spoke a few minutes later, I heard that just about all the former FAST folks we knew at Microsoft – sales folks, partner reps, and systems engineers, East Coast, Midwest, and west, were gone. I didn’t confirm it, but I’d imagine some of the same sorts of folks in Europe are in the same boat.  I know some other folks really well, and messaged them to give them my support and offer to do whatever I could to help them. Out of a dozen good friends we’ve worked with, only one seems to have his job today.

Layoffs are vicious on the people and on the company; but honestly this event doesn’t really change the future I had imagined for FAST and Microsoft. It accelerates the timeline of course; but no matter how great these people are, sales folks are not the heart of the product. And Microsoft still has good sales guys who can fill in and solve problems for customers.

A few weeks back, one of the FAST folks I know told me how different it was selling FAST as part of a Microsoft sales team. Apparently, the Microsoft lead sets the agenda, decided what products to present, and was basically in charge of the sale. That makes business sense, of course; but the former FAST sales guy said selling with Microsoft was not like the old FAST days – taking on the world, with a product that was great for so many demanding customers, if not the price/performance leader for many kinds of ‘average’ search applications. FAST Search for SharePoint is going into markets that need modern features and capabilities, but Microsoft just wasn’t going into accounts that needed heavy duty, industrial strength search.

So, is this layoff an indication that Microsoft has decided to write off its investment in FAST Search and Transfer?

I’d have to say the answer is ‘no way’. I’m guessing that Microsoft saw that MOSS 2007 was not industrial strength search; customers were unhappy, and Microsoft were seeing companies like FAST, Google, Exalead, Attivio and Lucid Imagination selling successfully on the SharePoint 2007 platform. Ironically, I don’t think they saw much of Autonomy, which has such a tight laser-like focus on eDiscovery.

You can say what you will about Microsoft, but when their customers start to complain, Microsoft tends to get moving. They are not always successful out of the gate, but they stick with it. And Microsoft went shopping.

FAST had some real stand-out capabilities. The first one we usually talk to customers about is predictable and massive scalability. I don’t think we’ve ever seen a data problem too big for FAST ESP. It may take a bunch of servers, but FAST could tell you just how many you needed for a given data size and query volume. That is important to Microsoft, because their vision for SharePoint 2010 and beyond is as the content management repository of choice for some really large companies with really big data.

Another useful feature is a visible and well-defined indexing pipeline which makes it really easy to fix up really nasty data. If you needed to augment metadata with call-outs to external sources, or just needed to do some custom markup, the pipeline made it easy. In many technologies – I’m thinking here of the Old Verity K2 approach – you had to process “bulk insert files” after the crawler was done, but before the index was built. Sometimes that was very messy and hard to debug.

The FAST pipeline was never very well documented – but there were examples, and simple pipeline stages were pretty easy to write. We’ve even documented the use of a pipeline stage written as a Windows BAT file – although we don’t recommend it for production environments because of performance issues! A powerful pipeline architecture is a great way to consolidate social content with enterprise search.

FAST also had a powerful filtering capability in FQL and a structure that really enables personalization (see the console, below). Facets, tagging, filtering, and relevance were all built deep into the technology; and while it may not be very easy to implement, it was great to behold when a project was complete. All of this really facilitates the inclusion of ‘social elements with search.

The next capability that we really liked was not unique to FAST – in fact, I think it was first implemented by Endeca: interactive consoles for both IT staff and for the business line owner. No longer did you need a developer to tweak the relevancy – the business line owner could do it interactively. Add a new data source? An IT person logs into the console, point and click, and bingo. You did have to plan in advance for some of this; but it was possible to do without scripts and hacks. In SharePoint, IT and business management is console driven; so FAST was a win here as well.

One final thing Microsoft got from FAST is engineers who understand the challenges of search over really big data. I don’t think it’s a coincidence that Microsoft located its search technology research center in Norway. No doubt these folks are working on all aspects of search and usability, from the desktop all the way to Bing. And what these folks create will migrate into Microsoft’s key platform: SharePoint.

When Microsoft acquired FAST, they got all of these bits of technology and more: a vision of future search, including the capability to manage both the indexing process and the search process with powerful graphical tools. The delivery of these capabilities comes in Content Transformation Services (CTS) and the Interaction Management Studio (IMS). Together, these provide even finer control for IT/developers and business-line owners to manage search using graphic, interactive tools – think of Visio. Add a new data source? Drag and drop it into the system. Federate content from the web? Drop in the federation tool, set a few parameters and it’s done. It was darned near a web part already!

As of the acquisition, SharePoint 2010 was well into development, so today Microsoft currently offers three products: Microsoft Search Server – the MOSS 2007 replacement; FAST ESP (packaged as FSIS and FSIA); and FAST Search for SharePoint. The technology in this hybrid offers a clue for the direction Microsoft is going with the FAST technology. In fact, the next SharePoint may have only ‘basic search’ and what we’ll recognize as FAST.

So while it’s sad to see good folks lose their jobs; and while it may look like a series of poor decisions form the outside, I expect Microsoft is going to benefit from all that money spent up in Norway a few years ago. FAST, the huge, complex, expensive, and quirky search engine may not ever be seen again. That’s not where the mass market is, the mass market is where Microsoft makes its bread and butter.

But search to Microsoft will be SharePoint; they will meet the needs to a majority of the companies that need to find content; and they will push the envelope in search for the masses. There will also be ‘search’ companies that will serve the needs of companies that have demanding search needs. Lucene/Solr is very well positioned for that. But there are others. And Microsoft will take the middle masses, the sweet spot that needs flexible, pretty darned good search that can be managed easily for diverse corporate content repositories.

July 12, 2011

Inopportune time for a page not found: Microsoft

There has been an active discussion on the LinkedIn Enterprise Search Professionals Group since the news last week that Microsoft has laid off a number of sales and system engineering folks, at least in North America. Many are wondering if this marks the end of the FAST products, or whether Microsoft is writing off their $1.2B investment to acquire FAST a few years back.

I think the answer to both of these is 'no way'. Much of the FAST technology found its way into FAST Search for SharePoint, a hybrid between Microsoft search and FAST ESP. Microsoft have released two exciting new capabilities in FSIS (CTS and IMS), and they continue to run the Microsoft Research Center in Norway, where many of the FAST engineers are active participants.

All of that said, this morning I was surfing the interweb for some Microsoft proficiency tests, so I went to Google and searched

microsoft enterprise search proficiency

The first organic result looked promising:

Ms_google_results
But when I click on the link, I got this on my browser:

Not_found Oops!

Now we've all had bad links on our web sites, and even occasional outages - but given the concern some people have about the layoffs of FAST sales reps, this is a bit of an inopportune time, yes?

All of my browsers produced the error, and using the old telnet trick to port 80 or the WGET utility showed the page really did exist with no obvious re-directs or errors. So I drafted our CTO Mark Bennett to help, and after some analysis, he found the problem - in the scripts on the page: bad scripting is the culprit.

Yep - with apologies to the Buggles, JavaScript killed the Microsoft page. Disabling scripts in the browser lets the link work just fine. It just shows you that, even with all the resources in the world, lack of attention to detail will get you!

I'll be in LA at the Microsoft Worldwide Partner Conference tomorrow; it should be interesting!

 

 

 

July 07, 2011

Webinar: Customizing the SharePoint Advanced Search Page

Sorry for the late notice - I just discovered it myself today.

Josh Noble, SurfRay consultant and author of Pro SharePoint 2010 Search, will give a webinar on Customizing the Advance Search Page on July 8 at 11AM Pacific, 2PM Eastern. If you're in Europe, it's worth staying up for!

Josh gave a related talk at SharePoint Saturday Sacramento a couple of weeks ago, and he really had some great tips and techniques. Register for the webinar now.

/s/Miles

 

 

July 06, 2011

Breaking news: FAST folks laid off from Microsoft

Just learned that most of the FAST people we work with here in California and across the country have been laid off by Microsoft, apparently effective immediately. This is the team that was responsible for selling the FAST ESP products - FSIS and FSIA - as well as working with the Microsoft sales teams on Fast Search for SharePoint (FS4SP). Funny, I was just drafting a blog post today on 'the future of FAST' and I'm glad I hadn't finished; I never would have guessed this at all.

Microsoft Worldwide Partner Conference next week; be interesting to see what they have to say. Stay tuned..

/s/Miles